Crisis detection from Arabic tweets

Alaa Ali H Alharbi; Mark Lee

Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Social media (SM) platforms such as Twitter offer a rich source of real-time information about crises from which useful information can be extracted to support situational awareness. The task of automatically identifying SM messages related to a specific event poses many challenges, including processing large volumes of short, noisy data in real time. This paper explored the problem of extracting crisis-related messages from Arabic Twitter data. We focused on high-risk floods as they are one of the main hazards in the Middle East. In this work, we presented a gold-standard Arabic Twitter corpus for four high-risk floods that occurred in 2018. Using the annotated dataset, we investigated the performance of different classical machine learning (ML) and deep neural network (DNN) classifiers. The results showed that deep learning is promising in identifying flood-related posts.

Original language	English
Title of host publication	Proceedings of the 3rd Workshop on Arabic Corpus Linguistics
Publisher	Association for Computational Linguistics, ACL
Pages	72-79
Number of pages	8
ISBN (Electronic)	978-1-950737-32-1
Publication status	Published - 22 Jun 2019
Event	The 3rd Workshop on Arabic Corpus Linguistics (WACL-3) - Cardiff, United Kingdom Duration: 22 Jul 2019 → 24 Jul 2019

Conference

Conference	The 3rd Workshop on Arabic Corpus Linguistics (WACL-3)
Abbreviated title	WACL-3
Country/Territory	United Kingdom
City	Cardiff
Period	22/07/19 → 24/07/19

Access to Document

Alharbi_&_Lee_Crisis_detection_from_Arabic_tweets_Proceedings_of_the_3rd_Workshop_on_Arabic_Corpus_Linguistics_2019
Checked for eligibility: 04/09/2019
Final published version, 120 KB
Licence: Creative Commons: Attribution (CC BY)

https://www.aclweb.org/anthology/W19-5609
Licence: Creative Commons: Attribution (CC BY)

Cite this

@inproceedings{9d02a0d5c0c94feca65616a559c06de8,

title = "Crisis detection from Arabic tweets",

abstract = "Social media (SM) platforms such as Twitter offer a rich source of real-time information about crises from which useful information can be extracted to support situational awareness. The task of automatically identifying SM messages related to a specific event poses many challenges, including processing large volumes of short, noisy data in real time. This paper explored the problem of extracting crisis-related messages from Arabic Twitter data. We focused on high-risk floods as they are one of the main hazards in the Middle East. In this work, we presented a gold-standard Arabic Twitter corpus for four high-risk floods that occurred in 2018. Using the annotated dataset, we investigated the performance of different classical machine learning (ML) and deep neural network (DNN) classifiers. The results showed that deep learning is promising in identifying flood-related posts.",

author = "Alharbi, {Alaa Ali H} and Mark Lee",

year = "2019",

month = jun,

day = "22",

language = "English",

pages = "72--79",

booktitle = "Proceedings of the 3rd Workshop on Arabic Corpus Linguistics",

publisher = "Association for Computational Linguistics, ACL",

note = "The 3rd Workshop on Arabic Corpus Linguistics (WACL-3), WACL-3 ; Conference date: 22-07-2019 Through 24-07-2019",

}

TY - GEN

T1 - Crisis detection from Arabic tweets

AU - Alharbi, Alaa Ali H

AU - Lee, Mark

PY - 2019/6/22

Y1 - 2019/6/22

N2 - Social media (SM) platforms such as Twitter offer a rich source of real-time information about crises from which useful information can be extracted to support situational awareness. The task of automatically identifying SM messages related to a specific event poses many challenges, including processing large volumes of short, noisy data in real time. This paper explored the problem of extracting crisis-related messages from Arabic Twitter data. We focused on high-risk floods as they are one of the main hazards in the Middle East. In this work, we presented a gold-standard Arabic Twitter corpus for four high-risk floods that occurred in 2018. Using the annotated dataset, we investigated the performance of different classical machine learning (ML) and deep neural network (DNN) classifiers. The results showed that deep learning is promising in identifying flood-related posts.

AB - Social media (SM) platforms such as Twitter offer a rich source of real-time information about crises from which useful information can be extracted to support situational awareness. The task of automatically identifying SM messages related to a specific event poses many challenges, including processing large volumes of short, noisy data in real time. This paper explored the problem of extracting crisis-related messages from Arabic Twitter data. We focused on high-risk floods as they are one of the main hazards in the Middle East. In this work, we presented a gold-standard Arabic Twitter corpus for four high-risk floods that occurred in 2018. Using the annotated dataset, we investigated the performance of different classical machine learning (ML) and deep neural network (DNN) classifiers. The results showed that deep learning is promising in identifying flood-related posts.

M3 - Conference contribution

SP - 72

EP - 79

BT - Proceedings of the 3rd Workshop on Arabic Corpus Linguistics

PB - Association for Computational Linguistics, ACL

T2 - The 3rd Workshop on Arabic Corpus Linguistics (WACL-3)

Y2 - 22 July 2019 through 24 July 2019

ER -
