Abstract—The Philippine archipelago is one of the most disaster-prone area due to its location. As Twitter grows everyday, it has now become a valuable source of people’s opinion. The main purpose of the study is to use Twitter, as text corpora in the attainment of disaster risk reduction. Earthquake related tweets posted from July 1, 2017 to August 31, 2017 were programmatically collected. Data cleaning was made by removing noisy words, hence, from 90,692 collected tweets this resulted to 41,500 cleaned tweets. Topic modeling identifies patterns in a corpus. Findings revealed that there is a need to strengthen the conduct of earthquake drill and early warning notices to lessen the vulnerability of people and property, and reduce the causal damages of earthquake, thus improve its community resiliency. Topic results shall be discussed with concerned agencies for its possible consideration and inclusion to enhance the earthquake-related disaster management plan.
Index Terms—Topic modeling, corpus analysis, tweets, earthquake.
Lany L. Maceda and Jennifer L. Llovido are with the Bicol University College of Science, Computer Science and Information Technology Department, Legazpi City, Philippines (e-mail: firstname.lastname@example.org, email@example.com).
Dr. Thelma D. Palaoag is with the University of the Cordilleras College of Information Technology and Computer Science, Baguio City, Philippines (e-mail: firstname.lastname@example.org).
Cite: Lany L. Maceda, Jennifer L. Llovido, and Thelma D. Palaoag, "Corpus Analysis of Earthquake Related Tweets through Topic Modelling," International Journal of Machine Learning and Computing vol. 7, no. 6, pp. 194-197, 2017.