Events Tagging in Twitter Using Twitter Latent Dirichlet Allocation

Ghaidaa A. Al-Sultany; Hiba J. Aleqabie

doi:10.14419/ijet.v7i4.19.28065

Authors

Ghaidaa A. Al-Sultany
Hiba J. Aleqabie

Received date: March 1, 2019

Accepted date: March 1, 2019

Published date: November 27, 2018

DOI:

https://doi.org/10.14419/ijet.v7i4.19.28065

Keywords:

Twitter, TLDA, PMI, and Perplexity.

Abstract

Twitter has become a great platform to publish and carrying news, advisements, events, topics and even daily events in our lives. Twitter Post has limitations on the length and noise. These limitations make that the post is unsuitable for topic modeling due to sparsity.Â Â In this paper, Twitter Latent Dirichlet allocation (TLDA) methodÂ for topics modeling was applied to overcome the sparsity problem of tweets modeling. Many steps were implemented for eventÂ tagging on Twitter. First: construct aÂ datasetÂ by hashtag pooling technique, and then theÂ preprocessingÂ was performed to extract the features.Â Secondly, find the suitable number of topics through Perplexity criterion, then,Â the topics are labeled by WordNet lexicon.Â Finally,Â events are tagging using Pricewise Mutual Information (PMI) criterion.Â The dataset is constructed about various topics including the American elections, Football world cup 2018, and a natural phenomenon and many others; the number of tweets is 63458. This study shows good results in training tweets dataset.
Â

References

[1] A. O. Steinskog, J. F. Therkelsen, and B. GambÃ¤ck, â€œTwitter Topic Modeling by Tweet Aggregation,â€ Proc. 21st Nord. Conf. Comput. Linguist., no. May, pp. 77â€“86, 2017.
[2] H. Cai, Y. Yang, X. Li, and Z. Huang, â€œWhat are Popular : Exploring Twitter Features for Event Detection , Tracking and Visualization,â€ MM â€™15 Proc. 23rd ACM Int. Conf. Multimed., pp. 89â€“98, 2015.
[3] X. Zhao, J. Jiang, and W. X. Zhao, â€œAn Empirical Comparison of Topics in Twitter and Traditional Media,â€ Singapore Manag. Univ. Sch. Inf. Syst. Tech. Pap. Ser., 2011.
[4] R. Mehrotra, S. Sanner, W. Buntine, and L. Xie, â€œImproving LDA topic models for microblogs via tweet pooling and automatic labeling,â€ Proc. 36th Int. ACM SIGIR Conf. Res. Dev. Inf. Retr. - SIGIR â€™13, p. 889, 2013.
[5] D. Alvarez-Melis and M. Saveski, â€œTopic Modeling in Twitter: Aggregating Tweets by Conversations,â€ $Icwsm16, no. Icwsm, pp. 519â€“522, 2016.
[6] W. D. Penniman, Social Informatics, vol. 6430. 2010.
[7] H. Kwak, C. Lee, H. Park, and S. Moon, â€œWhat is Twitter , a Social Network or a News Media?,â€ Int. World Wide Web Conf. Comm., pp. 1â€“10, 2010.
[8] K. Sarkar and R. Law, â€œA Novel Approach to Document Classification using WordNet,â€ arXiv1510.02755 [cs], pp. 1â€“14, 2015.
[9] G. Ifrim, B. Shi, and I. Brigadir, â€œEvent detection in Twitter using aggressive filtering and hierarchical tweet clustering,â€ CEUR Workshop Proc., vol. 1150, pp. 33â€“40, 2014.
[10] L. Liu, L. Tang, W. Dong, S. Yao, and W. Zhou, â€œAn overview of topic modeling and its current applications in bioinformatics,â€ Springerplus, vol. 5, no. 1, 2016.
[11] D. A. Ostrowski, â€œUsing latent dirichlet allocation for topic modelling in twitter,â€ Proc. 2015 IEEE 9th Int. Conf. Semant. Comput. IEEE ICSC 2015, pp. 493â€“497, 2015.
[12] X. Wan and T. Wang, â€œAutomatic Labeling of Topic Models Using Text Summaries,â€ Proc. 54th Annu. Meet. Assoc. Comput. Linguist. (Volume 1 Long Pap., pp. 2297â€“2305, 2016.
[13] C. C. MuÅŸat, Åž. TrÇŽuÅŸan-Matu, J. Velcin, and M.-A. Rizoiu, â€œAutomatic extraction of conceptual labels from topic models,â€ UPB Sci. Bull. Ser. C Electr. Eng., vol. 74, no. 2, pp. 57â€“68, 2012.
[14] A. Huang, R. Lehavy, A. Zang, and R. Zheng, â€œAnalyst Information Discovery and Interpretation Roles: A Topic Modeling Approach,â€ Ssrn, 2014.
[15] W. X. Zhao et al., â€œTopical keyphrase extraction from Twitter,â€ Proc. 49th Annu. Meet. Assoc. Comput. Linguist. Hum. Lang. Technol. 1, pp. 379â€“388, 2011.

Events Tagging in Twitter Using Twitter Latent Dirichlet Allocation

Authors

Ghaidaa A. Al-Sultany

Hiba J. Aleqabie

How to Cite

DOI:

Keywords:

Abstract

References

Downloads

How to Cite