Categorisation of Tweets Using Ensemble Classification Methods


  • S Mohanavalli
  • S Karthika
  • Srividya .
  • K R.Uthayan
  • N Sandya





Keywords: Categorization, Twitter Analysis, LIBLINEAR, Naïve Bayes, SVM.


Twitter is a micro-blogging site that lets users exchange short messages. It is used predominantly in fields such as business, healthcare, education and national security, and a large number of users rely on it to post real-time information and to express sentiment. The objective of this paper is to automate the classification of tweets into particular categories using machine learning algorithms such as Naïve Bayes, SVM, and a linear regression model. The proposed ensemble model aims to improve on the performance metrics of these individual algorithms. The paper presents a comparative study of the algorithms used for tweet classification and discusses the results.
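The abstract does not say how the ensemble combines its base classifiers. As a minimal sketch, assuming simple majority (hard) voting, the idea could look like the Python below. The names `majority_vote` and `keyword_classifier`, the keyword sets, and the category labels are all hypothetical; the keyword matchers are toy stand-ins for trained Naïve Bayes, SVM, and linear models, not the authors' implementation.

```python
from collections import Counter
from typing import Callable, Dict, Sequence, Set

# A classifier here is any function mapping a tweet to a category label.
Classifier = Callable[[str], str]


def majority_vote(classifiers: Sequence[Classifier], tweet: str) -> str:
    """Return the label predicted by the most classifiers.

    Ties go to the label that received its first vote earliest,
    following the order of `classifiers`.
    """
    votes = [clf(tweet) for clf in classifiers]
    return Counter(votes).most_common(1)[0][0]


def keyword_classifier(keywords: Dict[str, Set[str]], default: str) -> Classifier:
    """Build a toy classifier (hypothetical stand-in for a trained model).

    It picks the label whose keyword set overlaps most with the tweet's
    words, or `default` when nothing matches.
    """
    def predict(tweet: str) -> str:
        words = set(tweet.lower().split())
        best_label, best_hits = default, 0
        for label, vocab in keywords.items():
            hits = len(words & vocab)
            if hits > best_hits:
                best_label, best_hits = label, hits
        return best_label

    return predict


# Three deliberately different base classifiers (illustrative data only).
clf_a = keyword_classifier(
    {"sports": {"match", "goal"}, "health": {"flu", "vaccine"}}, "other"
)
clf_b = keyword_classifier(
    {"sports": {"team", "goal"}, "health": {"doctor"}}, "other"
)
clf_c = keyword_classifier({"business": {"market", "goal"}}, "other")

tweet = "Late goal wins the match"
prediction = majority_vote([clf_a, clf_b, clf_c], tweet)
print(prediction)
```

Here two of the three stand-ins label the tweet as sports and one as business, so the vote returns the sports label. Other combination schemes, such as weighted voting or stacking, would fit the same interface by replacing `majority_vote`.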






How to Cite

Mohanavalli, S., Karthika, S., ., S., R.Uthayan, K., & Sandya, N. (2018). Categorisation of Tweets Using Ensemble Classification Methods. International Journal of Engineering & Technology, 7(3.12), 722–725.
Received 2018-07-28
Accepted 2018-07-28
Published 2018-07-20