A comparative review of the challenges encountered in sentiment analysis of Indian regional language tweets vs English language tweets

  • Abstract
  • Keywords
  • References
  • PDF
  • Abstract

    With the developed use of online medium these days for sharing views, sentiments and opinions about products, services, organization and people, micro blogging and social networking sites are acquiring a huge popularity. One of the biggest social media sites namely Twitter is used by several people to share their life events, views and opinion about different areas and concepts. Sentiment analysis is the computational research of reviews, opinions, attitudes, views and peoples’ emotions about different products, services, firms and topics through categorizing them as negative and positive emotions. Sentiment analysis of tweets is a challenging task. This paper makes a critical review on the comparison of the challenges associated with sentiment analysis of Tweets in English Language versus Indian Regional Languages. Five Indian languages namely Tamil, Malayalam, Telugu, Hindi and Bengali have been considered in this research and several challenges associated with the analysis of Twitter sentiments in those languages have been identified and conceptualized in the form of a framework in this research through systematic review.



  • Keywords

    Sentiment analysis, Indian regional language tweets, challenges in sentiment analysis, twitter sentiment analysis of English tweets.

  • References

      [1] Venugopalan M & Gupta D, “Exploring sentiment analysis on twitter data”, Proceedings of eighth International Conference on Contemporary Computing, (2015), pp.241-243.

      [2] Chalothom T & Ellman J, “Simple approaches of sentiment analysis via ensemble learning”, Information science and applications, (2015), pp.631-639.

      [3] Narr S, Hulfenhaus M & Albayrak S, “Language-independent twitter sentiment analysis”, Knowledge discovery and machine learning (KDML), (2012), pp.12-14.

      [4] Riloff E, Qadir A, Surve P, De Silva L, Gilbert N & Huang R, “Sarcasm as contrast between a positive sentiment and negative situation”, Proceedings of Conference on Empirical Methods in Natural Language Processing, (2013), pp.704-714.

      [5] Kaur J, “A Review Paper on Twitter Sentiment Analysis Techniques”, International Journal for Research in Applied Science & Engineering Technology, Vol.4, No.10, (2016), pp.61-69.

      [6] Remus R, “Modeling and Representing Negation in Data-driven Machine Learning-based Sentiment Analysis”, ESSEM@ AI* IA , (2013), pp.22-33.

      [7] Irvine A & Callison-Burch C, “Combining bilingual and comparable corpora for low resource machine translation”, Proceedings of the eighth workshop on statistical machine translation, (2013), pp. 262–270.

      [8] Severyn A & Moschitti A, “Twitter sentiment analysis with deep convolutional neural networks”, SIGIR, (2015), pp. 959–962.

      [9] Xiang B & Zhou L, “Improving twitter sentiment analysis with topic-based mixture modeling and semi-supervised training”, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, Vol.2, (2014), pp.434-439.

      [10] Ghaleb OAM & Vijendran AS, “The Challenges of Sentiment Analysis on Social Web Communities”, International Conference on Intelligent Computing and Technology, (2017), pp.21-29.

      [11] Agrawal A & An A, “Kea: Sentiment Analysis of Phrases within short texts”, Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval), (2014), pp.380–384.

      [12] Makrynioti N & Vassalos V, “Sentiment extraction from tweets: multilingual challenges”, International Conference on Big Data Analytics and Knowledge Discovery, (2015), pp.136-148.

      [13] Bahrainian SA & Dengel A, “Sentiment analysis and summarization of twitter data”, IEEE 16th International Conference on Computational Science and Engineering, (2013), pp.227-234.

      [14] Asmi A & Ishaya T, “Negation Identification and Calculation in Sentiment Analysis”, Proceedings of the Second International Conference on Advances in Information Mining and Management, (2012), pp.1-7.

      [15] Sharmista A & Ramaswami M, “Tree Based Opinion Mining in Tamil for Product Recommendations using R”, International Journal of Computational Intelligence and Informatics, Vol.6, No.2, (2015), pp.110-119.

      [16] Bravo-Marquez F, Frank E & Pfahringer B, “From unlabelled tweets to twitter-specific opinion words”, Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, (2015), pp.743-746.

      [17] Nair DS, Jayan JP & Sherly E, “Sentiment Analysis of Malayalam film review using machine learning techniques”, IEEE International Conference on Advances in Computing, Communications and Informatics, (2015).

      [18] Anagha M, Kumar RR, Sreetha K & Reghu Raj PC, “A Novel Hybrid Approach on Maximum Entropy Classifier for Sentiment Analysis of Malayalam Movie Reviews”, International Journal Of Scientific Research, Vol.4, (2015).

      [19] Anu PC, Athila M, Heera BM, Lini KU & Cheerotha LR, “Aspect Based Sentiment Analysis in Malayalam”, International Journal of Advances in Engineering and Scientific Research, Vol.3, No.6, (2016), pp.27-34.

      [20] Shankar R, Shilpa KM, Patil S & Swamy S, “A Survey on Sentimental Analysis in Different Indian Dialects”, International Journal of Advanced Research in Computer and Communication Engineering, Vol.5, No.4, (2016), pp.1072-1076.

      [21] Nagaraju G, Mangathayaru N & Rani BP, “Dependency Parser for Telugu Language”, Proceedings of the Second ACM International Conference on Information and Communication Technology for Competitive Strategies, (2016), pp.138-139.

      [22] Rai V, Vijay S & Sharma DP, “A Karaka Based Approach to Cross Lingual Sentiment Analysis”, International Journal of Languages, Literature and Linguistics, Vol.3, No.4, (2017), pp.226-229.

      [23] Mishra D, Venugopalan M & Gupta D, “Context Specific Lexicon for Hindi Reviews”, Procedia Computer Science, (2016), pp.554-563.

      [24] Patra BG, Das D, Das A & Prasath R, “Shared task on sentiment analysis in Indian Languages (sail) tweets-an overview”, International Conference on Mining Intelligence and Knowledge Exploration, (2015), pp.650-655.

      [25] Cambria E, Olsher D & Rajagopal D, “SenticNet 3: a common and common-sense knowledge base for cognition-driven sentiment analysis”, Proceedings of AAAI conference on Artificial Intelligence, (2014), pp.1515–1521.

      [26] Khan S, “Convergence in spelling, and spell-checker for Romanized Bangla in computers and mobile phones”, IEEE International Conference on Informatics, Electronics & Vision (ICIEV), (2014), pp.1-5.

      [27] Chowdhury S & Chowdhury W, “Performing sentiment analysis in Bangla microblog posts”, the IEEE International Conference on Informatics, Electronics & Vision (ICIEV), (2014), pp.1-6.

      [28] Narayanan V, Arora I & Bhatia A, “Fast and accurate sentiment classification using an enhanced Naive Bayes model”, the International Conference on Intelligent Data Engineering and Automated Learning Springer, (2013), pp.194-201.

      [29] Banea C, Mihalcea R & Wiebe J, “Sense-level subjectivity in a multilingual setting”, Computer Speech & Language, Vol.28, No.1, (2014), pp.7-19.

      [30] Karanasou M, Ampla A, Doulkeridis C & Halkidi M, “Scalable and Real-time Sentiment Analysis of Twitter Data”, 16th IEEE International Conference on Data Mining Workshops (ICDMW), (2016), pp.944-951.

      [31] Bilgin M & Şentürk İF, “Sentiment analysis on Twitter data with semi-supervised Doc2Vec”, IEEE International Conference on Computer Science and Engineering, (2017), pp.661-666.




Article ID: 12394
DOI: 10.14419/ijet.v7i2.21.12394

Copyright © 2012-2015 Science Publishing Corporation Inc. All rights reserved.