A comparative review of the challenges encountered in sentiment analysis of Indian regional language tweets vs English language tweets

Saini Jacob Soman; P Swaminathan; R Anandan; K Kalaivani

doi:10.14419/ijet.v7i2.21.12394

Authors

Saini Jacob Soman
P Swaminathan
R Anandan
K Kalaivani

Received date: May 3, 2018

Accepted date: May 3, 2018

Published date: April 20, 2018

DOI:

https://doi.org/10.14419/ijet.v7i2.21.12394

Keywords:

Sentiment analysis, Indian regional language tweets, challenges in sentiment analysis, Twitter sentiment analysis of English tweets.

Abstract

With the growing use of online media for sharing views, sentiments and opinions about products, services, organizations and people, microblogging and social networking sites have become hugely popular. Twitter, one of the largest social media sites, is used by many people to share life events, views and opinions on a wide range of topics. Sentiment analysis is the computational study of people's reviews, opinions, attitudes, views and emotions about products, services, firms and topics, which it categorizes as positive or negative. Sentiment analysis of tweets is a challenging task. This paper critically reviews and compares the challenges of sentiment analysis of tweets in English with those in Indian regional languages. Five Indian languages are considered: Tamil, Malayalam, Telugu, Hindi and Bengali. Through a systematic review, the challenges associated with analysing Twitter sentiment in these languages are identified and conceptualized as a framework.

