A comparative review of the challenges encountered in sentiment analysis of Indian regional language tweets vs English language tweets

  • Authors

    • Saini Jacob Soman
    • P Swaminathan
    • R Anandan
    • K Kalaivani
    2018-04-20
    https://doi.org/10.14419/ijet.v7i2.21.12394
  • Sentiment analysis, Indian regional language tweets, challenges in sentiment analysis, twitter sentiment analysis of English tweets.
  • With the developed use of online medium these days for sharing views, sentiments and opinions about products, services, organization and people, micro blogging and social networking sites are acquiring a huge popularity. One of the biggest social media sites namely Twitter is used by several people to share their life events, views and opinion about different areas and concepts. Sentiment analysis is the computational research of reviews, opinions, attitudes, views and peoples’ emotions about different products, services, firms and topics through categorizing them as negative and positive emotions. Sentiment analysis of tweets is a challenging task. This paper makes a critical review on the comparison of the challenges associated with sentiment analysis of Tweets in English Language versus Indian Regional Languages. Five Indian languages namely Tamil, Malayalam, Telugu, Hindi and Bengali have been considered in this research and several challenges associated with the analysis of Twitter sentiments in those languages have been identified and conceptualized in the form of a framework in this research through systematic review.

     

     

  • References

    1. [1] Venugopalan M & Gupta D, “Exploring sentiment analysis on twitter dataâ€, Proceedings of eighth International Conference on Contemporary Computing, (2015), pp.241-243.[2] Chalothom T & Ellman J, “Simple approaches of sentiment analysis via ensemble learningâ€, Information science and applications, (2015), pp.631-639.[3] Narr S, Hulfenhaus M & Albayrak S, “Language-independent twitter sentiment analysisâ€, Knowledge discovery and machine learning (KDML), (2012), pp.12-14.[4] Riloff E, Qadir A, Surve P, De Silva L, Gilbert N & Huang R, “Sarcasm as contrast between a positive sentiment and negative situationâ€, Proceedings of Conference on Empirical Methods in Natural Language Processing, (2013), pp.704-714.[5] Kaur J, “A Review Paper on Twitter Sentiment Analysis Techniquesâ€, International Journal for Research in Applied Science & Engineering Technology, Vol.4, No.10, (2016), pp.61-69.[6] Remus R, “Modeling and Representing Negation in Data-driven Machine Learning-based Sentiment Analysisâ€, ESSEM@ AI* IA , (2013), pp.22-33.[7] Irvine A & Callison-Burch C, “Combining bilingual and comparable corpora for low resource machine translationâ€, Proceedings of the eighth workshop on statistical machine translation, (2013), pp. 262–270.[8] Severyn A & Moschitti A, “Twitter sentiment analysis with deep convolutional neural networksâ€, SIGIR, (2015), pp. 959–962.[9] Xiang B & Zhou L, “Improving twitter sentiment analysis with topic-based mixture modeling and semi-supervised trainingâ€, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, Vol.2, (2014), pp.434-439.

      [10] Ghaleb OAM & Vijendran AS, “The Challenges of Sentiment Analysis on Social Web Communitiesâ€, International Conference on Intelligent Computing and Technology, (2017), pp.21-29.

      [11] Agrawal A & An A, “Kea: Sentiment Analysis of Phrases within short textsâ€, Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval), (2014), pp.380–384.

      [12] Makrynioti N & Vassalos V, “Sentiment extraction from tweets: multilingual challengesâ€, International Conference on Big Data Analytics and Knowledge Discovery, (2015), pp.136-148.

      [13] Bahrainian SA & Dengel A, “Sentiment analysis and summarization of twitter dataâ€, IEEE 16th International Conference on Computational Science and Engineering, (2013), pp.227-234.

      [14] Asmi A & Ishaya T, “Negation Identification and Calculation in Sentiment Analysisâ€, Proceedings of the Second International Conference on Advances in Information Mining and Management, (2012), pp.1-7.

      [15] Sharmista A & Ramaswami M, “Tree Based Opinion Mining in Tamil for Product Recommendations using Râ€, International Journal of Computational Intelligence and Informatics, Vol.6, No.2, (2015), pp.110-119.

      [16] Bravo-Marquez F, Frank E & Pfahringer B, “From unlabelled tweets to twitter-specific opinion wordsâ€, Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, (2015), pp.743-746.

      [17] Nair DS, Jayan JP & Sherly E, “Sentiment Analysis of Malayalam film review using machine learning techniquesâ€, IEEE International Conference on Advances in Computing, Communications and Informatics, (2015).[18] Anagha M, Kumar RR, Sreetha K & Reghu Raj PC, “A Novel Hybrid Approach on Maximum Entropy Classifier for Sentiment Analysis of Malayalam Movie Reviewsâ€, International Journal Of Scientific Research, Vol.4, (2015).

      [19] Anu PC, Athila M, Heera BM, Lini KU & Cheerotha LR, “Aspect Based Sentiment Analysis in Malayalamâ€, International Journal of Advances in Engineering and Scientific Research, Vol.3, No.6, (2016), pp.27-34.

      [20] Shankar R, Shilpa KM, Patil S & Swamy S, “A Survey on Sentimental Analysis in Different Indian Dialectsâ€, International Journal of Advanced Research in Computer and Communication Engineering, Vol.5, No.4, (2016), pp.1072-1076.[21] Nagaraju G, Mangathayaru N & Rani BP, “Dependency Parser for Telugu Languageâ€, Proceedings of the Second ACM International Conference on Information and Communication Technology for Competitive Strategies, (2016), pp.138-139. [22] Rai V, Vijay S & Sharma DP, “A Karaka Based Approach to Cross Lingual Sentiment Analysisâ€, International Journal of Languages, Literature and Linguistics, Vol.3, No.4, (2017), pp.226-229.[23] Mishra D, Venugopalan M & Gupta D, “Context Specific Lexicon for Hindi Reviewsâ€, Procedia Computer Science, (2016), pp.554-563.[24] Patra BG, Das D, Das A & Prasath R, “Shared task on sentiment analysis in Indian Languages (sail) tweets-an overviewâ€, International Conference on Mining Intelligence and Knowledge Exploration, (2015), pp.650-655.

      [25] Cambria E, Olsher D & Rajagopal D, “SenticNet 3: a common and common-sense knowledge base for cognition-driven sentiment analysisâ€, Proceedings of AAAI conference on Artificial Intelligence, (2014), pp.1515–1521.

      [26] Khan S, “Convergence in spelling, and spell-checker for Romanized Bangla in computers and mobile phonesâ€, IEEE International Conference on Informatics, Electronics & Vision (ICIEV), (2014), pp.1-5.

      [27] Chowdhury S & Chowdhury W, “Performing sentiment analysis in Bangla microblog postsâ€, the IEEE International Conference on Informatics, Electronics & Vision (ICIEV), (2014), pp.1-6.

      [28] Narayanan V, Arora I & Bhatia A, “Fast and accurate sentiment classification using an enhanced Naive Bayes modelâ€, the International Conference on Intelligent Data Engineering and Automated Learning Springer, (2013), pp.194-201.

      [29] Banea C, Mihalcea R & Wiebe J, “Sense-level subjectivity in a multilingual settingâ€, Computer Speech & Language, Vol.28, No.1, (2014), pp.7-19.

      [30] Karanasou M, Ampla A, Doulkeridis C & Halkidi M, “Scalable and Real-time Sentiment Analysis of Twitter Dataâ€, 16th IEEE International Conference on Data Mining Workshops (ICDMW), (2016), pp.944-951.

      [31] Bilgin M & Åžentürk Ä°F, “Sentiment analysis on Twitter data with semi-supervised Doc2Vecâ€, IEEE International Conference on Computer Science and Engineering, (2017), pp.661-666.

  • Downloads

  • How to Cite

    Jacob Soman, S., Swaminathan, P., Anandan, R., & Kalaivani, K. (2018). A comparative review of the challenges encountered in sentiment analysis of Indian regional language tweets vs English language tweets. International Journal of Engineering & Technology, 7(2.21), 319-322. https://doi.org/10.14419/ijet.v7i2.21.12394