Sentiment Analysis using Logistic Regression and Effective Word Score Heuristic

  • Authors

    • Abhilasha Tyagi
    • Naresh Sharma
    2018-04-25
    https://doi.org/10.14419/ijet.v7i2.24.11991
  • Sentiment, Unigrams, Polarity, Machine Laerning, Twitter.
  • Sentiment Analysis is a method for judging somebody's sentiment or feeling with respect to a specific thing. It is utilized to recognize and arrange the sentiments communicated in writings. The web-based social networking sites like twitter draws in a huge number of clients that are online for imparting their insights in the form of tweets or comments. The tweets can be then classified into positive, negative, or neutral. In the proposed work, logistic regression classification is used as a classifier and unigram as a feature vector. For accuracy, k fold cross validation data mining technique is used. For choosing precise training sample, tweet subjectivity is utilized. The idea of Effective Word Score heuristic is likewise presented to find the polarity score of words that are frequently used. This additional heuristic can speed up the classification process of sentiments with standard machine learning approaches.

     

     

  • References

    1. [1] Alec Go, Richa Bhayani, and Lei Huang. Twitter Sentiment Classification using Distant Supervision. CS224N Project Report, Stanford, pages 1-12. 2009.

      [2] Pang B, Lee L, Opinion mining and sentiment analysis in Found Trends Inform Retriev, 2 (2008), pp. 1135.

      [3] Hatzivassiloglou V, McKeown K, Predicting the Semantic Orientation of Adjectives.

      [4] B. OConnor, R. Balasubramanyan, B. R. Routledge, and N. A. Smith, From tweets to polls: Linking text sentiment to public opinion time series, in Proc. 4th Int. AAAI Conf. Weblogs Social Media, Washington, DC, USA, 2010.

      [5] J. Bollen, H. Mao, and X. Zeng, Twitter mood predicts the stock market, J. Computer Science., vol. 2, no. 1, pp. 18, Mar. 2011.

      [6] G. Mishne and N. Glance, Predicting movie sales from blogger sentiment, in Proc. AAAI-CAAW, Stanford, CA, USA, 2006.

      [7] Liu B, Sentiment analysis and opinion mining in Synth Lect Human Lang Technol (2012)

      [8] Ohana B., Tierney B., Sentiment Classification of Reviews Using SentiWordNet in 9th. IT and T Conference, Dublin Institute of Technology, 2009.

      [9] Pang B, Lillian L, Thumbs up? Sentiment Classification using Machine Learning Techniques in Proceedings of EMNLP 2002, pp. 7986.

      [10] P. Waila, Marisha, V. K. Singh, M. K. Singh, Evaluating Machine Learning and Unsupervised Semantic Orientation approaches for sentiment analysis of textual reviews Computational Intelligence and Computing Research (ICCIC), 2012 IEEE International Conference on 18-20 Dec. 2012

      [11] Duda, R.O. Hart, P.E.(1973), Pattern classification and scene analysis, Wiley, New York

      [12] McCallullum A and Nigam K, A Comparison of Event Models for Naive Bayes Text Classification 1998.

      [13] Nicholls C., Song F., Improving sentiment analysis with Part-of-Speech weighting in 2009 International Conference on Machine Learning and Cybernetics (Volume: 3), 2009.

      [14] Haddi E, Liu Xi, Shi Y, The Role of Text Pre-processing in Sentiment Analysis in ITQM2013, Science Direct, 2013.

      [15] Andranik Tumasjan, Timm O. Sprenger, Philipp G. Sandner, Isabell M. Welpe, Predicting Elections with Twitter:What 140 Characters Reveal about Political Sentiment.

      [16] Bing Liu, Sentiment Analysis and Opinion Mining.

      [17] Bruno Ohana, Brendan Tierney, Sentiment Classification of Reviews Using Senti- WordNet.

      [18] Pang B, Lee L, S. Vaithyanathan Thumbs up? Sentiment Classification using Machine Learning Techniques.

      [19] Waila P., Marisha, Singh V.K. ,Singh M.K. ,Evaluating Machine Learning and Unsupervised Semantic Orientatio approaches for sentiment analysis of textual reviews.

      [20] .Gautam G. , Yadav P. , Sentiment Analysis of Twitter Data using Machine Learning Approaches and Semantic Analysis.

      [21] Sun B., Ng V. , Analysis Sentimental Influence of Posts on Social Network.

      [22] Bing L., Chan. K.C.C. , Public Sentiment Analysis in Twitter Data for Prediction of a Company’s Stock Price Movements.

      [23] Ms. Devaki V. Ingule , Prof. Gyankamal J. Chhajed, Survey of Public Sentiment Interpretation on Twitter.

      [24] Zhao L., Ren Y., Wang J., Meng L., and Zou C., Research on the opinion mining system for massive social media data.

      [25] Lo, Y.W, Potdar, V, A review of opinion mining and sentiment classification framework in social network.

      [26] Peter D. Turney, Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews.

      [27] Amit Goyal and Hal Daume III , Generating Semantic Orientation Lexicon using Large Data and Thesaurus.

      [28] Jonathon Read, John Carroll, Weakly Supervised Techniques for Domain- Independent Sentiment Classification.

      [29] Emma Haddi, Xiaohui Liu, Yong Shi, The Role of Text Pre-processing in Sentiment Analysis.

      [30] Hassan Saif, Yulan He and Harith Alani, Semantic Sentiment Analysis of Twitter.

      [31] Efthymios Kouloumpis, Theresa Wilson, Johanna Moore, Twitter Sentiment Analysis: The Good the Bad and the OMG!

      [32] Oaindrila Das, Rakesh Chandra Balabantaray, Sentiment Analysis of Movie Reviews using POS tags and Term Frequencies.

      [33] http://dataminingwarehousing.blogspot.in/2008/10/data-mining-steps-of-datamining.html

      [34] http://www.anderson.ucla.edu/faculty/jason.frand/teacher/technologies/palace/datamining.htm

      [35] http://www.thearling.com/text/dmwhite/dmwhite.htm

      [36] https://blog.semanticlab.net/2013/09/10/techniques-and-applications-forsentiment-analysis/

      [37] R. Narayanan, B. Liu, and A. Choudhary, Sentiment analysis of conditional sentences, in Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 Volume 1, Stroudsburg, PA, USA, 2009, pp. 180189.

      [38] D. Davidov, O. Tsur, and A. Rappoport, Semi-supervised recognition of sarcastic sentences in Twitter and Amazon, in Proceedings of the Fourteenth Conference on Computational Natural Language Learning, Stroudsburg, PA, USA, 2010, pp. 107116.

      [39] N. Jindal and B. Liu, Identifying comparative sentences in text documents, in Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, New York, NY, USA, 2006, pp. 244251.

      [40] https://www.iprospect.com/en/ca/blog/10-sentiment-analysis-tools-track-socialmarketing-

      [41] S.V.Manikanthan and D.Sugandhi “ Interference Alignment Techniques For Mimo Multicell Based On Relay Interference Broadcast Channel †International Journal of Emerging Technology in Computer Science & Electronics (IJETCSE) ISSN: 0976-1353 Volume- 7 ,Issue 1 –MARCH 2014.

      [42] T. Padmapriya and V. Saminadan, “Distributed Load Balancing for Multiuser Multi-class Traffic in MIMO LTE-Advanced Networksâ€, Research Journal of Applied Sciences, Engineering and Technology (RJASET) - Maxwell Scientific Organization , ISSN: 2040-7459; e-ISSN: 2040-7467, vol.12, no.8, pp:813-822, April 2016.

  • Downloads

  • How to Cite

    Tyagi, A., & Sharma, N. (2018). Sentiment Analysis using Logistic Regression and Effective Word Score Heuristic. International Journal of Engineering & Technology, 7(2.24), 20-23. https://doi.org/10.14419/ijet.v7i2.24.11991