A Study on Detecting Misleading Online News Using Bigram and Cosine Similarity

Normala Che Eembi; Iskandar Ishak; Fatimah Sidi; Lilly Suriani Affendey

doi:10.14419/ijet.v7i4.31.23375

Received date: December 7, 2018

Accepted date: December 7, 2018

Published date: December 9, 2018

Keywords:

Fake news, Deception, Lies, Misleading headlines, Deceiving news

Abstract

Fake news can create negative perceptions of businesses, organizations, and governments. One way fake news is created is through deceptive news writing. Many researchers have developed machine-learning approaches for detecting deceptive news content, each with its own focus. Previous research has emphasized components of the news content, such as grammar, humor, punctuation, and body-dependent and body-independent features. This paper presents a new approach for detecting deceptive news in the form of misleading news, which focuses on the similarity between a news item's content and its headline, measured using bigrams and cosine similarity. Based on the experiments, the proposed approach performs better at detecting deceptive news.
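The abstract does not give implementation details, so the following is only a minimal sketch of how a headline could be compared with its body using bigram counts and cosine similarity. The tokenization rule, the use of raw counts as vector weights, and the function names `bigrams`, `cosine_similarity`, and `headline_body_similarity` are all assumptions made for illustration, not the authors' method.

```python
import math
import re
from collections import Counter


def bigrams(text):
    """Count word bigrams in text (assumed tokenization: lowercase alphanumeric runs)."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return Counter(zip(tokens, tokens[1:]))


def cosine_similarity(a, b):
    """Cosine similarity between two sparse count vectors stored as Counters."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = math.sqrt(sum(count * count for count in a.values()))
    norm_b = math.sqrt(sum(count * count for count in b.values()))
    if norm_a == 0 or norm_b == 0:
        # No bigrams on one side (empty or single-word text): treat as no overlap.
        return 0.0
    return dot / (norm_a * norm_b)


def headline_body_similarity(headline, body):
    """Hypothetical helper: bigram cosine similarity between a headline and its body."""
    return cosine_similarity(bigrams(headline), bigrams(body))
```

Under these assumptions, a low score for a headline and body pair would suggest that the headline shares few word pairs with the article it introduces. Any threshold for labelling a headline as misleading would have to be chosen empirically and is not specified here.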

