Comparison of twitter spam detection using various  machine learning algorithms

M. Sangeetha; S. Nithyanantham; M. Jayanthi

doi:10.14419/ijet.v7i1.3.9268

Article Summary Abstract References Full Article How to cite

Authors
- M. Sangeetha
- S. Nithyanantham
- M. Jayanthi
2017-12-31

https://doi.org/10.14419/ijet.v7i1.3.9268
Twitter, Spammer, tweet, machine learning algorithm, account, tweet content â€“based.
Online Social Networks(OSNs) have mutual themes such as information sharing, person-to-person interaction and creation of shared and collaborative content.Â Lots of micro blogging websites available like Twitter, Instagram, Tumblr. A standout amongst the most prominent online networking stages is Twitter. It has 313 million months to month dynamic clients which post of 500 million tweets for each day. Twitter allows users to send short text based messages with up to 140-character letters called "tweets". Enlisted clients can read and post tweets however the individuals who are unregistered can just read them. Due to the reputation it attracts the consideration of spammers for their vindictive points, for example, phishing true blue clients or spreading malevolent programming and promotes through URLs shared inside tweets, forcefully take after/unfollow valid clients and commandeer drifting subjects to draw in their consideration, proliferating obscenity. Twitter Spam has become a critical problem nowadays. By looking at the execution of an extensive variety of standard machine learning calculations, fundamentally expecting to distinguish the acceptable location execution in light of a lot of information by utilizing account-based and tweet content-based highlights.
References
1. [1] F. Benevenuto, G. Magno, T. Rodrigues, and V. Almeida, â€œDetecting spammers on Twitter,â€ in Proc. Collaboration, Electron. Messaging, Anti-Abuse Spam Conf. (CEAS), vol. 6. 2010, p. 12.
  [2] C. Chen, J. Zhang, X. Chen, Y. Xiang, and W. Zhou, â€œ6 million spam tweets: A large ground truth for timely Twitter spam detection,â€ in Proc.
  [3] C. Yang, R. Harkreader, and G. Gu, â€œEmpirical evaluation and new design for fighting evolving Twitter spammers,â€ IEEE Trans. Inf. Forensics Security, vol. 8, no. 8, pp. 12801293, Aug. 2013.
  [4] K. Thomas, C. Grier, J. Ma, V. Paxson, and D. Song, â€œDesign and evaluation of a real-time URL spam filtering service,â€ in Proc.
  [5] Cran R-Project, R Project Website. (Aug.6, 2015). A Short Introduction to the Caret Package.
  [6] M. Kuhn, â€œCaret package,â€ J. Statist. Softw., vol. 28, no. 5, pp. 1_26, 2008.
  [7] Z. Chu, S. Gianvecchio, H. Wang, S. Jajodia, Who is Tweeting on Twitter: Human, Bot, or Cyborg?, in: 26th Annu. Comput. Secur. Appl. Conf. (ACSAC 2010), Austin, Texas, USA, 2010: pp. 21â€“30. doi:10.1145/1920261.1920265.
  [8] P. Kaur, A. Singhal, J. Kaur, Spam Detection on Twitter: A Survey, in: 2016 Int. Conf. Comput. Sustain. Glob. Dev., IEEE, New Delhi, India, 2016: pp. 2570â€“2573.
  [9] C.D. Gowri, V. Mohanraj, A Survey on Spam Detection in Twitter: A Review, Int. J. Comput. Sci. Bus. Informatics. 14 (2014) 92â€“102.
  [10] J. Song, S. Lee, and J. Kim,â€œSpam filtering in Twitter using sender receiver relationship,â€ in Proc. Int.Workshop Recent Adv. Intrusion Detection, 2011, pp. 301317.
  [11] Statista. Number of Monthly Active Twitter Users Worldwide from 1st Quarter 2010 to 2nd Quarter 2016 (in millions), accessed on Aug. 9, 2016.
  [12] G.Stringhini, C. Kruegel, and G. Vigna, â€œDetecting spammers on social networks,â€ in Proc. 26th Annu. Comput. Secur. Appl. Conf., 2010,pp. 1-9.IEEE Int. Conf. Commun. (ICC), Jun. 2015, pp. 70657070.
  [13] G. Biau, â€œAnalysis of a random forests model,â€ J. Mach. Learn. Res.,vol. 13, pp. 1063_1095, Apr. 2012.
  [14] C.M.Bishop,â€œPattern recognition and machine learning,â€ New York,NY,USA: Springer, 2006.
  [15] D. Conway and J. White, Machine Learning for Hackers. Newton, MA,USA: O'Reilly Media, 2012.
  [16] M. Egele, G. Stringhini, C. Kruegel, and G. Vigna, â€œCOMPA: Detecting compromised accounts on social networks,â€ in Proc. NDSS, 2013.
  [17] J.Friedman, T. Hastie, and R. Tibshirani, â€œAdditive logistic regression: A statistical view of boosting,â€ Ann. Statist., vol. 28, no. 2, p. 2000, 1998.
  [18] J. H. Friedman, â€œGreedy function approximation: A gradient boosting machine,â€ Ann. Statist., vol. 29, no. 5, pp. 1189_1232, 2001.
  [19] K. Ghosh, P. Chaudhuri, and C. A. Murthy, â€œOn visualization and aggregation of nearest neighbor classifiers,â€ IEEE Trans. Pattern Anal.Mach. Intell., vol. 27, no. 10, pp. 1592_1602, Oct. 2005.
  [20] K. Hechenbichler and K. Schliep, â€œWeighted K-nearest-neighbor techniques and ordinal classifcation,â€Ludwigs_Maximilias Univ. Munich,Munich, Germany, Discussion Paper 399, SFB 386, 2004, p. 16
  [21] H. Wang, â€œDon't follow me: Spam detection in Twitter,â€ in Proc. Int. Conf. Secur. Cryptogr. (SECRYPT), 2010, pp. 1_10.
  [22] D. Wang, S. B. Navathe, L. Liu, D. Irani, A. Tamersoy, and C. Pu, â€œClick traffic analysis of short URL spam on Twitterâ€.
  [23] J. R. Quinlan. Data mining tools See5 and C5.0, accessed on Jun. 10, 2017.[Online]. Available: http://www.rulequest.com/see5-info.html
  [24] Abdullah Talha Kabakus , Resul Kara,â€A Survey of Spam Detection Methods on Twitteâ€.
  [25] K. Hechenbichler and K. Schliep, â€œWeighted K-nearest-neighbor techniques and ordinal classification,â€ LudwigsMaximilians Univ. Munich, Munich, Germany, Discussion Paper 399, SFB 386, 2004, p. 16
Downloads
How to Cite
Sangeetha, M., Nithyanantham, S., & Jayanthi, M. (2017). Comparison of twitter spam detection using various machine learning algorithms. International Journal of Engineering & Technology, 7(1.3), 61-65. https://doi.org/10.14419/ijet.v7i1.3.9268
ACM

ACS

APA

ABNT

Chicago

Harvard

IEEE

MLA

Turabian

Vancouver

Download Citation

Endnote/Zotero/Mendeley (RIS)

BibTeX

Comparison of twitter spam detection using various machine learning algorithms

Authors

References

Downloads

How to Cite

Published