Analysis of Writer Styles in Punjabi

  • Authors

    • A. Pandian
    • Stephen Wahi
    • Yash Tokas
    • k. Manikandan
    • V. V.Ramalingams
    2018-11-27
    https://doi.org/10.14419/ijet.v7i4.19.23174
  • Authorship Identification, Punjabi poetry corpus, Feature extraction, J48 Decision Tree, J48 Classifier.
  • Author Identification alludes to the issue of distinguishing the creator of a mysterious content. From the machine learning perspective, this is a solitary mark content arrangement assignment. This errand is done on the supposition that the creator of an obscure content can be separated by looking at a couple of lexical highlights extricated from that obscure content with those of writings having known writers. In this paper, Authorship Identification process is connected on Punjabi verse dataset comprising of Punjabi ballads composed by 5 unique writers. Different highlights extensively ordered as measurable (word-check, roast tally, and so forth.), linguistic (i.e. lexical) and semantically (dialect subordinate) are first chosen utilizing the J48 Decision Tree Algorithm. They chose highlights are thusly, utilized as a contribution to the J48 classifier and the approval of the proposed framework is assessed based on Precision, Recall, F-score and Accuracy.

     

     

  • References

    1. [1] Farkhund Iqbal, Hamad Binsalleeh, Benjamin C.M. Fung, Mourad Debbabi, 2015, “E-mail authorship attribution usingcustomized associative classificationâ€, Digital Investigation (Elsevier), Vol.7, pp.56-64

      [2] Sanjanasri J.P and Anand Kumar M, “A Computational Framework for Tamil Document Classification using Random Kitchen Sinkâ€, IEEE 2015, International Conference on Advances in Computing, Communications and Informatics(ICACCI)

      [3] Mahmoud Khonji, Youssef Iraqi, Andrew Jones,“An Evaluation of Authorship Attribution Using Random Forestsâ€, IEEE 2015, International Conference on Information and Communication Technology Research (ICTRC2015)

      [4] Ahmed Fawziotoom, Emad E Abdullah, Shifaa Jaafar, Aseer Hamdellh, Dana Amer, “Towards Author Identification of Arabic Text Articlesâ€, IEEE 2014, 5th International Conference on Information and Communication Systems(ICICS)

      [5] Pandian, A., and Md. Abdul Karim Sadiq, 2014, “Authorship Categorization In Email Investigations Using Fisher’s Linear Discriminate Method With Radial Basis Functionâ€, International Journal of Computer Science, Vol.10,No.6,pp.1003-1014 (SNIP: 0.874)

      [6] Al-Falahi Ahmed, Ramdani Mohammad, Bellahfkimustafa, Al-Sarem Mohammad, “Authorship Attribution in Arabic Poetryâ€,78-1- 4799-7560- 0/15, 2015, IEEE

      [7] Ahmed Fawzi Otoom, Emad E. Abdullah, Shifaa Jaafer, Aseel Hamdallh, Dana Amer “Towards Author Identification of Arabic Text Articlesâ€, 2014,IEEE, 5th International Conference on Information and Communication Systems (ICICS)

      [8] Bhargava Urala k, A.G.Ramakrishnan and Sahil Mohammad, “Recognition of Open Vocabulary, Online Tamil Handwritten Pages in Tamil Scriptâ€, 2014 IEEE, Vol.42, No.3, pp.6-9.

      [9] Pandian A. and Md. Abdul Karim Sadiq, 2012, “Detection ofFraudulent Emails by Authorship Extractionâ€, International Journal of Computer Application Vol.41, No.7, pp.7 – 12.

      [10] Pandian A. and Md. Abdul Karim Sadiq, 2013, “Authorship Attribution in Tamil Language Email For Forensic Analysisâ€, International Review on Computers and Software, Vol. 8, No. 12, pp.2882-2888, (SNIP: 1.178).

      [11] A Pandian, V V Ramalingam, K Manikandan, R P Vishnu Preet. "Authorship Identification for Tamil Classical Poem using Subspace Discriminant Algorithm", Journal of Physics: Conference Series, 2018.

  • Downloads

  • How to Cite

    Pandian, A., Wahi, S., Tokas, Y., Manikandan, k., & V.Ramalingams, V. (2018). Analysis of Writer Styles in Punjabi. International Journal of Engineering & Technology, 7(4.19), 407-411. https://doi.org/10.14419/ijet.v7i4.19.23174