Handwritten Malayalam Character Recognition using Regional Zone with Structural Features

  • Authors

    • Ajay James Government Engineering College, Thrissur
    • Raveena P V Government Engineering College, Thrissur
    • Chandran Saravanan NIT Durgapur
    https://doi.org/10.14419/ijet.v7i4.12551

    Received date: May 7, 2018

    Accepted date: July 25, 2018

    Published date: November 11, 2018

  • Abstract

    Optical Character Recognition (OCR) extracts features from an image of script and converts it to machine-readable code. OCR comprises of line segmentation, word segmentation, character segmentation and character Recognition. Printed documents are efficiently converted to the editable text format with almost 100% accuracy. Handwritten character recognition places difficulties in identifying and translating scripts because of the wide variation in human handwriting. The writing styles like line spacing, word spacing, character sizes and shape of each character varies from person to person. Feature extraction and character recognition are different for different languages and become the most complicated task among the phases of OCR. By language characteristics, feature extraction can differ for each language. The Malayalam characters are characterized by their curved and non-cursive nature. The handwritten character recognition for the Malayalam language that proposed here uses a regional zone based method with structural feature extraction.

  • References

    1. A. T. Jia, Y. Ayappally and K. Syama, ”Malayalam OCR: N-gram approach using SVM classifier,” 2013 International Conference on
    2. Advances in Computing, Communications and Informatics (ICACCI), Mysore, 2013, pp. 1799-1803.
    3. S. A. Chaudhari and R. M. Gulati, ”An OCR for separation and identification of mixed English — Gujarati digits using kNN classifier,” 2013 International Conference on Intelligent Systems and Signal Processing (ISSP), Gujarat, 2013, pp. 190-193.
    4. K. A. Mohand, T. Paquet, N. Ragot and L. Heutte, ”Structure Adaptation of HMM Applied to OCR,” 2010 20th International Conference
    5. on Pattern Recognition, Istanbul, 2010, pp. 2877-2880.
    6. P. N. Sastry, T. R. V. Lakshmi, N. V. K. Rao, T. V. Rajinikanth and A. Wahab, ”Telugu Handwritten Character Recognition Using Zoning Features,” 2014 International Conference on IT Convergence and Security (ICITCS), Beijing, 2014, pp. 1-4.
    7. Michael Blumenstein, Brijesh Verma, and Hasan Basli. 2003 ”A novel feature extraction technique for the recognition of segmented handwritten characters”, In Document Analysis and Recognition, 2003. Proceedings, Seventh International Conference on, IEEE 2003, pp. 137-141.
    8. Gayathri, P., and Sonal Ayyappan. 2014 ”Off-line handwritten character recognition using Hidden Markov Model”, In Advances in Computing, Communications and Informatics ICACCI, 2014 International Conference on, IEEE, 2014, pp. 518-523.
    9. Nisha Sharma, Bhupendra Kumar, Vandita Singh, 2014 ”Recognition of Offline Hand printed English Characters, Numerals and Special Symbols”, 5th International Conference, 2014, pp.640-645.
    10. R. S. V, V. R. P and M. A. V. N, ”Isolated Handwritten Kannada and Tamil Numeral Recognition: A Novel Approach,” 2008 First
    11. International Conference on Emerging Trends in Engineering and Technology, Nagpur, Maharashtra, 2008, pp. 1192-1195.
    12. R. Giridharan, E. K. Vellingiriraj and P. Balasubramanie, ”Identification of Tamil ancient characters and information retrieval from temple epigraphy using image zoning”, 2016 International Conference on Recent Trends in Information Technology (ICRTIT), Chennai, 2016, pp. 1-7.
    13. Dash, Kalyan Sourav, Niladri B. Puhan, and Ganapati Panda 2015 ”Handwritten numeral recognition using non-redundant Stockwell
    14. transform and bio-inspired optimal zoning”,IET Image Processing 9, no. 10, 2015, pp.874-882
    15. G. S. G. Rajput and R. Horakeri, ”Zone based handwritten Kannada character recognition using crack code and SVM,” 2013 International Conference on Advances in Computing, Communications and Informatics (ICACCI), Mysore, 2013, pp. 1817-1821.
    16. Pirlo, Giuseppe, and Donato Impedovo 2012 ”Adaptive membership functions for handwritten character recognition by voronoi-based image zoning”, IEEE Transactions on Image Processing,21, no. 9, 2012,pp. 3827-3837.
    17. M. K. Mahto, K. Bhatia and R. K. Sharma, ”Combined horizontal and vertical projection feature extraction technique for Gurmukhi handwritten character recognition,” 2015 International Conference on Advances in Computer Engineering and Applications, Ghaziabad, 2015, pp. 59-65.
    18. R. Gopakumar, N. V. Subbareddy, K. Makkithaya and U. Dinesh Acharya, ”Zone-based structural feature extraction for script identification from Indian documents,” 2010 5th International Conference on Industrial and Information Systems, Mangalore, 2010, pp. 420-425.
    19. K. S. Dash, N. B. Puhan and G. Panda, ”A hybrid feature and discriminant classifier for high accuracy handwritten Odia numeral recognition,” 2014 IEEE REGION 10 SYMPOSIUM, Kuala Lumpur, 2014, pp. 531-535.
    20. A. Raj, ”An optical character recognition of machine printed Oriya script,” 2015 Third International Conference on Image Information Processing (ICIIP), Waknaghat, 2015, pp. 543-547.
    21. Kartar Singh Siddharth, Mahesh Jangid, Renu Dhir, Rajneesh Rani, 2011”Handwritten Gurmukhi Character Recognition Using Statistical and Background Directional Distribution Features”, International Journal on Computer Science and Engineering, Vol. 3, No. 6 June 2011, pp. 0975-3397.
    22. R. M. O. Cruz, G. D. C. Cavalcanti and T. I. Ren, ”An ensemble classifier for offline cursive character recognition using multiple feature extraction techniques,” The 2010 International Joint Conference on Neural Networks (IJCNN), Barcelona, 2010, pp. 1-8.
    23. U. Pal, T. Wakabayashi and F. Kimura, ”A System for Off-Line Oriya Handwritten Character Recognition Using Curvature Feature,” 10th International Conference on Information Technology (ICIT 2007), Orissa, 2007, pp. 227-229.
    24. Shanjana C, Ajay James, 2014 ”Offline Recognition of Malayalam Handwritten Text”, 8th International Conference Interdisciplinary in Engineering, INTER-ENG 2014, 9-10 October 2014. pp. 772-779.
    25. Primekumar K.P., Idiculla S.M, ”Online Malayalam handwritten character recognition using HMM and SVM”, IEEE 2013 International
    26. Conference on Signal Processing, Image Processing and Pattern Recognition (ICSIPR).
    27. S. M. Joseph, V. A. Rahiman and K. M. Abdul Hameed, ”SVM based feature set analysis in dynamic malayalam handwritten character recognition,” 2015 IEEE International Conference on Signal and Image Processing Applications (ICSIPA), Kuala Lumpur, 2015, pp. 238-243.
    28. S. M. Obaidullah, N. Das, C. Halder and K. Roy, ”Indic script identification from handwritten document images — An unconstrained
    29. block-level approach,” 2015 IEEE 2nd International Conference on Recent Trends in Information Systems (ReTIS), Kolkata, 2015, pp.
    30. -218.
    31. P. K. Singh, S. K. Dalal, R. Sarkar and M. Nasipuri, ”Page-level script identification from multi-script handwritten documents,” Proceedings of the 2015 Third International Conference on Computer, Communication, Control and Information Technology (C3IT), Hooghly, 2015, pp. 1-6.
    32. A. T. Jia, Y. Ayappally and K. Syama, ”Malayalam OCR: N-gram approach using SVM classifier,” 2013 International Conference on
    33. Advances in Computing, Communications and Informatics (ICACCI), Mysore, 2013, pp. 1799-1803.
    34. P-ARTS Kayyezhuthu - https://tinyurl.com/ydecjnfy
  • Downloads

  • How to Cite

    James, A., P V, R., & Saravanan, C. (2018). Handwritten Malayalam Character Recognition using Regional Zone with Structural Features. International Journal of Engineering and Technology, 7(4), 4629-4636. https://doi.org/10.14419/ijet.v7i4.12551