Improving Optical Character Recognition Techniques

 
 
 
  • Abstract
  • Keywords
  • References
  • PDF
  • Abstract


    Document text recognition uses a concept called OCR (optical character recognition),which is the recognition of printed or written text characters by a computer. This involves scanning a document containing text, and converting character by character to their digital form. Thus, it is defined as the process of digitizing a document image into its constituent characters. Equipment used to obtain clearer images for analysis are cameras and flatbed scanners. Even though it’s been out in the world since 1870, the OCR technology is yet to reach perfection. This demanding nature of Optical Character Recognition has made various researchers, industries and technology enthusiasts to divulge their attention to this field. In recent times one can notice a significant increase in the number of research organizations investing their time and effort in this field. In this research, the progress, different aspects and various issues revolving in this field have been summarized. The aim is to present a scrupulous overview of various proposals, advancements and discussions aimed at resolving various problems that arise in traditional OCR.

     

     


  • Keywords


    Text Recognition, OCR, Image Analysis, Photo Scanning, Scanned Image.

  • References


      [1] Image Text To Speech Conversion In The Desired Language By Translating With Raspberry Pi Rithika.H , B. Nithyasanthoshi, IEEE 2016

      [2] Image Preprocessing for Improving OCR Accuracy By WojciechBieniecki, Szymon Grabowski and WojciechRozenberg, IEEE July 2007

      [3] Real-Time Scene Text Localization and Recognition, Lukáš Neumann, Jiřı́ Matas IEEE 2012

      [4] Text Detection and Recognition in Imagery: A Survey Qixiang Ye, Member, IEEE and David Doermann, Fellow, IEEE, published July 2015

      [5] Scene text recognition with high performance CNN classifier and efficient word inference XinhaoLiu,TakahitoKawanishi,XiaomengWu,KunioKashino IEEE 2016

      [6] Video Text Extraction and Recognition: A Survey, Pooja, RenuDhir IEEE 2016

      [7] https://github.com/tesseract-ocr

      [8] https://en.wikipedia.org/wiki/Optical_character_recognition

      [9] https://pypi.python.org/pypi/pytesseract

      [10] Digital Image Processing Book By Rafael C. Gonzalez and Richard Eugene Woods.

      [11] T. Padmapriya and V. Saminadan, “Inter-cell Load Balancing technique for multi-class traffic in MIMO-LTE-A Networks”, International Journal of Electrical, Electronics and Data Communication (IJEEDC), ISSN: 2320- 2084, vol.3, no.8, pp. 22-26, Aug 2015.

      S.V.Manikanthan and T.Padmapriya “Recent Trends In M2m Communications In 4g Networks And Evolution Towards 5g”, International Journal of Pure and Applied Mathematics, ISSN NO:1314-3395, Vol-115, Issue -8, Sep 2017.

 

View

Download

Article ID: 12085
 
DOI: 10.14419/ijet.v7i2.24.12085




Copyright © 2012-2015 Science Publishing Corporation Inc. All rights reserved.