Home            Contact us            FAQs
    
      Journal Home      |      Aim & Scope     |     Author(s) Information      |      Editorial Board      |      MSP Download Statistics

     Research Journal of Applied Sciences, Engineering and Technology


Separation of Text Components from Complex Colored Images

G. Gayathri Devi and C.P. Sumathi
Department of Computer Science, SDNB Vaishnav College for Women, Chennai, Tamil Nadu, India
Research Journal of Applied Sciences, Engineering and Technology  2014  4:556-564
http://dx.doi.org/10.19026/rjaset.8.1005  |  © The Author(s) 2014
Received: April ‎22, ‎2014  |  Accepted: ‎July ‎01, ‎2014  |  Published: July 25, 2014

Abstract

The objective of this study is to project a new methodology for text separation in an image. Gamma Correction Method is applied as a preprocess technique to suppress non text regions and retain text regions. Text Segmentation is achieved by applying Positional Connected Component Labeling, Text Region Extraction, Text Line Separation, Separation of Touching Text and Separation of Text Components algorithms. At last, the details of each word’s and the line’s starting text component position are stored in a text file. Experiments are conducted on various images from the datasets collected and tagged by the ICDAR Robust Reading Dataset Collection Team. It is observed that the proposed method has an average recall rate of 97.5% on separation of text components in an image.

Keywords:

Connected components, gamma correction method, segmentation, text extraction, text line, text separation , thinning,


References

  1. Chethan, H.K. and G.H. Kumar, 2010. Graphics separation and skew correction for mobile captured documents and comparative analysis with existing methods. Int. J. Comput. Appl., 7(3): 42-47.
    CrossRef    
  2. Coates, A., B. Carpenter, C. Case and S. Satheesh, 2011. Text detection and character recognition in scene images with unsupervised feature learning. Proceeding of International Conference on Document Analysis and Recognition (ICDAR, 2011), pp: 440-445.
    CrossRef    
  3. Devi, G.G. and C.P. Sumathi, 2014. Positional connected component labeling algorithm. Indian J. Sci. Technol., 7(3): 306-311.
  4. Gonzalez, A., L.M. Bergasa, J.J. Yebes and S. Bronte, 2012. Text location in complex images. Proceeding of 21st International Conference on Pattern Recognition (ICPR, 2012). Tsukuba, Japan, pp: 617-620.
  5. Khedekar, S., V. Ramanaprasad, S. Setlur and V. Govindaraju, 2003. Text- image separation in Devanagari documents. Proceeding of the 7th International Conference on Document Analysis and Recognition, pp: 1265-1269.
    CrossRef    
  6. Neumann, L. and J. Matas, 2012. Real-time scene text localization and recognition. Proceeding of IEEE Conference on Computer Vision and Pattern Recognition (CVPR, 2012), pp: 3538-3545.
    CrossRef    
  7. Patil, U. and M. Begum, 2012. Word level handwritten and printed text separation based on shape features. Int. J. Emerg. Technol. Adv. Eng., 2(4): 590-594.
  8. Peng, X., S. Setlur, V. Govindaraju and R. Sitaram, 2013. Handwritten text separation from annotated machine printed documents using Markov Random Fields. Int. J. Doc. Anal. Recog., 16: 1-16, DOI 10.1007/s10032-011-0179.
  9. Phan, T.Q., P. Shivakumara and C.L. Tan, 2012. Detecting text in the real world. Proceeding of the 20th ACM International Conference on Multimedia (MM '12), pp: 765-768.
    CrossRef    PMid:22368056    
  10. Shi, Z. and V. Govindaraju, 2004. Line separation for complex document images using fuzzy runlength. Proceeding of 1st International Workshop on Document Image Analysis for Libraries, pp: 306-312.
    PMCid:PMC4717027    
  11. Sumathi, C.P. and G.G. Devi, 2014. Automatic text extraction from complex colored images using gamma correction method. J. Comput. Sci., 10(4): 706-715.
    CrossRef    
  12. Yao, C., X. Bai, W. Liu, Y. Ma and Z. Tu, 2012. Detecting texts of arbitrary orientations in natural images. Proceeding of IEEE Conference on Computer Vision and Pattern Recognition (CVPR, 2012), pp: 1083-1090.

Competing interests

The authors have no competing interests.

Open Access Policy

This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Copyright

The authors have no competing interests.

ISSN (Online):  2040-7467
ISSN (Print):   2040-7459
Submit Manuscript
   Information
   Sales & Services
Home   |  Contact us   |  About us   |  Privacy Policy
Copyright © 2024. MAXWELL Scientific Publication Corp., All rights reserved