International Journal of Applied Information Systems |
Foundation of Computer Science (FCS), NY, USA |
Volume 4 - Number 5 |
Year of Publication: 2012 |
Authors: Mamatha H.r, Srikantamurthy K |
10.5120/ijais12-450704 |
Mamatha H.r, Srikantamurthy K . Morphological Operations and Projection Profiles based Segmentation of Handwritten Kannada Document. International Journal of Applied Information Systems. 4, 5 ( October 2012), 13-19. DOI=10.5120/ijais12-450704
Segmentation is an important task of any Optical Character Recognition (OCR) system. It separates the image text documents into lines, words and characters. The accuracy of OCR system mainly depends on the segmentation algorithm being used. Segmentation of handwritten text of some Indian languages like Kannada, Telugu, Assamese is difficult when compared with Latin based languages because of its structural complexity and increased character set. It contains vowels, consonants and compound characters. Some of the characters may overlap together. Despite several successful works in OCR all over the world, development of OCR tools in Indian languages is still an ongoing process. Character segmentation plays an important role in character recognition because incorrectly segmented characters are unlikely to be recognized correctly. In this paper, a segmentation scheme for segmenting handwritten Kannada scripts into lines, words and characters using morphological operations and projection profiles is proposed. The method was tested on totally unconstrained handwritten Kannada scripts, which pays more challenge and difficulty due to the complexity involved in the script. Usage of the morphology made extracting text lines efficient by an average extraction rate of 94. 5% . Because of the varying inter and intra word gaps an average segmentation rate of 82. 35% and 73. 08% for words and characters respectively is obtained.