Abstract
Optical Character Recognition alludes to the methodology of taking images or photos of letters or typewritten content and changing over them into information that a machine can easily interpret, e.g. organizations and libraries taking physical duplicates of books, magazines, or other old printed material and utilizing OCR to put them into computers. Segmentation is the indispensable and most difficult part of OCR process, and it gets to be additionally difficult with handwritten text due to varieties in writing styles and presence of abnormalities. This paper shows a new strategy for the segmentation of conjuncts, and overlapping characters in Devanagari script on Hindi language. The proposed algorithm is focused around Cluster Detection technique and gives 95% correctness for segmenting touching, conjunct characters and 88% effectiveness for overlapping characters.

This publication has 9 references indexed in Scilit: