Often in the process of OCR, an image file is essentially cut into segments, and each character is recgnised as a segment each. For instance,
has to be transformed to something like
Also, is there any algorithm for Asian languages like Telugu readily available for this purpose? If not, how is this done for English?
It can be easily done using OpenCV. Below is a sample code:
Output will look like below: