How to represent:
- Create new image with paint (any size)
- Add letter A to this image
- Try to recognize -> tesseract will not find any letters
- Copy-paste this letter 5-6 times to this image
- Try to recognize -> tesseract will find all the letters
Why?
How to represent:
Why?
python code to do that configuration is like this:
the
--psm
flag defines the page segmentation mode.according to documentaion of tesseract,
10
means :so to recognize a single character you just need to use :
--psm 10
flag.Have you seen this?
https://code.google.com/p/tesseract-ocr/issues/detail?id=581
The bug list shows it as "no longer an issue".
baseApi.setVariable("tessedit_char_whitelist", "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz");
code before theinit Tesseract
You need to set Tesseract's page segmentation mode to "single character."
You must set the "page segmentation mode" to "single char".
For example, in Android you do the following: