I am able to get word level confidence score using tesseract 4.0 through the command line. Interested to know if there is a way to get the character confidence too.
For word level confidence used the below command:
tesseract [Image name] outputbase --oem 1 -l eng --psm 8 tsv
Set
hocr_char_boxes to 1
in your config file. Or, at the command line, your updated command would be:Note the hocr output option and look in that file for ...
_wconf
, e.g.Let me know if this works for you, otherwise I'll just delete the answer.
Source: https://github.com/tesseract-ocr/tesseract/issues/1465#issuecomment-513139976