Text blocks positions and sizes detection in comma

2019-01-24 20:37发布

tesseract OCR have a command line interface, which allow us to recognize text from images with some parameters.

Input argumetns are imagename (path to image) outputbase (name of recognized text) and -psm pagesegmode parameters.

pagesegmode values are:
 0 = Orientation and script detection (OSD) only.
 1 = Automatic page segmentation with OSD.
 2 = Automatic page segmentation, but no OSD, or OCR
 3 = Fully automatic page segmentation, but no OSD. (Default)
 4 = Assume a single column of text of variable sizes.
 5 = Assume a single uniform block of vertically aligned text.
 6 = Assume a single uniform block of text.
 7 = Treat the image as a single text line.
 8 = Treat the image as a single word.
 9 = Treat the image as a single word in a circle.
 10 = Treat the image as a single character.
-l lang and/or -psm pagesegmode must occur before anyconfigfile.

But can it library write positions and sizes of recognized text blocks to the specific file or it is an internal information?

标签： ocr command-line-arguments textblock tesseract

1条回答

贼婆χ

2楼-- · 2019-01-24 21:20

Tesseract 3.0x supports a "hocr" command option, which produces a HTML-format output file consisting of recognized words and their coordinates. It does not have size/font info, though.

0人赞添加讨论(0) 举报

Text blocks positions and sizes detection in comma

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间