Tesseract - Entire line output

Posted 2019-08-20 04:30

Question:

I am trying to OCR a few tables using Tesseract. The tables have the following format:

Item One name                       Item One category
(Item description if any)

Item Two name                       Item Two category
(Item description if any)

There is some horizontal space between the name and the category. The output produced looks like this:

Item One name
(Item description if any)

Item Two name
(Item description if any)


Item One category

Item Two category

Is there a way to get each line output in full (name followed by category), instead of this column-wise output where one column is printed below the other?

I am running Tesseract from the command line with a simple command:

tesseract ~/Desktop/imagename.jpg out

Answer 1:

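Try a different page segmentation mode (PSM), such as 4 (a single column of text of variable sizes) or 6 (a single uniform block of text). The default mode segments the page automatically and can treat the gap between name and category as a column boundary; modes 4 and 6 assume a single column or block, so each line should be read across its full width.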
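
As a rough sketch of how that could look with the command from the question: the shell function below (ocr_with_psm is just an illustrative name, not part of Tesseract) passes the mode through the --psm option and uses stdout as the output base, so the text is printed instead of being written to out.txt. It assumes a Tesseract build that accepts --psm; older 3.x releases spell the option -psm.

```shell
# Run Tesseract on one image with a given page segmentation mode and
# print the recognized text to the terminal.
# ocr_with_psm is a hypothetical helper name used only for illustration.
ocr_with_psm() {
    image=$1
    psm=$2
    # "stdout" as the output base prints the text instead of writing a file.
    # Assumption: the installed Tesseract accepts --psm (older 3.x uses -psm).
    tesseract "$image" stdout --psm "$psm"
}

# Example calls against the image from the question (uncomment to run):
# ocr_with_psm ~/Desktop/imagename.jpg 4
# ocr_with_psm ~/Desktop/imagename.jpg 6
```

Running both modes on the same image and comparing the results is probably the quickest way to see which one keeps the name and category on the same line for this particular table layout.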

Tags: ocr, tesseract
