Tesseract: How to run tesseract with multiple lang

2020-06-09 04:26发布

I have to analyzed a image which containing both English and Japanese texts. When I run tesseract by default (eng), some Japanese characters lost. Otherwise, if I run tesseract with japanese (-l jpn) some English characters lost (e.p. Email). How can I run one process which recognize both English and Japanese characters. Thanks.

标签： image-processing ocr tesseract

1条回答

Summer. ? 凉城

2楼-- · 2020-06-09 05:23

Since tesseract 3.02 it is possible to specify multiple languages for the -l parameter.

-l lang The language to use. If none is specified, English is assumed. Multiple languages may be specified, separated by plus characters. Tesseract uses 3-character ISO 639-2 language codes.

An example:

tesseract myscan.png out -l deu+eng

0人赞添加讨论(0) 举报

Tesseract: How to run tesseract with multiple lang

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间