chinese character recognition using Tesseract OCR

2019-03-13 03:06发布

I have been using Tesseract 3.0.2 OCR SDK for image text extraction. But if I use Chinese text images and pass through OCR then Tesseract doesn't provide me the Chinese characters instead of that I am getting numeric and english characters. But I need Chinese characters as displayed in the image I am using.

How can I achieve this? Is there any way I can obtain Chinese characters rather than any other characters?

标签： iphone ios ocr tesseract

1条回答

再贱就再见

2楼-- · 2019-03-13 03:33

You need to download chinese trained data (it will be a file like chi_sim.traineddata) and add it to your tessdata folder.

To download the file https://github.com/tesseract-ocr/tessdata/raw/master/chi_sim.traineddata

and use like this

Tesseract* tesseract= [[Tesseract alloc] initWithDataPath:@"tessdata" language:@"chi_sim"];

if you have any problem you can download my experiment with tessaract (with chinese language support) from https://github.com/aryansbtloe/ExperimentWithTesseract.git

I have tested this one...Hope you will find this useful.

0人赞添加讨论(0) 举报

chinese character recognition using Tesseract OCR

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间