可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效，请关闭广告屏蔽插件后再试):

问题:

I am struggling with tesseract ocr on ios. Everything works fine but it is really slow. 2 - 3 seconds recogintion time for a single line of digits.

I am reading from a Video Stream.

I am using tesseract 3.01 with a custom training file for my font.

Here is what I do:

Setting up tesseract only to find numbers (0-9)

Shrink, Deskew and Binarize Image
use GetLines to find the line I want the text of
setRectangle to only recognize the line i want
getUTF8Text to get my text <- this alone takes 2-3 seconds

Are there any suggestions to speed up the process?

回答1:

I switched to the SVN of tesseract 3.02

After having some problems with crosscompiling,

I am down to about 1.2 - 0.8 Seconds for getUTF8Text and 0.3 - 0.2 seconds for image preprocessing.

I did some testing

Binarization and Shrinking by 0.7 gains 0.3 Seconds but costs you 0.1
chop_enable=0 has almost no effect on speed about a speed gain of 0.1 in average althoug it counts for 50% of the cpu usage if profiled

The main gain in usability was to optimize the code with setRectangle not calling getUTF8Text if I suspect the bounds to be incorrect. And some String postprocessing by checking the received string and apply some algorithm to eliminate the most common misinterpretations from tesseract.

maybe this is useful for someone else.

回答2:

One thing you might want to try is do it in the background. It will not improve the recognition speed, but by having the user do something else it might appear faster. This, of course, depends on the use case (I have one where this is a nice improvement as it seems very fast, while I just start sooner than expected). The trick is that ios apps are about perceived performance, not actual performance (although it naturally has a big impact on perceived performance).