unicharset_extractor: command not found

2019-04-07 14:55发布

问题:

I want create new train data using tesseract. So follow step which mentioned in below website.

https://blog.cedric.ws/how-to-train-tesseract-301

I got below error while i execute Unicharset in OS X terminal.

Command:

unicharset_extractor eng.micrtest.exp.box

Error: -bash: unicharset_extractor: command not found

I have using below software versions

OS: OSX EI caption 10.11.1

tesseract 3.04.01 leptonica-1.72 libjpeg 8d : libpng 1.6.21 : libtiff 4.0.6 : lib 1.2.5

is this possible to execute unchaste_extractor command in OSx?

Thanks in advance.

回答1:

Problem is "Unicharset_extractor" not install in your system. So please install tesseract training tools

INSTALL TRAINING TOOL

brew install --with-training-tools tesseract

Please uninstall tesseract if you already installed it.

UNINSTALL TESSERACT

brew uninstall tesseract

NOTE: please read logs while install training tools and follow the steps which mentioned in logs.

It’ll take some more time to install the training tool so you’ll never quiet the terminal meantime.

Please install Homebrew before work out above command.



回答2:

You will need to build Tesseract training executables; they are separate from the regular Tesseract executable.

https://github.com/tesseract-ocr/tesseract/wiki/Compiling