Create a configuration file (for example, "letters") in the tessdata / configs directory - usually /usr/share/tesseract/tessdata/configs
or
/usr/share/tesseract-ocr/tessdata/configs
And add this line to the configuration file:
tessedit_char_whitelist abcdefghijklmnopqrstuvwxyz
... or maybe [az] works .. dunno :-)
Then call tesseract similar to this:
tesseract input.tif output nobatch letters
This will limit tesseract to only recognize the characters you need.
Blomman Jun 06 '10 at 6:08 2010-06-06 06:08
source share