How to make tesseract give appropriate results in the presence of noise?

I am using tesseract 3.0.0 and I ran into the following problem:

When there is something too small to recognize tesseract, it seems to merge with other fragments. As a result, nothing returns.

The figure below shows 3 cases. Only a rectangle with a dashed line is passed to tesseract. On the rectangle, the result (V over T means a new line).

The last case is one problem. Is there a way to improve tesseract in such situations?

enter image description here

+3
source share
1 answer

, Tesseract ( Document Analysis, OCR). , , OCR , , , , -, . OCR ares , .

Tesseract , Tesseract , , , , .

, 3.0, , , , , , .

- OCRopus, , , - Document Analisys (aka Segmentation) OCR. Tesseract OCR . OCR ( ) Tesseract .

:

  • , , Tesseract. , , , .
  • ckeck OCRopus , . , , OCRopus + Tesseract .
  • , , , OCR, ABBYY. , , OCR, , , .

: ABBYY

+5

Source: https://habr.com/ru/post/1790948/


All Articles