I am looking for OCR software that displays an HTML overlay on an image. I am currently using an unnamed product. It has an OCR function that creates an embedded OCR layer for a PDF document that contains images.
The built-in OCR is very convenient: it lets you search a PDF document with images for text. The text can also be selected directly in the document, and the OCR text is aligned with the main image. Unfortunately, I cannot export or store the embedded OCR from the unnamed product.
Is there any other software that can run OCR and export the embedded result? I am particularly interested in exporting to HTML consisting of positioned paragraphs that are aligned with the main image.
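To illustrate the kind of HTML output I mean, here is a minimal Python sketch. It is only an illustration of the target format, not the output of any existing tool: the function name `render_overlay`, the file name `scan.png`, and the `(text, left, top, width, height)` tuple layout are all assumptions I made up for this example. It also assumes the OCR step already produced paragraph bounding boxes in image pixels.

```python
from html import escape


def render_overlay(image_src, width, height, paragraphs):
    """Return an HTML page with paragraphs positioned over an image.

    paragraphs is a list of (text, left, top, box_width, box_height)
    tuples in pixels, measured from the image's top-left corner.
    This tuple layout is a hypothetical format for illustration only.
    """
    parts = [
        "<!DOCTYPE html>",
        '<html><head><meta charset="utf-8"></head><body>',
        # The relative container gives the absolute paragraphs their origin.
        f'<div style="position:relative;width:{width}px;height:{height}px">',
        f'<img src="{escape(image_src, quote=True)}" '
        f'width="{width}" height="{height}" alt="">',
    ]
    for text, left, top, box_w, box_h in paragraphs:
        style = (
            f"position:absolute;left:{left}px;top:{top}px;"
            f"width:{box_w}px;height:{box_h}px;margin:0;"
            # Transparent text stays selectable and searchable in the
            # browser while the scanned image remains visible underneath.
            "color:transparent"
        )
        parts.append(f'<p style="{style}">{escape(text)}</p>')
    parts.append("</div></body></html>")
    return "\n".join(parts)


if __name__ == "__main__":
    # Made-up sample data standing in for real OCR results.
    sample = [
        ("First paragraph of the scanned page.", 40, 60, 520, 48),
        ("Second paragraph, further down.", 40, 130, 520, 48),
    ]
    print(render_overlay("scan.png", 600, 800, sample))
```

Opening the printed page next to `scan.png` should show the image with invisible text that can be selected and found with the browser's search, which is the behaviour I get inside the PDF today.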
See also:
https://stackoverflow.com/questions/11404805/ocr-and-the-location-of-the-image-where-the-scanned-document-came-from