Is there a good PDF to XHTML strict converter

This is basically all in the name, I need to take a bunch of large PDF files and have them in XHTML 1.0 strict, close is good enough, then I can clear it. Thanks

+3
source share
1 answer

This is a complex query because it depends on the PDF itself (and how it was created), whether it can be done or not. As a first attempt, I tried using the Adobe PDF PDF to HTML converter

http://www.adobe.com/products/acrobat/access_onlinetools.html

and then try to fix the HTML after the fact with something like tidy

http://tidy.sourceforge.net/

PDF , , - , , , JPG, - OCR PDF .

, PDF , , , , , , . , / .., JPG/GIF HTML-, , , , .

+2

Source: https://habr.com/ru/post/1704535/


All Articles