How to use PDFTextExtractor on iTextSharp

I want to get text from a pdf file using iTextSharp. However, I could not use the PDFTextExtractor, as in the itextsharp (itext) JAVA library. I need the readPDFOffline class to return the contents of a file. I will give the pseudo below so that you understand well what I want.

private line readPDFOffline (line fileUri);
read the PDF,
get the text content of this pdf; *
save content to the string contentOfflineFile; return contentOfflineFile;

I would like to do * part of the code

+3
source share
1 answer

PdfTextExtractor is present in the latest versions of iTextSharp available here .

PDF . , , , , - OCR. PdfTextExtractor . , .

, , :

  • ... -. OCR .
  • "" - . , OCR.
  • "", . OCR .

OCR: . Google Code, .

+2

Source: https://habr.com/ru/post/1779657/


All Articles