I used an external conversion utility (pdftotext from Xpdf) and called it from R. All of my files had a master table with the required information.
Set the path to pdftotext.exe and convert each PDF to text:
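library(stringr)

# Path to the Xpdf pdftotext executable
exeFile <- "C:/Projects/xpdfbin-win-3.04/bin64/pdftotext.exe"

# pdfFracList holds the PDF file names; reportDir is the folder that contains them
for (i in seq_along(pdfFracList)) {
  # Drop the ".pdf" extension to get the base file name
  fileNumber <- str_sub(pdfFracList[i], start = 1, end = -5)
  pdfSource <- paste0(reportDir, "/", fileNumber, ".pdf")
  txtDestination <- paste0(reportDir, "/", fileNumber, ".txt")
  print(paste0("File number ", i, ", Processing file ", pdfSource))
  # -table runs pdftotext in table mode, which is intended for tabular data
  system(paste(exeFile, "-table", pdfSource, txtDestination, sep = " "), wait = TRUE)
}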
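If any of the paths contain spaces, pasting them into one command string as in the loop above will likely break the call. A possible variation, not what I originally ran, is to quote each path and use `system2()` so the exit status can be checked. The helper names `build_pdftotext_args` and `convert_pdf` are made up for this sketch, and it assumes the output should sit next to the PDF with a `.txt` extension.

```r
# Hypothetical helper: build the pdftotext arguments for one PDF.
# Assumes the text file goes next to the PDF, with ".pdf" swapped for ".txt".
build_pdftotext_args <- function(pdfSource) {
  txtDestination <- sub("\\.pdf$", ".txt", pdfSource, ignore.case = TRUE)
  # shQuote() protects paths that contain spaces
  c("-table", shQuote(pdfSource), shQuote(txtDestination))
}

# Hypothetical wrapper: run the conversion and return TRUE on exit status 0.
convert_pdf <- function(exeFile, pdfSource) {
  status <- system2(exeFile, args = build_pdftotext_args(pdfSource))
  status == 0
}
```

Inside the loop you could then call `convert_pdf(exeFile, pdfSource)` and print a message for any file where it returns `FALSE`.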
Alastair Muir, Mar 07 '16 at 23:08