Convert docx to pdf in java

Question

Convert docx to pdf in java

I am trying to convert a docx file that contains a table and images to pdf format.

I searched everywhere, but did not find the right solution, please give a correct and correct solution:

here is what i tried:

 import java.io.File; import java.io.FileInputStream; import java.io.FileNotFoundException; import java.io.FileOutputStream; import java.io.IOException; import java.io.InputStream; import java.io.OutputStream; import org.apache.poi.xwpf.converter.pdf.PdfConverter; import org.apache.poi.xwpf.converter.pdf.PdfOptions; import org.apache.poi.xwpf.usermodel.XWPFDocument; public class TestCon { public static void main(String[] args) { TestCon cwoWord = new TestCon(); System.out.println("Start"); cwoWord.ConvertToPDF("D:\\Test.docx", "D:\\Test1.pdf"); } public void ConvertToPDF(String docPath, String pdfPath) { try { InputStream doc = new FileInputStream(new File(docPath)); XWPFDocument document = new XWPFDocument(doc); PdfOptions options = PdfOptions.create(); OutputStream out = new FileOutputStream(new File(pdfPath)); PdfConverter.getInstance().convert(document, out, options); System.out.println("Done"); } catch (FileNotFoundException ex) { System.out.println(ex.getMessage()); } catch (IOException ex) { System.out.println(ex.getMessage()); } } }

An exception:

 Exception in thread "main" java.lang.IllegalAccessError: tried to access method org.apache.poi.util.POILogger.log(ILjava/lang/Object;)V from class org.apache.poi.openxml4j.opc.PackageRelationshipCollection at org.apache.poi.openxml4j.opc.PackageRelationshipCollection.parseRelationshipsPart(PackageRelationshipCollection.java:313) at org.apache.poi.openxml4j.opc.PackageRelationshipCollection.<init>(PackageRelationshipCollection.java:162) at org.apache.poi.openxml4j.opc.PackageRelationshipCollection.<init>(PackageRelationshipCollection.java:130) at org.apache.poi.openxml4j.opc.PackagePart.loadRelationships(PackagePart.java:559) at org.apache.poi.openxml4j.opc.PackagePart.<init>(PackagePart.java:112) at org.apache.poi.openxml4j.opc.PackagePart.<init>(PackagePart.java:83) at org.apache.poi.openxml4j.opc.PackagePart.<init>(PackagePart.java:128) at org.apache.poi.openxml4j.opc.ZipPackagePart.<init>(ZipPackagePart.java:78) at org.apache.poi.openxml4j.opc.ZipPackage.getPartsImpl(ZipPackage.java:239) at org.apache.poi.openxml4j.opc.OPCPackage.getParts(OPCPackage.java:665) at org.apache.poi.openxml4j.opc.OPCPackage.open(OPCPackage.java:274) at org.apache.poi.util.PackageHelper.open(PackageHelper.java:39) at org.apache.poi.xwpf.usermodel.XWPFDocument.<init>(XWPFDocument.java:121) at test.TestCon.ConvertToPDF(TestCon.java:31) at test.TestCon.main(TestCon.java:25)

My requirement is to create Java code to convert existing docx to pdf with the correct format and alignment.

Please suggest.

Used banks:

+10

java ms-word pdf apache-poi

user1999397 Apr 12 '17 at 7:58

source share

6 answers

In addition to VivekRatanSinha's answer, I would like to publish the full code and the necessary banks for people who need it in the future.

Code:

 import java.io.File; import java.io.FileInputStream; import java.io.FileOutputStream; import java.io.IOException; import java.io.InputStream; import java.io.OutputStream; import org.apache.poi.xwpf.converter.pdf.PdfConverter; import org.apache.poi.xwpf.converter.pdf.PdfOptions; import org.apache.poi.xwpf.usermodel.XWPFDocument; public class WordConvertPDF { public static void main(String[] args) { WordConvertPDF cwoWord = new WordConvertPDF(); cwoWord.ConvertToPDF("D:/Test.docx", "D:/Test.pdf"); } public void ConvertToPDF(String docPath, String pdfPath) { try { InputStream doc = new FileInputStream(new File(docPath)); XWPFDocument document = new XWPFDocument(doc); PdfOptions options = PdfOptions.create(); OutputStream out = new FileOutputStream(new File(pdfPath)); PdfConverter.getInstance().convert(document, out, options); } catch (IOException ex) { System.out.println(ex.getMessage()); } } }

and JARS:

Enjoy :)

+17

user1999397 Apr 12 '17 at 13:46

source share

I am using this code.

 private byte[] toPdf(ByteArrayOutputStream docx) { InputStream isFromFirstData = new ByteArrayInputStream(docx.toByteArray()); XWPFDocument document = new XWPFDocument(isFromFirstData); PdfOptions options = PdfOptions.create(); //make new file in c:\temp\ OutputStream out = new FileOutputStream(new File("c:\\tmp\\HelloWord.pdf")); PdfConverter.getInstance().convert(document, out, options); //return byte array for return in http request. ByteArrayOutputStream pdf = new ByteArrayOutputStream(); PdfConverter.getInstance().convert(document, pdf, options); document.write(pdf); document.close(); return pdf.toByteArray(); }

0

Jesús Sánchez Mar 6 '18 at 1:01

source share

You must add these Maven dependencies.

  <dependency> <groupId>org.apache.poi</groupId> <artifactId>poi</artifactId> <version>4.0.1</version> </dependency> <dependency> <groupId>org.apache.poi</groupId> <artifactId>poi-ooxml</artifactId> <version>4.0.1</version> </dependency> <dependency> <groupId>org.apache.poi</groupId> <artifactId>poi-scratchpad</artifactId> <version>4.0.1</version> </dependency> <dependency> <groupId>org.apache.poi</groupId> <artifactId>poi-ooxml-schemas</artifactId> <version>4.0.1</version> </dependency> <dependency> <groupId>org.apache.poi</groupId> <artifactId>poi-excelant</artifactId> <version>4.0.1</version> </dependency> <dependency> <groupId>org.apache.poi</groupId> <artifactId>poi-examples</artifactId> <version>4.0.1</version> </dependency> <dependency> <groupId>fr.opensagres.xdocreport</groupId> <artifactId>org.apache.poi.xwpf.converter.core</artifactId> <version>1.0.6</version> </dependency> <dependency> <groupId>fr.opensagres.xdocreport</groupId> <artifactId>org.apache.poi.xwpf.converter.pdf</artifactId> <version>1.0.6</version> </dependency>

0

iraqi eagle Dec 23 '18 at 9:19

source share

I did a lot of research and found Documents4j is the best free API for converting docx to pdf. Alignment, font everthing documents4j does a good job.

Maven Dependencies:

 <dependency> <groupId>com.documents4j</groupId> <artifactId>documents4j-local</artifactId> <version>1.0.3</version> </dependency> <dependency> <groupId>com.documents4j</groupId> <artifactId>documents4j-transformer-msoffice-word</artifactId> <version>1.0.3</version> </dependency>

Use the code below to convert docx to pdf.

 import java.io.File; import java.io.FileInputStream; import java.io.FileOutputStream; import java.io.InputStream; import java.io.OutputStream; import com.documents4j.api.DocumentType; import com.documents4j.api.IConverter; import com.documents4j.job.LocalConverter; public class Document4jApp { public static void main(String[] args) { File inputWord = new File("Tests.docx"); File outputFile = new File("Test_out.pdf"); try { InputStream docxInputStream = new FileInputStream(inputWord); OutputStream outputStream = new FileOutputStream(outputFile); IConverter converter = LocalConverter.builder().build(); converter.convert(docxInputStream).as(DocumentType.DOCX).to(outputStream).as(DocumentType.PDF).execute(); outputStream.close(); System.out.println("success"); } catch (Exception e) { e.printStackTrace(); } } }

0

Shakthifuture Feb 12 '19 at 10:37

source share

Thanks for sharing. This works great for me. I recommend you guys go to Documents4j. They did a clean job of converting Docx to PDF. Mine is pretty complicated because it includes graphics, images and some Chinese fonts. For several days, I searched for potential converters until I came across this article about StackOverflow. You guys will really save my life. haha

0

Reach009 Jun 04 '19 at 2:23

source share

VivekRatanSinha · Accepted Answer · 2017-04-12T10:40:17+0000

You are missing some libraries.

I can run your code by adding the following libraries:

  Apache POI 3.15
     org.apache.poi.xwpf.converter.core-1.0.6.jar
     org.apache.poi.xwpf.converter.pdf-1.0.6.jar
     fr.opensagres.xdocreport.itext.extension-2.0.0.jar
     itext-2.1.7.jar
     ooxml-schemas-1.3.jar

I have successfully converted a 6-page Word document (.docx) with tables, images and various formatting.

Convert docx to pdf in java

More articles: