I am using Apache PDFBox to read a PDF document that has a bookmark-defined hierarchy. The hierarchy is in the form of a tree with content only at the sheet level.
Extract text between two sheet-level tabs using the following code:
Stripper.setStartBookmark(), Stripper.setEndBookmark(), Stripper.writeText()),
Instead, returns text throughout the page. In short, my problem is similar to the problem mentioned in this thread .
Is there a way to extract content between two bookmarks?
If so, what should be the change in my code?
source share