Hi,

Yes, this is probably possible but this is several steps and there is no direct example.

You should first try text extraction (use the ExtractText command line application) to see whether the text appears, and would be easy to find / parse.

PDFBox also allows to split by page (try the PDFSplit command line application).

All this is also available programmatically, for text extraction see the ExtractTextSimple.java example in the example subdirectory of the source code download.

For splitting / joining use the PDPage elements directly and "addPage()" them to the destination file. Be careful not to close the source file before saving the destination file.

Tilman

Am 03.07.2020 um 01:23 schrieb A H:
Hello,
Before I explain my question, I want to apologize if I make mistakes when
writing, I speak little English.

I have 2 PDF files and each one contains a specific type of document. Each
file has hundreds of pages.

In file 1 each page is identified by a number. This number is unique for
each page. (Order number)
File 2 contains checklists. Each checklist can have 1 or more pages. Each
checklist relates to a purchase order in file 1.

What I need to achieve is to combine the two files or file. Each order form
must be followed by the corresponding checklist.

Is it possible to do this using PDFBox ?. If so, where can I get an example
of how to do it?
Beforehand thank you very much.



---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: users-h...@pdfbox.apache.org

Reply via email to