Hi Rubesh

On 04.11.2011 06:36:41 Rubesh.Xavier wrote:
> Hi,
> I want a clarification. I have a requirement to merge the pdf documents, I 
> managed to do that via PDFBox, but the question is can we merge the pdf 
> files based on the bookmark or at the beginning of the file?

I don't know what you mean by "based on the bookmark". Would you please
explain in different words? If you want to insert your PDF at the
beginning, rather than at the end, then you simply have to switch the
order in which you merge your PDFs.

> Also I know we can split the documents, but can this be done again based 
> on bookmarks or taking into consideration some field value?

Of course, you can traverse the bookmark tree (called document outline
in PDF) and find the destination page.

- From the PDDocument, get to the catalog using getDocumentCatalog()
- From there, get to the document outline using getDocumentOutline()
- From there, you can navigate through the tree with
getFirstChild()/getLastChild() and
PDOutlineItem.getNextSibling()/getPreviousSibling()
- On PDOutlineItem, you can call findDestinationPage(PDDocument) to find
the page you're looking for.
- Then split according to the extracted page indices.

(The PrintBookmarks example might help you here)
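
To make the steps above a bit more concrete, here is a minimal sketch in
plain Java of the two pieces of logic involved: walking the outline tree
and turning the bookmark pages into split ranges. It does not call
PDFBox. SketchOutlineItem and BookmarkSplitPlanner are hypothetical names
of mine, and the pageIndex field stands in for whatever page index you
derive from findDestinationPage(PDDocument). In real code you would do
the same walk over PDOutlineItem.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeSet;

/** Hypothetical stand-in for an outline item; NOT a PDFBox class. */
final class SketchOutlineItem {
    final String title;
    final int pageIndex;            // assumed: 0-based destination page, -1 if none
    SketchOutlineItem firstChild;   // mirrors getFirstChild()
    SketchOutlineItem nextSibling;  // mirrors getNextSibling()

    SketchOutlineItem(String title, int pageIndex) {
        this.title = title;
        this.pageIndex = pageIndex;
    }

    SketchOutlineItem getFirstChild() { return firstChild; }
    SketchOutlineItem getNextSibling() { return nextSibling; }
}

final class BookmarkSplitPlanner {

    /** Depth-first walk: visit an item, then its children, then its siblings. */
    static void collectStartPages(SketchOutlineItem first, TreeSet<Integer> out) {
        for (SketchOutlineItem cur = first; cur != null; cur = cur.getNextSibling()) {
            if (cur.pageIndex >= 0) {
                out.add(cur.pageIndex);
            }
            collectStartPages(cur.getFirstChild(), out);
        }
    }

    /** Turns bookmark pages into {start, endExclusive} page ranges. */
    static List<int[]> splitRanges(TreeSet<Integer> starts, int pageCount) {
        List<int[]> ranges = new ArrayList<>();
        if (pageCount <= 0) {
            return ranges;
        }
        // Ignore destinations beyond the last page; always cut at page 0
        // so pages before the first bookmark are not lost.
        TreeSet<Integer> cuts = new TreeSet<>(starts.headSet(pageCount));
        cuts.add(0);
        Integer prev = null;
        for (int cut : cuts) {
            if (prev != null) {
                ranges.add(new int[] {prev, cut});
            }
            prev = cut;
        }
        ranges.add(new int[] {prev, pageCount});
        return ranges;
    }

    public static void main(String[] args) {
        SketchOutlineItem a = new SketchOutlineItem("Chapter A", 0);
        SketchOutlineItem b = new SketchOutlineItem("Chapter B", 5);
        a.firstChild = new SketchOutlineItem("Section A.1", 2);
        a.nextSibling = b;

        TreeSet<Integer> starts = new TreeSet<>();
        collectStartPages(a, starts);
        for (int[] r : splitRanges(starts, 8)) {
            System.out.println("pages " + r[0] + " to " + (r[1] - 1));
        }
    }
}
```

Each range then corresponds to one output document. If you only want to
split at top-level bookmarks, skip the recursive call on getFirstChild().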

If by "field value" you mean fields from an AcroForm, I guess that can
be done, too, but I imagine this could be a bit more complicated.

> Also add bookmarks at relative position?

You simply have to keep track of the pages you add and update the
document outline accordingly. Take a look at the sources or javadoc of
the package org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline
and at the CreateBookmarks example.
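
The "keep track of the pages" part boils down to simple arithmetic,
roughly sketched below in plain Java. The class and method names are
hypothetical and no PDFBox calls are involved. I am assuming you know
the page count of each source document and merge them in array order.

```java
/** Hypothetical helper: maps pages of source documents to merged page indices. */
final class MergedBookmarkOffsets {

    /** For each document (in merge order), the index of its first page in the merged file. */
    static int[] startOffsets(int[] pageCounts) {
        int[] offsets = new int[pageCounts.length];
        int running = 0;
        for (int i = 0; i < pageCounts.length; i++) {
            offsets[i] = running;
            running += pageCounts[i];
        }
        return offsets;
    }

    /** Page a bookmark should point to after merging (all indices 0-based). */
    static int mergedPageIndex(int[] offsets, int docIndex, int pageInDoc) {
        return offsets[docIndex] + pageInDoc;
    }

    public static void main(String[] args) {
        int[] offsets = startOffsets(new int[] {3, 5, 2});
        // A bookmark on the third page (index 2) of the second document:
        System.out.println(mergedPageIndex(offsets, 1, 2));
    }
}
```

If you change the merge order (for example to put a document at the
beginning), recompute the offsets from the new order before creating the
outline items.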

> I have seen various issues in Jira and have searched the website, samples 
> etc., but could not find anything specific for these tasks, your 
> suggestions on this will be of great help.
> We have some requirements based on this and need to decide on this at the 
> earliest. Thanks.
> 
> 
> Regards, 
> Rubesh 
> ----- Forwarded by Rubesh M Xavier/AABS/GSS/ErnstYoung/IN on 11/04/2011 
> 11:01 AM -----
> 
> From: "Jeremias Maerki (Resolved) (JIRA)" <[email protected]>
> To: [email protected]
> Date: 11/03/2011 07:36 PM
> Subject: [jira] [Resolved] (PDFBOX-1158) Functionality in PDFBOX
> 
> 
> 
> 
>      [ https://issues.apache.org/jira/browse/PDFBOX-1158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
> 
> Jeremias Maerki resolved PDFBOX-1158.
> -------------------------------------
> 
>     Resolution: Invalid
> 
> Please don't use JIRA to ask questions. Questions should be sent to 
> [email protected].
> 
> 1a. Apache PDFBox cannot directly create PostScript from PDF. But it 
> supports painting PDF pages to Graphics2D objects. That means you can use 
> the print function to print a PDF to a PostScript printer driver (set to 
> output to a file if needed). Or you can use PSDocumentGraphics2D from 
> Apache XML Graphics Commons to create PostScript files but the former is 
> easier. http://pdfbox.apache.org/commandlineutilities/PrintPDF.html can 
> serve as a starting point.
> 
> 1b. Apache PDFBox cannot create PDF from PostScript because it lacks a 
> complete PostScript interpreter to start with. You may need to look at 
> GhostScript for that functionality (available under the GPL or a 
> commercial license). GhostScript can do PDF->PS, too.
> 
> 2. See http://pdfbox.apache.org/commandlineutilities/PDFMerger.html
> 
> Any follow-up questions to [email protected], please.
>  
> > Functionality in PDFBOX
> > -----------------------
> >
> >                 Key: PDFBOX-1158
> >                 URL: https://issues.apache.org/jira/browse/PDFBOX-1158
> >             Project: PDFBox
> >          Issue Type: Test
> >            Reporter: Rubesh MX
> >              Labels: Feature
> >
> > HI, I want to know if the following features are possible with PDFBox; 
> 1. Convert PDF to postscript and vice versa 2. Append document at the 
> beginning of the file and append doc as per the bookmark; could you please 
> confirm on this. I could not find any details in the website/samples for 
> this.
> 
> --
> This message is automatically generated by JIRA.
> If you think it was sent incorrectly, please contact your JIRA 
> administrators: 
> https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
> For more information on JIRA, see: http://www.atlassian.com/software/jira




Jeremias Maerki
