On 29.05.2014 at 18:51, John Hewson <j...@jahewson.com> wrote:

>> # splitting files (e.g. remove no longer needed resources)
>
> Each page has its own Resources dictionary, so it shouldn't be too difficult.
> One thing to watch out for is the "page tree" which allows pages to
> inherit resources from each other, this is handled as PDPageNode but it's
> kind of messy.
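To make the inheritance point concrete: in the PDF page tree, a page without its own Resources entry uses the dictionary of its nearest ancestor node that has one. A rough sketch of resolving that before a page is moved to another document could look like the following. It uses a toy model only; `TreeNode` and `InheritedResources` are hypothetical names for illustration and are not PDFBox classes.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Toy stand-in for a page-tree node. Hypothetical type, NOT the PDFBox API.
final class TreeNode {
    final TreeNode parent;                // null for the root of the page tree
    final Map<String, String> resources;  // resource name -> object reference; null if the node defines none

    TreeNode(TreeNode parent, Map<String, String> resources) {
        this.parent = parent;
        this.resources = resources;
    }
}

final class InheritedResources {
    // Return a private copy of the dictionary the page effectively uses:
    // its own if present, otherwise the one of the nearest ancestor.
    static Map<String, String> resolve(TreeNode page) {
        for (TreeNode n = page; n != null; n = n.parent) {
            if (n.resources != null) {
                return new LinkedHashMap<>(n.resources);
            }
        }
        return new LinkedHashMap<>(); // nothing defined anywhere up the tree
    }

    public static void main(String[] args) {
        Map<String, String> shared = new LinkedHashMap<>();
        shared.put("F1", "12 0 R");
        TreeNode root = new TreeNode(null, shared);
        TreeNode page = new TreeNode(root, null); // page relies on the inherited dictionary
        System.out.println(resolve(page));
    }
}
```

Copying the resolved dictionary onto the page makes the page self-contained, so it can be imported elsewhere without dragging its old parent nodes along.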

Thanks for the hint. Splitting and merging are somewhat similar, as splitting is typically done by creating a new document and importing the needed pages into it. With the current code this might lead to duplicate resources.

>
>> # merging files (e.g. avoid duplicating resources)
>
> Sounds like the files are pretty similar, is this actually an overlay? Or are
> you wanting to insert entire pages?

It's merging individual files together, inserting entire pages. Although the files are created individually, they share some common elements like company logos or fonts.

>
> I imagine you probably want to implement both these features at the COS level
> rather than the PD level, as it's pretty low-level processing.
>

It will involve a lot of COS processing. I haven't decided yet whether it will sit on top of COS or PD. Typically we encourage people to use PD, so I tend to start from there and dig down internally as needed. WDYT?

> -- John
>
>> On 29 May 2014, at 00:39, Maruan Sahyoun <sahy...@fileaffairs.de> wrote:
>>
>> Hi,
>>
>> for a current project I need to work on enhancing PDFBox for
>>
>> # splitting files (e.g. remove no longer needed resources)
>> # merging files (e.g. avoid duplicating resources)
>> # page handling (adding/removing individual pages with resource handling)
>> # enhancements to forms handling (pre fill XFA forms - partially done,
>> enhancing AP generation)
>>
>> Is someone else working on something similar?
>>
>> BR
>>
>> Maruan
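
As an illustration of the duplicate-resource concern in this thread, here is a minimal sketch of one possible approach: key every imported resource by a digest of its content and reuse the existing object when the same content arrives again. It is a toy model under stated assumptions, not PDFBox code. `DedupingTarget` and `importResource` are hypothetical names, and a resource is reduced to a plain byte array, whereas a real font or image is an object graph that would need a canonical serialization before hashing.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Toy target document that stores each distinct resource only once.
// Hypothetical type for illustration, NOT part of PDFBox.
final class DedupingTarget {
    private final List<byte[]> objects = new ArrayList<>();
    private final Map<String, Integer> idByDigest = new HashMap<>();

    // Returns the id of the stored object; identical content maps to the same id.
    int importResource(byte[] content) {
        String key = sha256Hex(content);
        Integer existing = idByDigest.get(key);
        if (existing != null) {
            return existing;              // already imported, reuse it
        }
        objects.add(content.clone());     // first occurrence, store a copy
        int id = objects.size() - 1;
        idByDigest.put(key, id);
        return id;
    }

    int objectCount() {
        return objects.size();
    }

    private static String sha256Hex(byte[] data) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            StringBuilder sb = new StringBuilder();
            for (byte b : md.digest(data)) {
                sb.append(String.format("%02x", b & 0xff));
            }
            return sb.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("SHA-256 not available", e);
        }
    }

    public static void main(String[] args) {
        DedupingTarget target = new DedupingTarget();
        // The same logo bytes arriving from two separately created files.
        int a = target.importResource("logo-bytes".getBytes(StandardCharsets.UTF_8));
        int b = target.importResource("logo-bytes".getBytes(StandardCharsets.UTF_8));
        System.out.println(a == b);
        System.out.println(target.objectCount());
    }
}
```

Whether such a lookup would live at the COS or the PD level is exactly the open question above; the sketch only shows the bookkeeping, which is the same in either case.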