> It will involve a lot of COS processing. I haven’t decided yet if it will sit
> on top of COS or PD. Typically we do encourage people to use PD so I tend to
> start from there and dig down internally as needed. WDYT?

Starting with PD and using COS where needed sounds reasonable. Ultimately you
don’t need a high-level API to do the manipulations which you’re interested in,
so COS should suffice, but PD might be quicker to get started with.

-- John

On 29 May 2014, at 23:25, Maruan Sahyoun <sahy...@fileaffairs.de> wrote:

>
> On 29.05.2014, at 18:51, John Hewson <j...@jahewson.com> wrote:
>
>>> # splitting files (e.g. remove no longer needed resources)
>>
>> Each page has its own Resources dictionary, so it shouldn't be too
>> difficult. One thing to watch out for is the "page tree" which allows
>> pages to inherit resources from each other, this is handled as PDPageNode
>> but it's kind of messy.
>
> thanks for the hint. Splitting and merging is somewhat similar as splitting
> is typically done by creating a new document and importing the needed pages
> into the newly created document. Using the current code this might lead to
> duplicate resources.
>
>>
>>> # merging files (e.g. avoid duplicating resources)
>>
>> Sounds like the files are pretty similar, is this actually an overlay? Or
>> are you wanting to insert entire pages?
>
> it’s merging individual files together inserting entire pages. Although the
> files are created individually they share some common elements like company
> logos or fonts.
>
>>
>> I imagine you probably want to implement both these features at the COS
>> level rather than the PD level, as it's pretty low-level processing.
>>
>
> It will involve a lot of COS processing. I haven’t decided yet if it will sit
> on top of COS or PD. Typically we do encourage people to use PD so I tend to
> start from there and dig down internally as needed. WDYT?
>
>
>> -- John
>>
>>> On 29 May 2014, at 00:39, Maruan Sahyoun <sahy...@fileaffairs.de> wrote:
>>>
>>> Hi,
>>>
>>> for a current project I need to work on enhancing PDFBox for
>>>
>>> # splitting files (e.g. remove no longer needed resources)
>>> # merging files (e.g. avoid duplicating resources)
>>> # page handling (adding/removing individual pages with resource handling)
>>> # enhancements to forms handling (pre fill XFA forms - partially done,
>>> enhancing AP generation)
>>>
>>> Is someone else working on something similar?
>>>
>>> BR
>>>
>>> Maruan
>
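
For reference, the page tree inheritance mentioned in the thread can be
sketched in plain Java. This is only an illustrative model, not PDFBox code:
the names Node, effectiveResources and detach are hypothetical, and resources
are reduced to a simple name-to-value map. It assumes the usual PDF rule that
a page without its own Resources entry takes the dictionary of its nearest
ancestor as a whole. The detach step shows why splitting has to resolve
inherited resources before a page is moved into a new document.

```java
import java.util.HashMap;
import java.util.Map;

// Plain-Java model of a PDF page tree; NOT the PDFBox API.
final class PageTreeSketch {

    /** Hypothetical stand-in for a page tree node (intermediate node or leaf page). */
    static final class Node {
        final Node parent;                    // null for the root of the page tree
        final Map<String, String> resources;  // null when the node has no Resources entry

        Node(Node parent, Map<String, String> resources) {
            this.parent = parent;
            this.resources = resources;
        }
    }

    /**
     * Returns the resources that apply to a node: its own dictionary if present,
     * otherwise the nearest ancestor's. Returns an empty map if none is found.
     */
    static Map<String, String> effectiveResources(Node node) {
        for (Node n = node; n != null; n = n.parent) {
            if (n.resources != null) {
                return n.resources;
            }
        }
        return new HashMap<>();
    }

    /** Copies a page into a standalone node that no longer depends on any ancestor. */
    static Node detach(Node page) {
        return new Node(null, new HashMap<>(effectiveResources(page)));
    }

    public static void main(String[] args) {
        Map<String, String> shared = new HashMap<>();
        shared.put("F1", "Helvetica");
        shared.put("Im1", "company-logo");

        Node root = new Node(null, shared);
        Node inheritingPage = new Node(root, null);   // relies on the root's resources

        Map<String, String> own = new HashMap<>();
        own.put("F2", "Courier");
        Node overridingPage = new Node(root, own);    // its own dictionary wins

        System.out.println(effectiveResources(inheritingPage));
        System.out.println(effectiveResources(overridingPage));
        System.out.println(detach(inheritingPage).resources);
    }
}
```

A real implementation would work on the COS dictionaries (or the PD wrappers)
instead of maps, and a merge would additionally need to detect identical
resources so that shared logos and fonts are not copied twice.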