Am 29.05.2014 um 18:51 schrieb John Hewson <j...@jahewson.com>:

>> # splitting files (e.g. remove no longer needed resources)
> 
> Each page has its own Resources dictionary, so it shouldn't be too difficult. 
> One thing to watch out for is is the "page tree" which allows pages to 
> inherit resources from each other, this is handled as PDPageNode but it's 
> kind of messy.

thanks for the hint. Splitting and merging is somewhat similar as splitting is 
typically done by creating a new document and importing the needed pages into 
the newly created document. Using the current code this might lead to duplicate 
resources. 

> 
>> # merging files (e.g. avoid duplicating resources)
> 
> Sounds like the files are pretty similar, is this actually an overlay? Or are 
> you wanting to insert entire pages?

it’s merging individual files together inserting entire pages. Although the 
files are created individually they share some common elements like company 
logos or fonts. 

> 
> I imagine you probably want to implement both these features at the COS level 
> rather than the PD level, as it's pretty low-level processing.
> 

It will involve a lot of COS processing. I haven’t decided yet if it will sit 
on top of COS or PD. Typically we do encourage people to use PD so I tend to 
start from there and dig down internally as needed. WDYT?


> -- John
> 
>> On 29 May 2014, at 00:39, Maruan Sahyoun <sahy...@fileaffairs.de> wrote:
>> 
>> Hi,
>> 
>> for a current project I need to work on enhancing PDFBox for
>> 
>> # splitting files (e.g. remove no longer needed resources)
>> # merging files (e.g. avoid duplicating resources)
>> # page handling (adding/removing individual pages with resource handling)
>> # enhancements to forms handling (pre fill XFA forms - partially done, 
>> enhancing AP generation)
>> 
>> Is someone else working on something similar?
>> 
>> BR
>> 
>> Maruan

Reply via email to