Hi,

AFAIK we don't have such a capability, as what you are looking for is mixed into 
the general parsing. What I would like to
understand is the reason for looking for such objects. 

# Objects that aren't in the tree -> from a PDF perspective these are just ignored 
and are not part of the parsed PDF
# Objects with generation > 1 -> this is a capability in the spec but rarely used. 
Typically, if an object changes it will get a new
number instead of an incremented generation number
# Some per object stats ... -> what are you looking for?  
# Examples of multiple XMP files -> not so commonly used. Although the spec 
allows for that, a typical PDF mostly has
1 XMP file, which contains the document properties.

OTOH I'd love to see a SAX-type parser -> a while ago I started on a low 
level lexer/tokenizer along similar lines, but that never
made it into PDFBox.
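
To give an idea of what such a low level pass could look like, here is a
minimal sketch in plain Java. It is not PDFBox code and not the lexer
mentioned above: ObjectHeaderScanner, Handler and nonZeroGenerations are
hypothetical names made up for illustration. It only scans the raw bytes for
"N G obj" headers and fires one callback per header, so it assumes the objects
are not packed into object streams, and it can report false positives when
binary stream data happens to contain something that looks like a header.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper (not part of PDFBox): naive scan for "N G obj" headers. */
final class ObjectHeaderScanner {

    /** SAX-like callback, invoked once per object header in file order. */
    interface Handler {
        void objectHeader(long number, int generation, int offset);
    }

    // Object number, generation number and the keyword "obj", separated by
    // whitespace. The lookbehind keeps the match from starting mid-number.
    private static final Pattern HEADER =
            Pattern.compile("(?<![0-9])(\\d{1,10})\\s+(\\d{1,5})\\s+obj\\b");

    /** Scans the raw file content and reports every object header found. */
    static void scan(byte[] pdf, Handler handler) {
        // ISO-8859-1 maps each byte to exactly one char, so the match
        // offsets are also byte offsets into the file.
        String text = new String(pdf, StandardCharsets.ISO_8859_1);
        Matcher m = HEADER.matcher(text);
        while (m.find()) {
            handler.objectHeader(
                    Long.parseLong(m.group(1)),
                    Integer.parseInt(m.group(2)),
                    m.start());
        }
    }

    /** Collects "number generation" for every header with a generation above 0. */
    static List<String> nonZeroGenerations(byte[] pdf) {
        List<String> hits = new ArrayList<>();
        scan(pdf, (number, generation, offset) -> {
            if (generation > 0) {
                hits.add(number + " " + generation);
            }
        });
        return hits;
    }
}
```

Calling ObjectHeaderScanner.nonZeroGenerations(bytes) on the content of a file
lists the headers with a generation above 0 (generation numbers start at 0);
change the comparison if you really only want generation > 1. Comparing the
reported object numbers with what the regular parser ends up with would be one
way to spot objects that aren't in the tree.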

BR
Maruan
 
> All,
>   Apologies for not digging into our codebase more before asking this...
> If I wanted a low level SAX-like parser where an event is a COSObject,
> where would I start?
>   Should I start with the new on-demand parser in master/trunk or should I
> go back to 1.8.x?
> 
> I'm interested in finding:
> 
> * Objects that aren't in the tree
> * Objects with generation > 1
> * Some per object stats...
> * Examples of multiple XMP files
> 
> Thank you!
> 
>        Cheers,
> 
>                    Tim
-- 
Maruan Sahyoun

FileAffairs GmbH
Josef-Schappe-Straße 21
40882 Ratingen

Tel: +49 (2102) 89497 88
Fax: +49 (2102) 89497 91
sahy...@fileaffairs.de
www.fileaffairs.de

Geschäftsführer: Maruan Sahyoun
Handelsregister: AG Düsseldorf, HRB 53837
UST.-ID: DE248275827

