Hi, AFAIK we don't have such a capability as what you are looking for is mixed into the general parsing. What I would like to understand is the reason for looking for such objects.
# Objects that aren't in the tree -> from a PDF perspective are just ignored and not part of the parsed PDF # Objects with generation > 1 -> is a capability in the spec but rarely used. Typically if an object changes is will get a new number instead of incrementing the generation number # Some per object stats ... -> what are you looking for? # Examples of multiple XMP files -> not so commonly used. Although the spec allows for that in a typical PDF there there is mostly 1 XMP file which cotains the document properties. OTOH I'd love to see a SAX type parser -> a while ago I started with a low level lexer/tokenizer in a similar way but that never made it into PDFBox. BR Maruan > All, > Apologies for not digging into our codebase more before asking this... > If I wanted a low level SAX-like parser where an event is a COSObject, > where would I start? > Should I start with the new on-demand parser in master/trunk or should I > go back to 1.8.x? > > I'm interested in finding: > > * Objects that aren't in the tree > * Objects with generation > 1 > * Some per object stats... > * Examples of multiple XMP files > > Thank you! > > Cheers, > > Tim -- Maruan Sahyoun FileAffairs GmbH Josef-Schappe-Straße 21 40882 Ratingen Tel: +49 (2102) 89497 88 Fax: +49 (2102) 89497 91 sahy...@fileaffairs.de www.fileaffairs.de Geschäftsführer: Maruan Sahyoun Handelsregister: AG Düsseldorf, HRB 53837 UST.-ID: DE248275827 --------------------------------------------------------------------- To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org For additional commands, e-mail: users-h...@pdfbox.apache.org