On Fri, 13 Jan 2012, P. Hill wrote:
Anyone know about the (future?) ability of Tika to parse PDF Portfolio Files? http://help.adobe.com/en_US/Acrobat/9.0/Standard/WSA2872EA8-9756-4a8c-9F20-8E93D59D91CE.html
My hunch is that this will need some PDFBox support too, both to let us get at the original files and to tell us which parts make up a portfolio.
As a first step, I'd suggest you ask on the PDFBox list about their support for Portfolio files.
Nick
