On Mon, 23 Jan 2012, P. Hill wrote:
I finally got a moment to ask about PDF Portfolio files and the folks over at PDFBox directed me to: http://pdfbox.apache.org/userguide/file_references.html
Thats interesting, just a shame the examples only cover writing! If you're able to get some information on how to read them too, we can certainly have a look.
I pass that along for Tika developers, but it seems there might be some issues about combining all the content in a portfolio not unlike e-mails with attachments or other compound documents
I think we've now largely got that model sorted, so we'd support them in the same way that we currently support emails with attachments, word documents with embedded images etc
I can report my company has seen a least one end user using Portfolio files, but they don't seem very common.
We would ideally want a test document, for both sanity checking and unit testing. Don't suppose you can ask your end user to do us a sample one?
Nick
