Re: extracting embedded documents -- will getEmbeddedFile() alone miss embedded DOS/Unix/Mac files?

2014-07-24 Thread Andreas Lehmkühler
Hi, Allison, Timothy B. talli...@mitre.org hat am 23. Juli 2014 um 20:21 geschrieben: All,    Over on Tika, it looks like we copied org.apache.pdfbox.examples.pdmodel.ExtractEmbeddedFiles to extract embedded files.  As I look at the source code for PDComplexFileSpecification, I notice that

extracting embedded documents -- will getEmbeddedFile() alone miss embedded DOS/Unix/Mac files?

2014-07-23 Thread Allison, Timothy B.
All, Over on Tika, it looks like we copied org.apache.pdfbox.examples.pdmodel.ExtractEmbeddedFiles to extract embedded files. As I look at the source code for PDComplexFileSpecification, I notice that getEmbeddedFile() does not behave like getFilename(); that is, it doesn't iterate through