using scratch files in PDFParser

2015-07-09 Thread Allison, Timothy B.
In working on the PDFParser, I was looking at these lines: if (tstream != null && tstream.hasFile()) { // File based, take that as a cue to use a temporary file pdfDocument = PDDocument.load(new CloseShieldInputStream(stream), password, true);//, true); } else { ...If there's already a f

RE: DomXmpParser: namespace not found

2015-07-09 Thread Allison, Timothy B.
>From my perspective, it would be great to have a general xmp parser that also >allows for some variance from spec (PDFBOX-2855). We've been using jempbox >for pdfs as well as images over on Tika, and it has worked well for us. I'd prefer to continue using your xmp parser, but I understand if

RE: PDFBox 2.0.0 release

2015-07-09 Thread Allison, Timothy B.
>> Perhaps getting a “big” user, such as Tika onboard will give us proof that >> 2.0 is ready for production? Some APIs are still in flux, e.g. font mapping >> has undergone recent changes to handle CJK better. Consider us onboard. Our dev version of Tika with 2.0.0-trunk is here: https://gith

[jira] [Resolved] (TIKA-1676) Fix logic error in batch driver that prevents correct restarting of child process

2015-07-09 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-1676. --- Resolution: Fixed r1690090 > Fix logic error in batch driver that prevents correct restarting of child

[jira] [Created] (TIKA-1676) Fix logic error in batch driver that prevents correct restarting of child process

2015-07-09 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1676: - Summary: Fix logic error in batch driver that prevents correct restarting of child process Key: TIKA-1676 URL: https://issues.apache.org/jira/browse/TIKA-1676 Project: Tika