In working on the PDFParser, I was looking at these lines:
if (tstream != null && tstream.hasFile()) {
// File based, take that as a cue to use a temporary file
pdfDocument = PDDocument.load(new CloseShieldInputStream(stream), password,
true);//, true);
} else {
...If there's already a f
>From my perspective, it would be great to have a general xmp parser that also
>allows for some variance from spec (PDFBOX-2855). We've been using jempbox
>for pdfs as well as images over on Tika, and it has worked well for us.
I'd prefer to continue using your xmp parser, but I understand if
>> Perhaps getting a “big” user, such as Tika onboard will give us proof that
>> 2.0 is ready for production? Some APIs are still in flux, e.g. font mapping
>> has undergone recent changes to handle CJK better.
Consider us onboard. Our dev version of Tika with 2.0.0-trunk is here:
https://gith
[
https://issues.apache.org/jira/browse/TIKA-1676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-1676.
---
Resolution: Fixed
r1690090
> Fix logic error in batch driver that prevents correct restarting of child
Tim Allison created TIKA-1676:
-
Summary: Fix logic error in batch driver that prevents correct
restarting of child process
Key: TIKA-1676
URL: https://issues.apache.org/jira/browse/TIKA-1676
Project: Tika