Hi Folks, Let's mark this RC#2 as failed and shift the vote to the updated RC#3 ( http://markmail.org/message/m5gpgmr7hedgpjdj), which has Tesseract metadata fixes and David's test fix.
Thanks, Tyler On Thu, Jan 8, 2015 at 6:25 AM, Peter Bowyer <[email protected]> wrote: > +1. > > Worked great once I manually > edited > tika-parsers/src/main/resources/org/apache/tika/parser/pdf/PDFParser.properties > and set useNonSequentialParser to true > > Peter >
