Error parsing PDFs

2016-10-17 Thread Vincent
cutor.java:617)
at java.lang.Thread.run(Thread.java:745)

I tested the document with PDFBox ExtractText, and it works fine. An example of a failing document is:
https://gemeente.groningen.nl/system/files/1._jaarstukken_groninger_archieven_br_raad.pdf

Any suggestions?

Thanks in advance

Re: Error parsing PDFs

2016-10-17 Thread Vincent
Hi,

After some additional testing I found that this error does not occur for this document in Tika 1.11. I forgot to mention in my last message that I was using Tika 1.13. So is this perhaps a bug in the new Tika version?

Regards,
Vincent

On 17-10-16 13:37, Vincent wrote:
Hi all, I have