Hi Shaun,
thank you for the research. I’ll give it a try because we have issues with some
PDFs, too.
Kind regards,
Paul Münch
> Am 28.04.2020 um 19:12 schrieb Shaun donovan :
>
> Hi All.
>
> Answered my own question. Disabled pdftoolkit and re-enabled XPDF. Works with
> files that are set
Hi All.
Answered my own question. Disabled pdftoolkit and re-enabled XPDF. Works
with files that are set to prevent copying and seems to do a better job
with the text extraction.
Kind Regards.
Shaun.
On 2020/04/24 13:39, Shaun donovan wrote:
Hi all.
I am receiving the following error
Hi all.
I am receiving the following error when trying to run filter-media on
certain pdf files:
java.io.IOException
at
org.apache.pdfbox.filter.FlateFilter.decode(FlateFilter.java:108)
at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:379)
at