Am 15.07.2022 um 15:41 schrieb PGNet Dev:
    Jul 15 08:41:27 mx tika[1143]: INFO  [qtp1837533591-27] 08:41:27,224 org.apache.tika.server.core.resource.TikaResource /tika (application/pdf)     Jul 15 08:41:27 mx tika[1143]: WARN  [qtp1837533591-27] 08:41:27,453 org.apache.pdfbox.pdfparser.COSParser The end of the stream doesn't point to the correct offset, using workaround to read the stream, stream start position: 104315, length: 356, expected end position: 104671     Jul 15 08:41:27 mx tika[1143]: ERROR [qtp1837533591-27] 08:41:27,457 org.apache.pdfbox.filter.FlateFilter FlateFilter: stop reading corrupt stream due to a DataFormatException     Jul 15 08:41:27 mx tika[1143]: WARN  [qtp1837533591-27] 08:41:27,730 org.apache.pdfbox.pdfparser.COSParser The end of the stream doesn't point to the correct offset, using workaround to read the stream, stream start position: 101699, length: 1472, expected end position: 103171     Jul 15 08:41:27 mx tika[1143]: ERROR [qtp1837533591-27] 08:41:27,735 org.apache.pdfbox.filter.FlateFilter FlateFilter: stop reading corrupt stream due to a DataFormatException     Jul 15 08:41:27 mx tika[1143]: WARN  [qtp1837533591-27] 08:41:27,742 org.apache.pdfbox.pdfparser.COSParser The end of the stream doesn't point to the correct offset, using workaround to read the stream, stream start position: 101509, length: 66, expected end position: 101575     Jul 15 08:41:27 mx tika[1143]: ERROR [qtp1837533591-27] 08:41:27,744 org.apache.pdfbox.filter.FlateFilter FlateFilter: stop reading corrupt stream due to a DataFormatException     Jul 15 08:41:27 mx tika[1143]: WARN  [qtp1837533591-27] 08:41:27,748 org.apache.pdfbox.pdfparser.COSParser The end of the stream doesn't point to the correct offset, using workaround to read the stream, stream start position: 2011, length: 2482, expected end position: 4493


    Jul 15 08:41:27 mx tika[1143]: Caused by: java.io.IOException: Page tree root must be a dictionary

likely invalid PDFs. Please upload them somewhere for inspection

Tilman

Reply via email to