Am 15.07.2022 um 15:41 schrieb PGNet Dev:
Jul 15 08:41:27 mx tika[1143]: INFO [qtp1837533591-27] 08:41:27,224 org.apache.tika.server.core.resource.TikaResource /tika (application/pdf) Jul 15 08:41:27 mx tika[1143]: WARN [qtp1837533591-27] 08:41:27,453 org.apache.pdfbox.pdfparser.COSParser The end of the stream doesn't point to the correct offset, using workaround to read the stream, stream start position: 104315, length: 356, expected end position: 104671 Jul 15 08:41:27 mx tika[1143]: ERROR [qtp1837533591-27] 08:41:27,457 org.apache.pdfbox.filter.FlateFilter FlateFilter: stop reading corrupt stream due to a DataFormatException Jul 15 08:41:27 mx tika[1143]: WARN [qtp1837533591-27] 08:41:27,730 org.apache.pdfbox.pdfparser.COSParser The end of the stream doesn't point to the correct offset, using workaround to read the stream, stream start position: 101699, length: 1472, expected end position: 103171 Jul 15 08:41:27 mx tika[1143]: ERROR [qtp1837533591-27] 08:41:27,735 org.apache.pdfbox.filter.FlateFilter FlateFilter: stop reading corrupt stream due to a DataFormatException Jul 15 08:41:27 mx tika[1143]: WARN [qtp1837533591-27] 08:41:27,742 org.apache.pdfbox.pdfparser.COSParser The end of the stream doesn't point to the correct offset, using workaround to read the stream, stream start position: 101509, length: 66, expected end position: 101575 Jul 15 08:41:27 mx tika[1143]: ERROR [qtp1837533591-27] 08:41:27,744 org.apache.pdfbox.filter.FlateFilter FlateFilter: stop reading corrupt stream due to a DataFormatException Jul 15 08:41:27 mx tika[1143]: WARN [qtp1837533591-27] 08:41:27,748 org.apache.pdfbox.pdfparser.COSParser The end of the stream doesn't point to the correct offset, using workaround to read the stream, stream start position: 2011, length: 2482, expected end position: 4493
Jul 15 08:41:27 mx tika[1143]: Caused by: java.io.IOException: Page tree root must be a dictionary
likely invalid PDFs. Please upload them somewhere for inspection Tilman
