https://issues.apache.org/bugzilla/show_bug.cgi?id=52991
RM <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |REOPENED
         Resolution|FIXED                       |---

--- Comment #2 from RM <[email protected]> ---
Verified with the current trunk, revision 1337825: not fixed yet.

The source is a ppt, and the error is exactly the same:

Exception in thread "main" org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.microsoft.OfficeParser@bd928a
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:248)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
    at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:126)
    at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:395)
    at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:97)
Caused by: org.apache.tika.io.TaggedIOException: Unexpected end of ZLIB input stream
    at org.apache.tika.io.TaggedInputStream.handleIOException(TaggedInputStream.java:133)
    at org.apache.tika.io.ProxyInputStream.read(ProxyInputStream.java:103)
    at org.apache.tika.io.ProxyInputStream.read(ProxyInputStream.java:99)
    at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
    at java.io.BufferedInputStream.read1(BufferedInputStream.java:258)
    at java.io.BufferedInputStream.read(BufferedInputStream.java:317)
    at java.io.FilterInputStream.read(FilterInputStream.java:90)
    at org.apache.tika.io.IOUtils.copyLarge(IOUtils.java:933)
    at org.apache.tika.io.IOUtils.copy(IOUtils.java:907)
    at org.apache.tika.io.TikaInputStream.getFile(TikaInputStream.java:536)
    at org.apache.tika.io.TikaInputStream.getFileChannel(TikaInputStream.java:564)
    at org.apache.tika.parser.microsoft.POIFSContainerDetector.getTopLevelNames(POIFSContainerDetector.java:335)
    at org.apache.tika.parser.microsoft.POIFSContainerDetector.detect(POIFSContainerDetector.java:152)
    at org.apache.tika.detect.CompositeDetector.detect(CompositeDetector.java:61)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:113)
    at org.apache.tika.parser.DelegatingParser.parse(DelegatingParser.java:72)
    at org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor.parseEmbedded(ParsingEmbeddedDocumentExtractor.java:102)
    at org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.handleEmbeddedResource(AbstractPOIFSExtractor.java:68)
    at org.apache.tika.parser.microsoft.HSLFExtractor.handleSlideEmbeddedResources(HSLFExtractor.java:236)
    at org.apache.tika.parser.microsoft.HSLFExtractor.parse(HSLFExtractor.java:117)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:188)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:160)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
    ... 5 more
Caused by: java.io.EOFException: Unexpected end of ZLIB input stream
    at java.util.zip.InflaterInputStream.fill(InflaterInputStream.java:223)
    at java.util.zip.InflaterInputStream.read(InflaterInputStream.java:141)
    at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
    at java.io.BufferedInputStream.read1(BufferedInputStream.java:258)
    at java.io.BufferedInputStream.read(BufferedInputStream.java:317)
    at org.apache.tika.io.ProxyInputStream.read(ProxyInputStream.java:99)
    ... 26 more

---------

Debian Squeeze with Tika built from source (also tried 1.0 and 1.1).
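
The innermost cause in the trace is java.util.zip.InflaterInputStream running out of input before the zlib stream is complete. The following is a minimal sketch of that condition in isolation, without Tika or POI. It assumes the compressed data of the embedded object is truncated, which this report does not confirm; the class name TruncatedZlibDemo and its methods are hypothetical and exist only for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.EOFException;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.util.Arrays;
import java.util.zip.DeflaterOutputStream;
import java.util.zip.InflaterInputStream;

// Hypothetical demo class, not part of Tika or POI.
public class TruncatedZlibDemo {

    /** Builds some deterministic sample bytes to compress. */
    static byte[] sampleData() {
        byte[] data = new byte[10_000];
        for (int i = 0; i < data.length; i++) {
            data[i] = (byte) ((i * 31) % 251);
        }
        return data;
    }

    /** Compresses the given bytes into a complete zlib stream. */
    static byte[] deflate(byte[] data) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (DeflaterOutputStream dos = new DeflaterOutputStream(out)) {
            dos.write(data);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out.toByteArray();
    }

    /**
     * Reads the stream to the end. Returns null if it inflates cleanly,
     * or the EOFException message if the input ends early.
     */
    static String inflateAndReport(byte[] compressed) {
        byte[] buf = new byte[4096];
        try (InputStream in =
                 new InflaterInputStream(new ByteArrayInputStream(compressed))) {
            while (in.read(buf) != -1) {
                // Discard the inflated bytes; only the outcome matters here.
            }
            return null;
        } catch (EOFException e) {
            return e.getMessage();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        byte[] full = deflate(sampleData());
        // Assumption for the sketch: the stream is cut off halfway.
        byte[] truncated = Arrays.copyOf(full, full.length / 2);

        System.out.println("complete stream:  " + inflateAndReport(full));
        System.out.println("truncated stream: " + inflateAndReport(truncated));
    }
}
```

If the truncated case reports the same message as the trace, that is consistent with the embedded object's compressed data being cut short, or being read with the wrong length. It does not show which of the two applies to the attached ppt.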
