[ https://issues.apache.org/jira/browse/COMPRESS-661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17818863#comment-17818863 ]
Tilman Hausherr edited comment on COMPRESS-661 at 2/20/24 7:38 PM:
-------------------------------------------------------------------

{code:java}
ArArchiveInputStream ar = new ArArchiveInputStream(new BufferedInputStream(new FileInputStream("....../testARofText.ar")));
System.out.println("ar.markSupported(): " + ar.markSupported());
ArArchiveEntry aentry;
while ((aentry = ar.getNextEntry()) != null) {
    // Peek at the first 10 bytes of the entry, then rewind.
    ar.mark(10);
    ar.read(new byte[10]);
    ar.reset();
    // After reset() the whole entry should still be readable.
    System.out.println("AR: " + new String(ar.readAllBytes()));
}
{code}
This code fails with 1.25.0 because mark/reset is not supported and markSupported() returns false:
{noformat}
ar.markSupported(): false
Exception in thread "main" java.io.IOException: mark/reset not supported
    at java.base/java.io.InputStream.reset(InputStream.java:655)
    at com.mycompany.maventikaproject.TilmanSevenTest.main(TilmanSevenTest.java:62)
{noformat}
With 1.26.0, markSupported() returns true, but the output is cut short and reading the next entry fails:
{noformat}
ar.markSupported(): true
AR: Test d'indexation de Txt http://www.a
Exception in thread "main" java.io.IOException: Truncated ar archive
    at org.apache.commons.compress.archivers.ar.ArArchiveInputStream.getNextArEntry(ArArchiveInputStream.java:281)
    at org.apache.commons.compress.archivers.ar.ArArchiveInputStream.getNextEntry(ArArchiveInputStream.java:351)
    at com.mycompany.maventikaproject.TilmanSevenTest.main(TilmanSevenTest.java:58)
{noformat}

> commons-compress 1.26.0 breaks Apache Tika 2.9.1
> ------------------------------------------------
>
>                 Key: COMPRESS-661
>                 URL: https://issues.apache.org/jira/browse/COMPRESS-661
>             Project: Commons Compress
>          Issue Type: Bug
>          Components: Compressors
>    Affects Versions: 1.26.0
>            Reporter: Alexander Veit
>            Priority: Critical
>         Attachments: testARofText.ar
>
>
> Apache Commons Compress 1.26.0 fixes
> * https://www.cve.org/CVERecord?id=CVE-2024-25710 and
> * https://www.cve.org/CVERecord?id=CVE-2024-26308.
> We have tried to replace Apache Commons Compress 1.25.0 with 1.26.0 in our
> deployments in order to fix these security vulnerabilities. Unfortunately,
> Apache Tika is now broken:
> {noformat}
> org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.iwork.IWorkPackageParser@41fcb910
>     at app//org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:304)
>     at app//org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:298)
>     at app//org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:203)
>     at app//org.apache.tika.Tika.parseToString(Tika.java:525)
>     at app//org.apache.tika.Tika.parseToString(Tika.java:495)
>     at ...
> Caused by: java.io.IOException: Resetting to invalid mark
>     at java.base/java.io.BufferedInputStream.reset(BufferedInputStream.java:446)
>     at org.apache.tika.parser.iwork.IWorkPackageParser.parse(IWorkPackageParser.java:97)
>     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:298)
>     ... 42 more
> {noformat}
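
For comparison, the mark/peek/reset pattern exercised by the snippet in the comment can be tried against a plain JDK stream. The sketch below is only an illustration, not the Commons Compress or Tika code: it wraps a `ByteArrayInputStream` in a `BufferedInputStream` instead of using `ArArchiveInputStream`, and the class name `MarkResetCheck` and the helper `peek` are hypothetical names introduced here. It assumes Java 9 or later (for `readNBytes` and `readAllBytes`). It shows what a caller presumably expects when `markSupported()` returns true: after `reset()`, the stream yields its full content again, including the peeked bytes.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

public class MarkResetCheck {

    /**
     * Hypothetical helper: reads up to n bytes and rewinds the stream,
     * so the caller can still consume the stream from the same position.
     */
    static byte[] peek(InputStream in, int n) throws IOException {
        if (!in.markSupported()) {
            throw new IOException("mark/reset not supported");
        }
        in.mark(n);                          // remember the current position
        byte[] buf = new byte[n];
        int read = in.readNBytes(buf, 0, n); // may be fewer than n at end of stream
        in.reset();                          // rewind to the marked position
        return Arrays.copyOf(buf, read);
    }

    public static void main(String[] args) throws IOException {
        byte[] data = "Test d'indexation de Txt".getBytes(StandardCharsets.UTF_8);
        try (InputStream in = new BufferedInputStream(new ByteArrayInputStream(data))) {
            System.out.println("markSupported(): " + in.markSupported());
            byte[] head = peek(in, 10);
            byte[] all = in.readAllBytes();
            System.out.println("peeked: " + new String(head, StandardCharsets.UTF_8));
            // A stream that honours mark/reset returns everything after the rewind.
            System.out.println("full content intact: " + Arrays.equals(all, data));
        }
    }
}
```

Running the same `peek` helper against an archive entry stream would be one way to check whether a given Commons Compress version returns the complete entry after `reset()`.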