https://bz.apache.org/bugzilla/show_bug.cgi?id=66197
earl <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
            Summary|OutOfMemoryError occurs     |OutOfMemoryError occurs
                   |while parsing doc file      |while parsing doc file
                   |using tika-app which        |using tika-app which
                   |contains poi of the above   |contains poi of the above
                   |version                     |version. When tried to use
                   |                            |command line Caused by:
                   |                            |org.apache.poi.util.RecordF
                   |                            |ormatException: Tried to
                   |                            |allocate an array of length
                   |                            |102,853,589
                 OS|                            |All

--- Comment #1 from earl <[email protected]> ---
Raising the limit with IOUtils.setByteArrayMaxOverride() would not help, since
we cannot assume a maximum size on the customer's end.

Full stack trace:

Exception in thread "main" org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@662706a7
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:312)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:298)
        at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:180)
        at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:1086)
        at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:510)
        at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:259)
Caused by: org.apache.poi.util.RecordFormatException: Tried to allocate an array of length 102,853,589, but the maximum length for this record type is 100,000,000. If the file is not corrupt or large, please open an issue on bugzilla to request increasing the maximum allowable size for this record type.
As a temporary workaround, consider setting a higher override value with IOUtils.setByteArrayMaxOverride()
        at org.apache.poi.util.IOUtils.throwRFE(IOUtils.java:599)
        at org.apache.poi.util.IOUtils.checkLength(IOUtils.java:276)
        at org.apache.poi.util.IOUtils.safelyAllocateCheck(IOUtils.java:561)
        at org.apache.poi.util.IOUtils.safelyClone(IOUtils.java:575)
        at org.apache.poi.hwpf.model.TextPieceTable.<init>(TextPieceTable.java:118)
        at org.apache.poi.hwpf.model.ComplexFileTable.newTextPieceTable(ComplexFileTable.java:111)
        at org.apache.poi.hwpf.model.ComplexFileTable.<init>(ComplexFileTable.java:72)
        at org.apache.poi.hwpf.model.ComplexFileTable.<init>(ComplexFileTable.java:77)
        at org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:283)
        at org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.java:152)
        at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:218)
        at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:175)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:298)

-- 
You are receiving this mail because:
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
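
For readers following the trace, the guard that fails at IOUtils.checkLength can be pictured roughly as below. This is a minimal, self-contained sketch and not POI's source: the class LengthGuard, its fields, and its method names are hypothetical, and the rule that a positive override replaces the per-record limit is an assumption based on the wording of the exception message. The two numbers are the ones quoted in the trace.

```java
import java.util.Locale;

// Hypothetical stand-in for a length guard; NOT the actual org.apache.poi.util.IOUtils code.
public class LengthGuard {
    // Per-record default limit, taken from the exception message in the trace.
    public static final int DEFAULT_MAX = 100_000_000;

    // Assumption: a value <= 0 means "no override is set".
    private static int maxOverride = -1;

    // Plays the role that IOUtils.setByteArrayMaxOverride() has in the report.
    public static void setMaxOverride(int value) {
        maxOverride = value;
    }

    // Rejects the request before any array is allocated.
    public static void checkLength(long length, int recordMax) {
        int effectiveMax = (maxOverride > 0) ? maxOverride : recordMax;
        if (length < 0 || length > effectiveMax) {
            throw new IllegalStateException(String.format(Locale.US,
                "Tried to allocate an array of length %,d, but the maximum length "
                    + "for this record type is %,d.", length, effectiveMax));
        }
    }

    public static void main(String[] args) {
        long requested = 102_853_589L; // length reported in the trace

        try {
            checkLength(requested, DEFAULT_MAX);
        } catch (IllegalStateException e) {
            System.out.println("rejected: " + e.getMessage());
        }

        // A fixed override only helps while every input stays below it.
        setMaxOverride(200_000_000);
        checkLength(requested, DEFAULT_MAX);
        System.out.println("accepted with override");
    }
}
```

The sketch also shows the limitation raised in Comment #1: whatever override value is chosen is itself a fixed ceiling, so a sufficiently large customer file would be rejected again.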
