[
https://issues.apache.org/jira/browse/TIKA-521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12917819#action_12917819
]
Sjoerd Smeets commented on TIKA-521:
------------------------------------
I'm facing the same issue. Increasing the heapssize to the maximum will cover
for a certain amount of xlsx files, but there are still a lot of files causing
an OutOfMemoryError (> 10 Mb XLS files). The XSSFEventBasedExcelExtractor
indeed processes these files as we would like to. What would be the draw back
of using XSSFEventBasedExcelExtractor?
> OutOfMemoryError Parsing XSLX File
> ----------------------------------
>
> Key: TIKA-521
> URL: https://issues.apache.org/jira/browse/TIKA-521
> Project: Tika
> Issue Type: Bug
> Affects Versions: 0.7, 0.8
> Reporter: Stephen Duncan Jr
> Attachments: memory-test.xlsx
>
>
> I have several XSLX files I'm trying to parse with Tika that are failing with
> an OutOfMemoryError even when using a large heap size. For instance the
> attached 1.26MB excel file fails using a 512MB heap.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.