[
https://issues.apache.org/jira/browse/TIKA-734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Anirban Mitra updated TIKA-734:
-------------------------------
Attachment: Sample BIG Excel 2007 File.xls
Hi,
The out of memory issue is resolved now. but we are seeing a huge performance
issue with 10 concurrent users when we tried to parse the attached 10 MB xlsx
file.it takes around 15 mins in average for 10 concurrent users to parse the
document.After profiling the code using JProfiler, we found
AutoDetectParser.Parse() takes most of the time. and many threads are
waiting/blocked.i am using XML beans jar xmlbeans-2.3.0.jar and
xml-apis-1.0.b2.jar. any suggestions will be helpful.
Thanks
Anirban
> Out of memory exception with Xlsx file less than 5 MB
> -----------------------------------------------------
>
> Key: TIKA-734
> URL: https://issues.apache.org/jira/browse/TIKA-734
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 0.7
> Environment: Windows Vista , JUnit test cases running in RAD, JVM
> heap memory - 500MB
> Reporter: Anirban Mitra
> Attachments: Sample BIG Excel 2007 File.xls
>
>
> I am trying to parse and extract a pattern from Xlsx files.i tried using a 5
> MB file and when i run my
> JUnit test cases, it fails and i see heap memory out of size exception.Do we
> have any resolution for the same ?
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira