https://issues.apache.org/bugzilla/show_bug.cgi?id=50993

--- Comment #4 from David Fisher <[email protected]> 2011-03-29 16:15:10 
EDT ---
@Ryan -

This is a huge file. The 13 MB of the docx expands into a 33MB
word/document.xml

POI turns this into a Java object for each bit of xml in those 33MB and it all
must be in memory in the standard case. This easily is in the 4GB to 8GB range.
We are not going to fix the standard method. We would consider patches that
might help.

As far as the architecture is concerned I would not want to ever load anything
so large into a web server like Apache Tomcat.

By asking what the use case is allows an intelligent discussion about what
techniques are available and the correct place to do that is on the POI User
list and it is not in a bugzilla entry.

That is how POI works.

We have no idea for what purpose you are loading this data into your web
server. Do you intend to find results? Are you analyzing them? So, let's have a
dialog, but on the user list.

If you want to discuss different algorithms that might solve the problem then
please try the POI Developer list.

If you search bugzilla and the lists about OutOfMemory I think you will find my
response consistent. I'm sorry if it was short this time.

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to