https://bz.apache.org/bugzilla/show_bug.cgi?id=61665

Javen O'Neal <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Severity|critical                    |enhancement

--- Comment #2 from Javen O'Neal <[email protected]> ---
XML as a serialization and deserialization format will always be slower than an
optimized binary format. Apache POI's internal model for an xlsx file
maintains XML beans, updating them as needed and writing them out as is.
The benefit of this strategy is that features that POI doesn't understand or
implement are kept, unmodified. Had we converted the information in the XML
beans to POJOs and discarded the XML beans immediately after reading the
workbook, it's likely information would have been lost.
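
To illustrate the trade-off, here is a minimal sketch that uses only the JDK's
DOM API. It is not POI or XMLBeans code: the class name, the sample cell XML,
and the <ext> element standing in for "a feature POI doesn't understand" are
all hypothetical.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

// Hypothetical illustration only; none of these names exist in POI.
final class RoundTripSketch {
    // A cell with a value we understand and an extension element we do not.
    static final String INPUT = "<c r=\"A1\"><v>1</v><ext vendor=\"x\"/></c>";

    private static Document parse(String xml) {
        try {
            return DocumentBuilderFactory.newInstance().newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
    }

    // Strategy 1: keep the parsed tree, update it in place, write it back.
    // Elements we never looked at are serialized again untouched.
    static String keepTree(String xml, String newValue) {
        try {
            Document doc = parse(xml);
            doc.getElementsByTagName("v").item(0).setTextContent(newValue);
            Transformer t = TransformerFactory.newInstance().newTransformer();
            t.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
            StringWriter out = new StringWriter();
            t.transform(new DOMSource(doc), new StreamResult(out));
            return out.toString();
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
    }

    // Strategy 2: copy only the known fields into a plain object, discard
    // the tree, and regenerate the XML from that object.
    static final class CellPojo {
        String ref;
        String value;
    }

    static String viaPojo(String xml, String newValue) {
        CellPojo cell = new CellPojo();
        cell.ref = parse(xml).getDocumentElement().getAttribute("r");
        cell.value = newValue;
        // Nothing in CellPojo remembers <ext>, so it cannot be written back.
        return "<c r=\"" + cell.ref + "\"><v>" + cell.value + "</v></c>";
    }
}
```

Calling keepTree(INPUT, "2") returns XML that still contains the <ext>
element, while viaPojo(INPUT, "2") returns XML without it. The first strategy
pays for that fidelity by keeping the whole tree in memory.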

We are investigating replacing XMLBeans with a different XML library
(constrained by compatibility with the Apache License 2.0) that may be faster
and more memory efficient, and this may provide some improvements in speed.
This is an extremely large task that requires modifying nearly every XSSF class
and OOXML class. Any help would be greatly appreciated.

On a smaller scale, if after profiling the code you find a section that can be
improved, please submit your profiling results and a patch that doesn't break
backwards compatibility.

-- 
You are receiving this mail because:
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
