I am using
dom4j-1.4-dev-8.jar, the version that came with my last maven build of
jelly.
My xml
document:
159
mbs
2,438,791
lines/tags -> 1 tag per line, all attributes
~6 attributes per
tag
4 out of 6
attributes are numeric values, so they are not huge strings.
Attributes 5 and 6
could probably be interned as well, but this would require additional api
support.
This document
expands to 1100mb's in memory. Could this be right? Seems high to me. I assume
all element names and attribute names are interned. I tried to force interning
by doing this:
SAXReader reader = new
SAXReader();
reader.setFeature("http://xml.org/sax/features/string-interning", true);
reader.setFeature("http://xml.org/sax/features/string-interning", true);
Which I think is
the default anyway. I am using xerces-2.0.2.jar for SAXReader via the system
property.
Are things being
interned? Are there any other tricks to reducing memory
consumption?
-jason
horman
This email message and any attachments are for the sole use of the intended recipient(s) and may contain confidential and privileged information. Any unauthorized review, use, disclosure or distribution is prohibited. If you are not the intended recipient or his/her representative, please contact the sender by reply email and destroy all copies of the original message.