Hi,
I've been running out of memory reusing the same XMLReader
(xercesImpl-2.8.0) to parse many large documents.
The documents reference the same DTD which references many entities.
Profiling (with netbeans-5.0) reveals that the problem is with char[]s
allocated by:
org.apache.xerces.util.SymbolTable.$Entry.<init>
org.apache.xerces.util.SymbolTable.addSymbol()
org.apache.xerces.impl.XMLEntityScanner.scanName()
org.apache.xerces.impl.XMLDTDscannerImpl.scanEntityDecl()
...
Maybe its storing the symbol table for the same DTD for each new
document and never discarding it?
Should it recognize a previously parsed DTD and reuse the existing
symbol table?
I've worked around it by using a new XMLReader for each document.
Can I get DTDs and entities cached to improve performance?
I'm using org.apache.xerces.util.XMLCatalogResolver.
Cheers,
Neil.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]