I don't believe "unresolved" is an option. You can get the parser to tell you what the boundaries of entities are -- SAX has startEntity/endEntity events (on LexicalHandler), and the DOM has EntityReference nodes -- but if you don't want the contents, it's your responsibility to not look inside them.

Xerces doesn't always generate EntityReference nodes; see http://xml.apache.org/xerces2-j/features.htm for the create-entity-ref-nodes option which controls that. (But the default is to produce them.)
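
As a rough illustration of the SAX side, here is a minimal sketch that records entity boundaries through a LexicalHandler. It assumes the JAXP parser bundled with the JDK rather than a standalone Xerces build; the class name EntityBoundaries, the trace method, and the "greet" entity are made up for the example.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.ext.DefaultHandler2;

// Hypothetical demo class: logs where entities begin and end during a SAX parse.
class EntityBoundaries {

    // Parses the given XML and returns a flat list of the events of interest.
    static List<String> trace(String xml) throws Exception {
        List<String> events = new ArrayList<>();

        // DefaultHandler2 implements both ContentHandler and LexicalHandler.
        DefaultHandler2 handler = new DefaultHandler2() {
            @Override
            public void startEntity(String name) {
                events.add("start:" + name);
            }

            @Override
            public void endEntity(String name) {
                events.add("end:" + name);
            }

            @Override
            public void characters(char[] ch, int start, int length) {
                // The replacement text still arrives here, between the
                // startEntity and endEntity calls; the parser does not hide it.
                events.add("text:" + new String(ch, start, length));
            }
        };

        XMLReader reader = SAXParserFactory.newInstance().newSAXParser().getXMLReader();
        reader.setContentHandler(handler);
        // Standard SAX2 property for registering a LexicalHandler.
        reader.setProperty("http://xml.org/sax/properties/lexical-handler", handler);
        reader.parse(new InputSource(new StringReader(xml)));
        return events;
    }

    public static void main(String[] args) throws Exception {
        String xml = "<!DOCTYPE doc [<!ENTITY greet \"hello\">]><doc>a&greet;b</doc>";
        trace(xml).forEach(System.out::println);
    }
}
```

The entity's content is delivered as ordinary character data; the start/end events only bracket it. An application that wants to skip the contents has to ignore whatever arrives between the two calls itself.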

For efficiency reasons, built-in entity references such as &amp;, and numeric character references, are almost always expanded into the characters they represent rather than being left as references. If you really can't live with that, there are parser feature options which will cause the parser to report them in entity form. Again, this does not keep them from being expanded; it only tells you where the boundaries were.

______________________________________
Joe Kesselman / IBM Research
