Joe, 

Joseph Kesselman/CAM/Lotus wrote:
> HOWEVER -- when I turn on the schema validator, parser performance falls
> through the floor -- even though none of the test documents references a
> schema, and only two of them reference a DTD. The parse() operation takes
> almost twice as long to complete.

This is a single parse(), correct? I mean you did not use any warm-up..?
The performance falls for a single parse since the cost of adding XML
Schema Validator is included in the parse() method. To get correct
performance measure you should exclude the first parse from your
performance tests. If you do so, the time difference in parsing with
"-v" or "-v -s" options is insignificant.

I am not sure how and if we can improve initialization time for
XMLSchemaValidator, but since initialization happens just one, for the
parsers used in run-more-than-once scenario the performance difference
should be minor.

As Henry noticed if XML Schema is included in the pipeline we do some
work in startElement(). I am not sure how we can change it unless we can
come up with a new feature stating something like: "if DOCTYPE is found
validate only against DTD" ... (I am not sure we want to do it).

Currently we try to validate against both: DTDs and XML Schemas. That is
why we do check if XML Schema is found on some element.
 
-- 
Elena Litani / IBM Toronto

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to