Joe, Joseph Kesselman/CAM/Lotus wrote: > HOWEVER -- when I turn on the schema validator, parser performance falls > through the floor -- even though none of the test documents references a > schema, and only two of them reference a DTD. The parse() operation takes > almost twice as long to complete.
This is a single parse(), correct? I mean you did not use any warm-up..? The performance falls for a single parse since the cost of adding XML Schema Validator is included in the parse() method. To get correct performance measure you should exclude the first parse from your performance tests. If you do so, the time difference in parsing with "-v" or "-v -s" options is insignificant. I am not sure how and if we can improve initialization time for XMLSchemaValidator, but since initialization happens just one, for the parsers used in run-more-than-once scenario the performance difference should be minor. As Henry noticed if XML Schema is included in the pipeline we do some work in startElement(). I am not sure how we can change it unless we can come up with a new feature stating something like: "if DOCTYPE is found validate only against DTD" ... (I am not sure we want to do it). Currently we try to validate against both: DTDs and XML Schemas. That is why we do check if XML Schema is found on some element. -- Elena Litani / IBM Toronto --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
