Thanks to the folks from Sun for all their hard work in making this
happen! Now we have some work to do!   :-)

The xml-contrib area is designed so that people can look at the code,
try it out, etc.  The license issues have all been worked out, and the
code is now under the Apache 1.1 license, so feel free to look at it, 
play with it, figure it out, etc.

I have a couple of major suggestions (I'm sure that other people have
more):

1) It has been reported that the Crimson code is 50% faster than
Xerces-J when running on a Sparc Ultra-5, however Xerces-J is 40% faster
than Crimson on a Windows NT machine. It's not obvious to me why this
would be true!  We need to figure out WHY, so we can create a single
code base that is fast on BOTH.

2) Crimson has a DOM implementation that is particularly interesting. 
It has been reported that it "scales better" as the size of an XML
document goes up, but that is not my experience (but, I've been looking
only at Windows NT, so this could again be a Sparc/Windows difference).
This could be due to differences in memory consumption, or something
else altogether.  We should be able to figure out what's going on here,
and get the best of both worlds.  Because the Xerces DOM is pluggable,
we might need to end up with two DOM's that are optimized for two
different things:  a) the current deferred DOM is optimized for
performance, but maybe not for memory consumption, and b) perhaps the
Crimson DOM is optimized for memory consumption.  

3) Now that we can see the XHTML code, we should be able to compare
Assaf's HTML parser code, and the new Crimson code, so we can end up
with the best of both.  We routinely get requests for HTML parsing, and
this is a pretty self-contained area, so it's a great opportunity to
jump in and contribute!

All of these things are high on my list -- does anybody want to take a
crack at them? This is a great opportunity for some new people to jump
in, and check out all the code...

Mike

P.S.  Traffic is now moving to the xerces-j list...please adjust your
mailing list subscriptions accordingly!

Rajiv Mordani wrote:
> 
> Announicing the release of the code for Crimson XML Parsing Core Library..
> This code is based on Sun's Java Project X and is available via the cvs
> module xml-contrib/crimson for people to look at... Please read the README
> for directions on how to build the source. The list of features to be
> included into xerces is yet to be decided.
> 
> - Rajiv
> 
> --
> :wq

Reply via email to