Re: std.xml2 (collecting features)

Jonathan M Davis via Digitalmars-d Mon, 04 May 2015 12:16:15 -0700

On Sunday, 3 May 2015 at 17:39:48 UTC, Robert burner Schadekwrote:

std.xml has been considered not up to specs nearly 3 years now.Time to build a successor. I currently plan the followingfeatues for it:
- SAX and DOM parser
- in-situ / slicing parsing when possible (forward range?)
- compile time switch (CTS) for lazy attribute parsing
- CTS for encoding (ubyte(ASCII), char(utf8), ... )
- CTS for input validating
- performance
Not much code yet, I'm currently building the performance testsuite https://github.com/burner/std.xml2
Please post you feature requests, and please keep the posts DRYand on topic.


If I were doing it, I'd do three types of parsers:

1. A parser that was pretty much as low level as you can get,where you basically a range of XML atributes or tags. Exactly howto build that could be a bit entertaining, since it would have tobe hierarchical, and ranges aren't, but something like a range oftags where you can get a range of its attributes and sub-tagsfrom it so that the whole document can be processed withoutactually getting to the level of even a SAX parser. That parsercould then be used to build the other parsers, and anyone whoneeded insanely fast speeds could use it rather than the SAX orDOM parser so long as they were willing to pay the inevitableloss in user-friendliness.


2. SAX parser built on the low level parser.

3. DOM parser built either on the low level parser or the SAXparser (whichever made more sense).

I doubt that I'm really explaining the low level parser wellenough or have even though through it enough, but I really thinkthat even a SAX parser is too high level for the base parser andthat something that slightly higher than a lexer (high enough toactually be processing XML rather than individual tokens butpretty much only as high as is required to do that) would be afar better choice.

IIRC, Michel Fortin's work went in that direction, and he linkedto his code in another post, so I'd suggest at least looking atthat for ideas.

Regardless, by building layers of XML parsers rather than justthe standard ones, it should be possible to get higherperformance while still having the more standard, user-friendlyones for those that don't need the full performance and do needthe user-friendliness (though of course, we do want the SAX andDOM parsers to be efficient as well).


- Jonathan M Davis

Re: std.xml2 (collecting features)

Reply via email to