Re: std.jgrandson

Sean Kelly via Digitalmars-d Sun, 03 Aug 2014 13:46:27 -0700

On Sunday, 3 August 2014 at 17:40:48 UTC, Andrei Alexandrescuwrote:

On 8/3/14, 10:19 AM, Sean Kelly wrote:
I don't want to pay for anything I don't use. No allocationsshouldoccur within the parser and it should simply slice up theinput.
What to do about arrays and objects, which would naturallyallocate arrays and associative arrays respectively? What aboutstrings with backslash-encoded characters?

This is tricky with a range. With an event-based parser I'd haveevents for object and array begin / end, but with a range you endup having an element that's a token, which is pretty weird. Forencoded characters (and you need to make sure you handlesurrogate pairs in your decoder) I'd still provide some means ofdecoding on demand. If nothing else, decode lazily when the userasks for the string value. That way the user isn't paying todecode strings he isn't interested in.

No allocation works for tokenization, but parsing is a wholedifferent matter.
So the
lowest layer should allow me to iterate across symbols in someway.
Yah, that would be the tokenizer.

But that will halt on comma and colon and such, correct? That'sa tad lower than I'd want, though I guess it would be easy enoughto build a parser on top of it.

When I've done this in the past it was SAX-style (ie. acallback per
type) but with the range interface that shouldn't be necessary.
The parser shouldn't decode or convert anything unless I askit to.Most of the time I only care about specific values, and payingfor
conversions on everything is wasted process time.
That's tricky. Once you scan for 2 specific characters you mayas well scan for a couple more, the added cost is negligible.In contrast, scanning once for finding termination and thenagain for decoding purposes will definitely be a lot moreexpensive.

I think I'm getting a bit confused. For the JSON parser I wrote,the parser performs full validation but leaves the content as-is,then provides a routine to decode values from their stringrepresentation if the user wishes to. I'm not sure where scanningfigures in here.

Andrei

Re: std.jgrandson

Reply via email to