Re: std.d.lexer: pre-voting review / discussion

deadalnix Wed, 11 Sep 2013 22:07:25 -0700

On Thursday, 12 September 2013 at 03:37:55 UTC, H. S. Teoh wrote:

I still don't understand why backtracking is necessary in thefirstplace. I would've thought a modern parser should be well ableto encodeseen tokens in its state so that backtracking is nevernecessary. Ordoes D grammar have tricky bits that cannot be handled thisway, that
I'm unaware of?

The problem is that it can cause a exponential (and I literallymean exponential here) amount of complexity.

In some cases, the complexity is manageable, but in other thatdon't make any sense (it has to be noted that even full lexingdon't make any sens here). For instance :


int foo()() {}
       ^

When you are at the caret position, you don't know if you face afunction declaration or a template declaration. You could go forsome ambiguous parsing, but each template argument can itself bea type, an expression or a symbol.

In such situation, it is much simpler to skip input to theclosing parentheses, check what's coming after and actaccordingly. The alternative is to go for some ambiguousfunction/template parameters parsing and resolve at the end, butas template argument are themselves ambiguoustype/expression/symbols, the amount of complexity in the parseris doomed to explode. Note that the example is not selfcontained, for instance, both template parameters and argumentcan imply template instantiation, which means ambiguous argumentparsing.

SDC does a good deal of ambiguous parsing without backtracking(more than DMD does), but you got to draw the line somewhere.

What I'd like to see is a go to the closing token feature, thatcan eventually take a fast path to do so, or an efficient tokenbuffering system (I'm not sure which feature would be thefastest, but I'm inclined to think this is the first one,however, that won't be suited for a dmd style parser).

Re: std.d.lexer: pre-voting review / discussion

Reply via email to