Re: Let's stop parser Hell

Roman D. Boiko Sat, 07 Jul 2012 14:50:24 -0700

On Saturday, 7 July 2012 at 21:35:52 UTC, Dmitry Olshansky wrote:

On 08-Jul-12 01:24, Roman D. Boiko wrote:
But PEG itself doesn't require backtracking parsing, does it?
It does. Specifically the algorithm is to try option A, tryoption B, try option C...

There is no algorithm, only __grammar__. It specifies that optionA has higher priority than option B in expression A / B / C. Butit doesn't forbid, for example, to analyze all in parallel (trackstate transitions) for each character consequently, as a properDFA implementation would do, until some option succeeds and allprevious fail. A greedy regex would also have to check allsuccessor options (C in the exapmle above) to determine thelongest one.

it's obvious how backtracking comes in ... whan say A failssomewhere in the middle of it.

Simply stop tracking respective state. Process others in parallelas I described above.

Of course there is a fair amount of bookkeeping that reducesdoing work all over again but it does ... allocate.

The amount of used memory would be proportional to the length ofinput after the branching point times number of rules (actually,of states in the DFA, which is likely larger but still finite fora particular grammar).

Tokens.. there is no such term in use if we talk about 'pure'PEG.
Terminal symbols.
Characters.

Nope. According to the definition, PEG is a set of non-terminalsymbols, terminal symbols, production rules, and a startingnon-terminal. Terminals are not necessarily characters.

Re: Let's stop parser Hell

Reply via email to