Re: Let's stop parser Hell

Roman D. Boiko Sat, 07 Jul 2012 03:25:46 -0700

On Saturday, 7 July 2012 at 09:06:57 UTC, Roman D. Boiko wrote:

http://stackoverflow.com/questions/11373644/performance-of-parsers-peg-vs-lalr1-or-llk

So far it looks like LALR parsers may have lower constant factorsthan Packrat.

The difference could be minimized by paying attention to parsingof terminal symbols, which was in my plans already. It is notnecessary to strictly follow Packrat parsing algorithm.

The benefits of Pegged, in my view, are its support of ParsingExpression Grammar (PEG) and compile-time evaluation. It iseasily extensible and modifiable.

When I implemented recursive-descent parser by hand in one ofearly drafts of DCT, I strongly felt the need to generalize codein a way which in retrospect I would call PEG-like. The structureof my hand-written recursive-descent parser was a one-to-onemapping to an implemented subset of D specification, and Iconsidered it problematic, because it was needed to duplicate thesame structure in the resulting AST.

PEG is basically a language that describes both, theimplementation of parser, and the language syntax. It greatlyreduces implicit code duplication.

I think that generated code can be made almost as fast as ahand-written parser for a particular language (probably, a fewpercent slower). Especially if that language is similar to D(context-free, with fine-grained hierarchical grammar).Optimizations might require to forget about strictly followingany theoretical algorithm, but that should be OK.

Re: Let's stop parser Hell

Reply via email to