Parsing indent-sensitive languages

Dave Whipp Thu, 08 Sep 2005 08:37:47 -0700

If I want to parse a language that is sensitive to whitespace indentation (e.g. Python, Haskell), how do I do it using P6 rules/grammars?

The way I'd usually handle it is to have a lexer that examines leading whitespace and converts it into "indent" and "unindent" tokens. The grammar can then use these tokens in the same way that it would any other block delimiter.

This requires a stateful lexer, because to work out the number of "unindent" tokens on a line, it needs to know what the indentation positions are. How would I write a P6 rule that defines <indent> and <unindent> tokens? Alternatively (if a different approach is needed) how would I use P6 to parse such a language?
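
To make the stateful-lexer approach concrete, here is a minimal sketch in Python (not P6). It is only an illustration of the technique described above: the function name tokenize and the token names INDENT, UNINDENT and LINE are hypothetical, and it assumes indentation is written with spaces only and that blank lines can be ignored.

```python
def tokenize(source):
    """Yield (kind, text) tokens, where kind is INDENT, UNINDENT or LINE.

    Hypothetical sketch: assumes space-only indentation.
    """
    stack = [0]  # the lexer's state: indentation columns of the open blocks
    for raw in source.splitlines():
        if not raw.strip():
            continue  # blank lines do not affect block structure
        width = len(raw) - len(raw.lstrip(" "))
        if width > stack[-1]:
            # Deeper than the current block: open exactly one new block.
            stack.append(width)
            yield ("INDENT", "")
        while width < stack[-1]:
            # Shallower: close one block per remembered position we pass.
            stack.pop()
            yield ("UNINDENT", "")
        if width != stack[-1]:
            raise ValueError(f"inconsistent indentation: {raw!r}")
        yield ("LINE", raw.strip())
    # Close any blocks still open at end of input.
    while len(stack) > 1:
        stack.pop()
        yield ("UNINDENT", "")
```

The stack is the state in question: a single line can emit several UNINDENT tokens, and how many depends on which indentation positions were pushed by earlier lines.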
