Re: [jvm-l] Perl 5 on the JVM (reasons why not)

Rémi Forax Wed, 26 May 2010 09:23:02 -0700

On 26/05/2010 17:48, Brian Hurt wrote:

On Wed, May 26, 2010 at 11:27 AM, Attila Szegedi <[email protected]> wrote:
    Yeah, but my point is that you can use *any* character after Q to
    specify the delimiter, i.e.:

     %Qafooa

    is equal to

     "foo"

    so your lexer must be ready to use as a terminating character
    whatever character follows immediately after %Q. It's just that
    it'll have special cases for some chars - most notably
    parentheses, brackets, and braces, so a string starting with %Q{
    will not be terminated by { but by }. Most people will use { and
    }, but the point is that you are free to use *any* character.
I'm not sure this feature is doable in a sane way in a classic lex/yacc parser. I think you can, in classic lex, drop down to a hand-rolled lexer if you need to, but this is a serious code smell.
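
To make the delimiter rule quoted above concrete, here is a minimal sketch of a hand-rolled scanner for the %Q form. It is an illustration only, not code from this thread: the names `scan_percent_q` and `PAIRS` are hypothetical, and nesting of paired delimiters is an assumption of the sketch. It follows the rule as described in the quoted message and does not attempt to match any real Ruby implementation's rules on which delimiter characters are permitted. Escapes and interpolation are ignored.

```python
# Hypothetical sketch: scan a %Q literal whose closing delimiter is chosen
# by the character that immediately follows "%Q".

# Openers with a distinct closing partner (the special cases named above).
PAIRS = {"(": ")", "[": "]", "{": "}"}


def scan_percent_q(src, pos=0):
    """Return (body, index_after_literal) for a %Q literal at src[pos].

    Assumptions of this sketch: paired delimiters nest, and escapes and
    interpolation inside the literal are not handled.
    """
    if not src.startswith("%Q", pos) or pos + 2 >= len(src):
        raise ValueError("expected %Q followed by a delimiter")

    opener = src[pos + 2]
    # Any character not in PAIRS terminates the literal with itself.
    closer = PAIRS.get(opener, opener)

    depth = 1
    i = pos + 3
    while i < len(src):
        ch = src[i]
        if opener != closer and ch == opener:
            depth += 1          # nested opener, e.g. %Q{a{b}c}
        elif ch == closer:
            depth -= 1
            if depth == 0:
                return src[pos + 3:i], i + 1
        i += 1
    raise ValueError("unterminated %Q literal")
```

The closing character is only known once the opener has been read, which is why this sits awkwardly in a fixed set of lexer patterns: calling `scan_percent_q("%Qafooa")` and `scan_percent_q("%Q{foo}")` both yield the body `foo`, with different terminators.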
However, this misses the point of my original post. If you're parsing an existing language, then you don't get a choice in features. If you're parsing Ruby, you can't choose to not implement this feature. Thus, if lex and yacc can't handle this feature, you can't use lex and yacc to parse Ruby. This is where fancier parsers with more features and greater ability to handle weird syntaxes become really useful.
If you're creating a language, you have a choice: you can include hard-to-parse features or not. And the thing to remember is that there is a cost to adding the features: every hard-to-parse feature you add reduces the number of parsers for your language other people are willing to write, thus limiting the portability of the language, limiting the number of tools for the language, etc. Even worse, every one of these features you add increases the likelihood that people who do implement other parsers for your language get it subtly wrong. Add enough of these features and there will only ever be one parser for your language: yours.
This isn't to say that you shouldn't add these features; it's that you should be aware of the trade-offs you are making, and be making them deliberately and not accidentally. With more powerful/flexible parser generators, it's much easier to add these sorts of features accidentally and paint yourself into a corner (and it's even more likely you will do this if you're implementing a hand-written parser and not using a parser generator at all).
Brian

At the same time, users don't want to understand parser/grammar limitations.

Users want things like optional semicolons, as in JavaScript or Groovy,
generics (Foo<>) or XML literals (<foo/>) together with the traditional less-than (a < foo),
XPath query literals (document//node/*) but also // to start a comment,
HTTP literals together with ?: expressions, etc.

So there is a tension between having a syntax that any tool can parse and
being able to have some nice constructs in the language.

Rémi

--
You received this message because you are subscribed to the Google Groups "JVM 
Languages" group.
