Re: Fwd: URI replacement pseudocode

Moritz Lenz Mon, 17 May 2010 12:34:33 -0700


Aaron Sherman wrote:
> Ooops, took this off-list by accident.
> 
> ---------- Forwarded message ----------
> From: ajs <a...@ajs.com>
> Date: Mon, May 17, 2010 at 2:59 PM
> Subject: Re: URI replacement pseudocode
> To: Moritz Lenz <mor...@faui2k3.org>
> 
> 
> Thank you for your responses!
> 
> On Mon, May 17, 2010 at 1:37 PM, Moritz Lenz <mor...@faui2k3.org> wrote:
> 
>> Aaron Sherman wrote:
>> > Here's the code:
>> >
>> >
>> https://docs.google.com/leaf?id=0B41eVYcoggK7YjdkMzVjODctMTAxMi00ZGE0LWE1OTAtZTg1MTY0Njk5YjY4&hl=en
>>
>> I think your code would benefit greatly from actually trying to get run
>> with Rakudo (or at least that parts that are yet implemented), as well
>> as from a version control system.
>>
> 
> (re: storage. yes, I intend to get this into something. not sure what, yet.
> git is preferred, I presume?)


Yes, but it's really your decision in the end.

> I had a hard time even getting basic code working like:
> 
>   token foo { blah }
>   if "blah" ~~ m/<foo>/ { say "blah!" }
> 
> (See my question to the list, last week)

Right. What works today is

grammar Foo {
   token TOP { <foo> }
   token foo { blah }
}

if Foo.parse('blah') {
   say "yes"
}

>> > So, my questions are:
>> >
>> > * Is this code doing anything that is explicitly not Perl 6ish?
>>
>> Some things I've noticed:
>> * you put lots of subs into roles - you probably meant methods
>>
> 
> Well... that's a fair question. What does a method mean in a grammar? I
> wasn't too clear on what being a method of a grammar meant. Should I be
> calling these as class-methods?

a grammar is really just a class that inherits from Grammar. So the
answer is "it means the same as in a class".

> 
>> * Don't inherit from roles, implement them with 'does'
>>
> 
> I did that, didn't I? Did I typo something?
> 
>    grammar URI::rfc2396 does URI::Grammarish ...
> 

and

grammarb URI::rfc3986_regex is URI::Grammarish

that's what I meant

>> * the grammars contain a mixture of tokens for parsing and of
>> methods/subs for data extraction; yet Perl 6 offers a nice way to
>> separate the two, in the form of action/reduction methods; your code
>> might benefit from them.
>>
> 
> Do you have a pointer for some discussion of this? I'd love to pursue it.


http://github.com/perl6/book/raw/master/src/grammars.pod

(that chapter still uses the outdated {*} rules - if you read about
them, ignore them, and instead know that the corresponding action method
is always called implicitly at the end of each named rule).


>> * class URI::GrammarType seems not very extensible... maybe keep a hash
>> of URI names that map to URIs, which can be extended by method calls?
>>
> 
> The idea that I was working with was that you would provide the grammar
> itself when you wanted to do something custom, and the string names were
> just a convenience for the default cases.  So, for example:
> 
>   my URI $privatewww .= new("ajs://perl**6", :spec(::MyURI::Spec));

Fair enough.

>> > * Should I hold off until R* to even try to convert this into working
>> code?
>>
>> No need for that. The support for grammars and roles is pretty good,
>> character classes and match objects are still a bit unstable/whacky.
>>
> 
> Is there any collected wisdom available on this? I'd love to not run around
> chasing my own tail trying to figure out why something doesn't work.

it's called #perl6, and is our IRC channel :-)
Writing down such volatile information isn't very useful, because it
becomes outdated rather quickly.

>> > * Am I correct in assuming that <...> in a regex is intended to allow the
>> > creation of interface roles for grammars?
>>
>> You lost me here. <identifier(...)> calls a named rule (with arguments).
>> Could you rephrase your question?
> 
> 
> Sure.
> 
> All S05 says is "The <...>, <???>, and <!!!> special tokens have the same
> "not-defined-yet" meanings within regexes that the bare elipses have in
> ordinary code." Which doesn't tell me a lot, but seems to imply that:
> 
> role blah { token bletch { <...> } }
> 
> is roughly analogous to:
> 
> role blah { method bletch {...} }
> 
> that is to say, the role should have an interface which, when applied to a
> grammar, would assert the presence of a bletch token. Am I reading too much
> into this?

Don't know...

> If yes, is there a way to assert role-based interfaces on
> grammars? The main reason I wanted this was for the very parametric grammar
> selection we were talking about, above, where the given block says:
> 
> given $type {
>     when .does(URI::Grammarish) { $.gtype = $_ }
>
> I'm assuming, of course, that I can make such assertions about a grammar in
> the same way that I would make them about a class. Is this true?

Yes. But you could just say

given $type {
    when Grammar { $!gtype = $_ }
    ...
}

and accept any grammar there. Not as type-safe, but probably a good start.

> Have I
> identified an interface token/rule correctly given that that was my goal?
> 
> 
>> * I guessed wildly at how I should be invoking the match against a saved
>> > "token" reference:
>> >         if $s ~~ m/^ <.$.spec.gtype.URI_reference> $/ {
>> >   is that correct?
>>
>> probably just $s ~~ /^ $regex $/;
>>
> 
> But what should $regex contain? I have a $.spec which contains a reference
> to the URI::GrammarType object whose $.gtype identifies the grammar I should
> be using. That grammar is guaranteed to have a URI_reference rule, so the
> variable is:
> 
>   $.spec.gtype.URI_reference
> 
> Should that be:
> 
>   if $s ~~ m/^ <$.spec.gtype.URI_reference> $/ { ... }
> 
> then? Do I need to copy that into a lexical to avoid confusing the rule
> parser? I think I'll do that, just to avoid all the confusion anyway.

Then probably

$.spec.gtype.parse($string)

is best, and make sure that the grammar stored in $.spec.gtype has a TOP
token/regex (which is then called by .parse)

Cheers,
Moritz

Re: Fwd: URI replacement pseudocode

Reply via email to