José, I figured that was what was happening but looking at the code/tests/docs it isn't clear that's what is supposed to happen. Maybe it's a lesser issue but some clarification in the docs/tests might help for next time and I wouldn't mind contributing if you're open to that.
Thanks, Johnny On Mon, Jun 6, 2016 at 2:59 PM, Onorio Catenacci <[email protected]> wrote: > Thanks José. This is why I asked before submitting a bug. > > On Mon, Jun 6, 2016 at 2:50 PM, José Valim < > [email protected]> wrote: > >> If you call Regex.compile, then you are passing a string which means \w >> must now be \\w. If you just pass \w, you can see how it becomes simply w, >> because \ was discarded by the *string* and never made to the regex compile. >> >> Since \d represents the delete escape character, it becomes the codepoint >> 007F before it ever enters the regex and so the regex needs to escape it >> using \x to avoid ambiguity with its own \d, so not a bug. :) >> >> On Monday, June 6, 2016, Onorio Catenacci <[email protected]> wrote: >> >>> Yeah I think you've hit on the problem Johnny. It does appear the \d is >>> the issue. >>> >>> iex(10)> {:ok, rc} = Regex.compile("\d") >>> {:ok, ~r/\x7F/} >>> iex(11)> {:ok, rc} = Regex.compile("\w") >>> {:ok, ~r/w/} >>> >>> As I say, I can certainly do this without having to compile the regex >>> but it sure looks like a bug (albeit a minor one as far as I can tell). >>> >>> -- >>> Onorio >>> >>> >>> On Mon, Jun 6, 2016 at 2:37 PM, Johnny Winn <[email protected]> wrote: >>> >>>> This would be a bug from what I can see. the `\` seems to be giving >>>> `Regex.compile/1` a fit. >>>> >>>> On a mac, Elixir 1.2.5: >>>> >>>> iex(1)> {:ok, rc} = Regex.compile("(?<version>\d+.\d+.\d+.\d+)*$") >>>> {:ok, ~r/(?<version>\x7F+.\x7F+.\x7F+.\x7F+)*$/} >>>> iex(2)> {:ok, rc} = Regex.compile("(?<version>\d+.\w+.\w+.\w+)*$") >>>> {:ok, ~r/(?<version>\x7F+.w+.w+.w+)*$/} >>>> >>>> If Jose is ok with you submitting the bug, I can take a look at it :) >>>> >>>> ~ Johnny >>>> >>>> On Mon, Jun 6, 2016 at 2:28 PM, Onorio Catenacci <[email protected]> >>>> wrote: >>>> >>>>> Hi all, >>>>> >>>>> Not sure if this is a bug or simply me misunderstanding the mechanism >>>>> of regex's. I hope no one minds me asking about this here so I don't file >>>>> a bug if it's just a misunderstanding on my part. >>>>> >>>>> On Windows 10 with Elixir 1.2.6, I've been working on a regex to pull >>>>> the version portion of a string. This works: >>>>> >>>>> iex(4)> v = >>>>> Regex.named_captures(~r/(?<version>\d+.\d+.\d+.\d+)*$/,"Version=15.0.4815.1002") >>>>> %{"version" => "15.0.4815.1002"} >>>>> iex(5)> v >>>>> %{"version" => "15.0.4815.1002"} >>>>> iex(6)> v["version"] >>>>> "15.0.4815.1002" >>>>> >>>>> But as I was playing with the regex, I found this which struck me as >>>>> sort of curious: >>>>> >>>>> iex(7)> {:ok, rc} = Regex.compile("(?<version>\d+.\d+.\d+.\d+)*$") >>>>> {:ok, ~r/(?<version>\x7F+.\x7F+.\x7F+.\x7F+)*$/} >>>>> iex(8)> rc >>>>> ~r/(?<version>\x7F+.\x7F+.\x7F+.\x7F+)*$/ >>>>> >>>>> Which when I try with named_captures gives me this: >>>>> >>>>> iex(9)> v = Regex.named_captures(rc,"Version=15.0.4815.1002") >>>>> %{"version" => ""} >>>>> >>>>> Knowing this is Windows 10 and knowing the issues we've had with >>>>> unicode over the years, I ran chcp 65001. The problem still occurs. It's >>>>> a >>>>> minor issue because as you can see, I've got a working regex. >>>>> >>>>> So am I misunderstanding something or is this a bug? Any advice would >>>>> be appreciated. >>>>> >>>>> -- >>>>> Onorio >>>>> >>>>> >>>>> >>>>> -- >>>>> You received this message because you are subscribed to the Google >>>>> Groups "elixir-lang-core" group. >>>>> To unsubscribe from this group and stop receiving emails from it, send >>>>> an email to [email protected]. >>>>> To view this discussion on the web visit >>>>> https://groups.google.com/d/msgid/elixir-lang-core/d921ceaf-de00-4137-ac6e-dc11209d3db8%40googlegroups.com >>>>> <https://groups.google.com/d/msgid/elixir-lang-core/d921ceaf-de00-4137-ac6e-dc11209d3db8%40googlegroups.com?utm_medium=email&utm_source=footer> >>>>> . >>>>> For more options, visit https://groups.google.com/d/optout. >>>>> >>>> >>>> -- >>>> You received this message because you are subscribed to a topic in the >>>> Google Groups "elixir-lang-core" group. >>>> To unsubscribe from this topic, visit >>>> https://groups.google.com/d/topic/elixir-lang-core/7KxqS9FW5vw/unsubscribe >>>> . >>>> To unsubscribe from this group and all its topics, send an email to >>>> [email protected]. >>>> To view this discussion on the web visit >>>> https://groups.google.com/d/msgid/elixir-lang-core/CALBtUW6aTpPG3rNsmNEzCdxvtwyyfQVhANYvdpvORF%2Bb7XCqCw%40mail.gmail.com >>>> <https://groups.google.com/d/msgid/elixir-lang-core/CALBtUW6aTpPG3rNsmNEzCdxvtwyyfQVhANYvdpvORF%2Bb7XCqCw%40mail.gmail.com?utm_medium=email&utm_source=footer> >>>> . >>>> >>>> For more options, visit https://groups.google.com/d/optout. >>>> >>> >>> >>> >>> -- >>> Onorio Catenacci >>> >>> http://onor.io >>> http://www.google.com/+OnorioCatenacci >>> >>> -- >>> You received this message because you are subscribed to the Google >>> Groups "elixir-lang-core" group. >>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to [email protected]. >>> To view this discussion on the web visit >>> https://groups.google.com/d/msgid/elixir-lang-core/CAP%3DvNq8KzoK_hBq8RxyOUEUSUCaWUcN6kifMGAvfRidQ2AZPdg%40mail.gmail.com >>> <https://groups.google.com/d/msgid/elixir-lang-core/CAP%3DvNq8KzoK_hBq8RxyOUEUSUCaWUcN6kifMGAvfRidQ2AZPdg%40mail.gmail.com?utm_medium=email&utm_source=footer> >>> . >>> For more options, visit https://groups.google.com/d/optout. >>> >> >> >> -- >> >> >> *José Valim* >> www.plataformatec.com.br >> Skype: jv.ptec >> Founder and Director of R&D >> >> -- >> You received this message because you are subscribed to a topic in the >> Google Groups "elixir-lang-core" group. >> To unsubscribe from this topic, visit >> https://groups.google.com/d/topic/elixir-lang-core/7KxqS9FW5vw/unsubscribe >> . >> To unsubscribe from this group and all its topics, send an email to >> [email protected]. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/elixir-lang-core/CAGnRm4JQXDM-9pPZu4kHzyUk2-rTQVA6Ge-4shyEN%3D0jd-k7qw%40mail.gmail.com >> <https://groups.google.com/d/msgid/elixir-lang-core/CAGnRm4JQXDM-9pPZu4kHzyUk2-rTQVA6Ge-4shyEN%3D0jd-k7qw%40mail.gmail.com?utm_medium=email&utm_source=footer> >> . >> >> For more options, visit https://groups.google.com/d/optout. >> > > > > -- > Onorio Catenacci > > http://onor.io > http://www.google.com/+OnorioCatenacci > > -- > You received this message because you are subscribed to the Google Groups > "elixir-lang-core" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To view this discussion on the web visit > https://groups.google.com/d/msgid/elixir-lang-core/CAP%3DvNq8kkLDh%3D-%3DQ%2BGDhNdCCuQgbNnr95Rf5wUh5AmETURE5dA%40mail.gmail.com > <https://groups.google.com/d/msgid/elixir-lang-core/CAP%3DvNq8kkLDh%3D-%3DQ%2BGDhNdCCuQgbNnr95Rf5wUh5AmETURE5dA%40mail.gmail.com?utm_medium=email&utm_source=footer> > . > > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "elixir-lang-core" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/elixir-lang-core/CALBtUW5VWcm6fJF_4GAd4-9v1PcnELY-FAdht1VG%2BtGTx%3DK4Hg%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.
