...and now I understand an issue I had with Regex.compile 6 months ago and didn't take the time to ask anyone smarter than myself about. Onorio - your strategy is better!
On Mon, Jun 6, 2016 at 2:23 PM, José Valim <[email protected]> wrote: > Yes, contributions to the docs are always welcome. In particular, we > should make it clear in Regex.compile that entries must be double escaped > or a sigil like ~S must be used to avoid confusion. > > > On Monday, June 6, 2016, Johnny Winn <[email protected]> wrote: > >> José, >> >> I figured that was what was happening but looking at the code/tests/docs >> it isn't clear that's what is supposed to happen. Maybe it's a lesser issue >> but some clarification in the docs/tests might help for next time and I >> wouldn't mind contributing if you're open to that. >> >> Thanks, >> Johnny >> >> >> On Mon, Jun 6, 2016 at 2:59 PM, Onorio Catenacci <[email protected]> >> wrote: >> >>> Thanks José. This is why I asked before submitting a bug. >>> >>> On Mon, Jun 6, 2016 at 2:50 PM, José Valim < >>> [email protected]> wrote: >>> >>>> If you call Regex.compile, then you are passing a string which means \w >>>> must now be \\w. If you just pass \w, you can see how it becomes simply w, >>>> because \ was discarded by the *string* and never made to the regex >>>> compile. >>>> >>>> Since \d represents the delete escape character, it becomes the >>>> codepoint 007F before it ever enters the regex and so the regex needs to >>>> escape it using \x to avoid ambiguity with its own \d, so not a bug. :) >>>> >>>> On Monday, June 6, 2016, Onorio Catenacci <[email protected]> wrote: >>>> >>>>> Yeah I think you've hit on the problem Johnny. It does appear the \d >>>>> is the issue. >>>>> >>>>> iex(10)> {:ok, rc} = Regex.compile("\d") >>>>> {:ok, ~r/\x7F/} >>>>> iex(11)> {:ok, rc} = Regex.compile("\w") >>>>> {:ok, ~r/w/} >>>>> >>>>> As I say, I can certainly do this without having to compile the regex >>>>> but it sure looks like a bug (albeit a minor one as far as I can tell). >>>>> >>>>> -- >>>>> Onorio >>>>> >>>>> >>>>> On Mon, Jun 6, 2016 at 2:37 PM, Johnny Winn <[email protected]> >>>>> wrote: >>>>> >>>>>> This would be a bug from what I can see. the `\` seems to be giving >>>>>> `Regex.compile/1` a fit. >>>>>> >>>>>> On a mac, Elixir 1.2.5: >>>>>> >>>>>> iex(1)> {:ok, rc} = Regex.compile("(?<version>\d+.\d+.\d+.\d+)*$") >>>>>> {:ok, ~r/(?<version>\x7F+.\x7F+.\x7F+.\x7F+)*$/} >>>>>> iex(2)> {:ok, rc} = Regex.compile("(?<version>\d+.\w+.\w+.\w+)*$") >>>>>> {:ok, ~r/(?<version>\x7F+.w+.w+.w+)*$/} >>>>>> >>>>>> If Jose is ok with you submitting the bug, I can take a look at it :) >>>>>> >>>>>> ~ Johnny >>>>>> >>>>>> On Mon, Jun 6, 2016 at 2:28 PM, Onorio Catenacci <[email protected] >>>>>> > wrote: >>>>>> >>>>>>> Hi all, >>>>>>> >>>>>>> Not sure if this is a bug or simply me misunderstanding the >>>>>>> mechanism of regex's. I hope no one minds me asking about this here so >>>>>>> I >>>>>>> don't file a bug if it's just a misunderstanding on my part. >>>>>>> >>>>>>> On Windows 10 with Elixir 1.2.6, I've been working on a regex to >>>>>>> pull the version portion of a string. This works: >>>>>>> >>>>>>> iex(4)> v = >>>>>>> Regex.named_captures(~r/(?<version>\d+.\d+.\d+.\d+)*$/,"Version=15.0.4815.1002") >>>>>>> %{"version" => "15.0.4815.1002"} >>>>>>> iex(5)> v >>>>>>> %{"version" => "15.0.4815.1002"} >>>>>>> iex(6)> v["version"] >>>>>>> "15.0.4815.1002" >>>>>>> >>>>>>> But as I was playing with the regex, I found this which struck me as >>>>>>> sort of curious: >>>>>>> >>>>>>> iex(7)> {:ok, rc} = Regex.compile("(?<version>\d+.\d+.\d+.\d+)*$") >>>>>>> {:ok, ~r/(?<version>\x7F+.\x7F+.\x7F+.\x7F+)*$/} >>>>>>> iex(8)> rc >>>>>>> ~r/(?<version>\x7F+.\x7F+.\x7F+.\x7F+)*$/ >>>>>>> >>>>>>> Which when I try with named_captures gives me this: >>>>>>> >>>>>>> iex(9)> v = Regex.named_captures(rc,"Version=15.0.4815.1002") >>>>>>> %{"version" => ""} >>>>>>> >>>>>>> Knowing this is Windows 10 and knowing the issues we've had with >>>>>>> unicode over the years, I ran chcp 65001. The problem still occurs. >>>>>>> It's a >>>>>>> minor issue because as you can see, I've got a working regex. >>>>>>> >>>>>>> So am I misunderstanding something or is this a bug? Any advice >>>>>>> would be appreciated. >>>>>>> >>>>>>> -- >>>>>>> Onorio >>>>>>> >>>>>>> >>>>>>> >>>>>>> -- >>>>>>> You received this message because you are subscribed to the Google >>>>>>> Groups "elixir-lang-core" group. >>>>>>> To unsubscribe from this group and stop receiving emails from it, >>>>>>> send an email to [email protected]. >>>>>>> To view this discussion on the web visit >>>>>>> https://groups.google.com/d/msgid/elixir-lang-core/d921ceaf-de00-4137-ac6e-dc11209d3db8%40googlegroups.com >>>>>>> <https://groups.google.com/d/msgid/elixir-lang-core/d921ceaf-de00-4137-ac6e-dc11209d3db8%40googlegroups.com?utm_medium=email&utm_source=footer> >>>>>>> . >>>>>>> For more options, visit https://groups.google.com/d/optout. >>>>>>> >>>>>> >>>>>> -- >>>>>> You received this message because you are subscribed to a topic in >>>>>> the Google Groups "elixir-lang-core" group. >>>>>> To unsubscribe from this topic, visit >>>>>> https://groups.google.com/d/topic/elixir-lang-core/7KxqS9FW5vw/unsubscribe >>>>>> . >>>>>> To unsubscribe from this group and all its topics, send an email to >>>>>> [email protected]. >>>>>> To view this discussion on the web visit >>>>>> https://groups.google.com/d/msgid/elixir-lang-core/CALBtUW6aTpPG3rNsmNEzCdxvtwyyfQVhANYvdpvORF%2Bb7XCqCw%40mail.gmail.com >>>>>> <https://groups.google.com/d/msgid/elixir-lang-core/CALBtUW6aTpPG3rNsmNEzCdxvtwyyfQVhANYvdpvORF%2Bb7XCqCw%40mail.gmail.com?utm_medium=email&utm_source=footer> >>>>>> . >>>>>> >>>>>> For more options, visit https://groups.google.com/d/optout. >>>>>> >>>>> >>>>> >>>>> >>>>> -- >>>>> Onorio Catenacci >>>>> >>>>> http://onor.io >>>>> http://www.google.com/+OnorioCatenacci >>>>> >>>>> -- >>>>> You received this message because you are subscribed to the Google >>>>> Groups "elixir-lang-core" group. >>>>> To unsubscribe from this group and stop receiving emails from it, send >>>>> an email to [email protected]. >>>>> To view this discussion on the web visit >>>>> https://groups.google.com/d/msgid/elixir-lang-core/CAP%3DvNq8KzoK_hBq8RxyOUEUSUCaWUcN6kifMGAvfRidQ2AZPdg%40mail.gmail.com >>>>> <https://groups.google.com/d/msgid/elixir-lang-core/CAP%3DvNq8KzoK_hBq8RxyOUEUSUCaWUcN6kifMGAvfRidQ2AZPdg%40mail.gmail.com?utm_medium=email&utm_source=footer> >>>>> . >>>>> For more options, visit https://groups.google.com/d/optout. >>>>> >>>> >>>> >>>> -- >>>> >>>> >>>> *José Valim* >>>> www.plataformatec.com.br >>>> Skype: jv.ptec >>>> Founder and Director of R&D >>>> >>>> -- >>>> You received this message because you are subscribed to a topic in the >>>> Google Groups "elixir-lang-core" group. >>>> To unsubscribe from this topic, visit >>>> https://groups.google.com/d/topic/elixir-lang-core/7KxqS9FW5vw/unsubscribe >>>> . >>>> To unsubscribe from this group and all its topics, send an email to >>>> [email protected]. >>>> To view this discussion on the web visit >>>> https://groups.google.com/d/msgid/elixir-lang-core/CAGnRm4JQXDM-9pPZu4kHzyUk2-rTQVA6Ge-4shyEN%3D0jd-k7qw%40mail.gmail.com >>>> <https://groups.google.com/d/msgid/elixir-lang-core/CAGnRm4JQXDM-9pPZu4kHzyUk2-rTQVA6Ge-4shyEN%3D0jd-k7qw%40mail.gmail.com?utm_medium=email&utm_source=footer> >>>> . >>>> >>>> For more options, visit https://groups.google.com/d/optout. >>>> >>> >>> >>> >>> -- >>> Onorio Catenacci >>> >>> http://onor.io >>> http://www.google.com/+OnorioCatenacci >>> >>> -- >>> You received this message because you are subscribed to the Google >>> Groups "elixir-lang-core" group. >>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to [email protected]. >>> To view this discussion on the web visit >>> https://groups.google.com/d/msgid/elixir-lang-core/CAP%3DvNq8kkLDh%3D-%3DQ%2BGDhNdCCuQgbNnr95Rf5wUh5AmETURE5dA%40mail.gmail.com >>> <https://groups.google.com/d/msgid/elixir-lang-core/CAP%3DvNq8kkLDh%3D-%3DQ%2BGDhNdCCuQgbNnr95Rf5wUh5AmETURE5dA%40mail.gmail.com?utm_medium=email&utm_source=footer> >>> . >>> >>> For more options, visit https://groups.google.com/d/optout. >>> >> >> -- >> You received this message because you are subscribed to the Google Groups >> "elixir-lang-core" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected]. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/elixir-lang-core/CALBtUW5VWcm6fJF_4GAd4-9v1PcnELY-FAdht1VG%2BtGTx%3DK4Hg%40mail.gmail.com >> <https://groups.google.com/d/msgid/elixir-lang-core/CALBtUW5VWcm6fJF_4GAd4-9v1PcnELY-FAdht1VG%2BtGTx%3DK4Hg%40mail.gmail.com?utm_medium=email&utm_source=footer> >> . >> For more options, visit https://groups.google.com/d/optout. >> > > > -- > > > *José Valim* > www.plataformatec.com.br > Skype: jv.ptec > Founder and Director of R&D > > -- > You received this message because you are subscribed to the Google Groups > "elixir-lang-core" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To view this discussion on the web visit > https://groups.google.com/d/msgid/elixir-lang-core/CAGnRm4%2Ber74FpCYxLUnFqNfqLdRPYhdBhM4jRzmAXzui%3DySP1Q%40mail.gmail.com > <https://groups.google.com/d/msgid/elixir-lang-core/CAGnRm4%2Ber74FpCYxLUnFqNfqLdRPYhdBhM4jRzmAXzui%3DySP1Q%40mail.gmail.com?utm_medium=email&utm_source=footer> > . > > For more options, visit https://groups.google.com/d/optout. > -- Josh Adams -- You received this message because you are subscribed to the Google Groups "elixir-lang-core" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/elixir-lang-core/CAA1-O0yKRkMxt-5sE07ijbg4oL6b6bHC18sXz3dYtX2KgKAwLQ%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.
