On Tue, Nov 14, 2017 at 10:36 PM, Nikolai Aleksandrovich Pavlov <[email protected]> wrote:
> @tonymec @sergeevabc did not only use that notation for Cyrillic:
Ah, all right; I read too fast (as I often do, BTW), sorry.

> . Also it is implementation detail. And grep with glibc’s ERE or BRE which
> do follow LC_COLLATE think that both A-z and А-я are invalid ranges and
> valid variants in unicode locale are just the opposite: a-Z and а-Я (guess
> at least first is defined somewhere in unicode standard though).

Since (Latin) U+0041 A comes before U+007A z, and (Cyrillic) U+0410 А comes before U+044F я, both in Unicode codepoint order and in Latin or Russian (respectively) dictionary order, I don't see why [A-z] or [А-я] would be invalid. If some program rejects them, then maybe it's a bug in that program; but that has nothing to do with Vim. As you say, an implementation detail.

[A-z], however, is a little weird: in the C collating sequence (i.e. Unicode codepoint number or, in this case, US-ASCII byte value) it includes, as you said earlier, six nonalphabetic characters, so [A-Za-z] is more prudent for (unaccented) Latin.

Best regards,
Tony.
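
To make the codepoint argument concrete, here is a minimal sketch in Python (not Vim script, and not glibc's locale-aware matching). It assumes plain codepoint ordering, which is what Python's `re` module uses for bracket ranges; the variable names are purely illustrative. It lists what falls between A and z, and checks that U+0410..U+044F holds only letters.

```python
import re

# Everything [A-z] covers when ranges follow codepoint (C locale) order.
latin_range = [chr(cp) for cp in range(ord("A"), ord("z") + 1)]

# The non-letters that sit between 'Z' (U+005A) and 'a' (U+0061).
non_letters = [ch for ch in latin_range if not ch.isalpha()]
print(non_letters)

# Everything [А-я] covers: U+0410 (А) through U+044F (я).
cyrillic_range = [chr(cp) for cp in range(0x0410, 0x044F + 1)]
print(len(cyrillic_range), all(ch.isalpha() for ch in cyrillic_range))

# In codepoint order [A-z] accepts '_' while [A-Za-z] does not.
print(bool(re.fullmatch(r"[A-z]", "_")), bool(re.fullmatch(r"[A-Za-z]", "_")))

# Note: ё (U+0451) and Ё (U+0401) lie outside U+0410..U+044F.
print(bool(re.fullmatch(r"[А-я]", "ё")))
```

A locale-aware matcher that follows LC_COLLATE may order these characters differently, so this only illustrates the codepoint view discussed above.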
