Re: can [[:digit:]] match something other than 0123456789?

Steffen Nurpmeso Fri, 25 May 2018 08:47:35 -0700

Garrett Wollman <[email protected]> wrote:
 |<<On Thu, 24 May 2018 07:34:56 +0000, "Schwarz, Konrad" <konrad.schwarz@siem\
 |ens.com> said:
 |> Also, my feeling is that [[:digit:]] should match just the digits
 |> that are actually relevant for that locale, e.g., just "western"
 |> digits for en_GB.  And fractions and superscripts are not digits.
 |
 |Implementations often use the same character definitions for all
 |locales using the same character set -- such as the Unicode character
 |data file, for Unicode-based locales.  I think changing this may be a
 |tough sell for many implementers, just given the sheer number of
 |characters (and bikeshed-painting debates about which particular
 |character class or collation element should include which characters
 |in which locales would not be welcome).


..and bugs are everywhere, ... and take a long time to fix.
I think Unicode is pretty clear on what is a digit or a number,
and what not.  And i think they no longer officially support the
toolchain that can be used to turn Unicode data tables to
Unix/POSIX compliant (a.k.a. localedef) tables.  But of course
D'Amore from the Solaris faction seems to have done a great job,
and Daroussin imported that into FreeBSD (unforgotten the "this is
how i like OpenSource software" message, or very nearby that).

--steffen
|
|Der Kragenbaer,                The moon bear,
|der holt sich munter           he cheerfully and one by one
|einen nach dem anderen runter  wa.ks himself off
|(By Robert Gernhardt)

Re: can [[:digit:]] match something other than 0123456789?

Reply via email to