Glenn, I think Roland was thinking about the getconf shell builtin, because ksh93 does not have a locale(1) shell builtin, which could be used to reflect such data.
Olga

On Mon, Oct 22, 2012 at 6:14 AM, Glenn Fowler <g...@research.att.com> wrote:
>
> ah but you may have been thinking getconf function and not getconf command
> in that case doing it with the getconf function is probably the way to go
>
> On Mon, 22 Oct 2012 00:10:28 -0400 Glenn Fowler wrote:
>> locale(1) would be my first choice
>> but getconf(1) would be ok too
>
>> On Mon, 22 Oct 2012 01:34:38 +0200 Roland Mainz wrote:
>> > On Fri, Oct 19, 2012 at 3:38 PM, Cedric Blancher
>> > <cedric.blanc...@googlemail.com> wrote:
>> > > Request for enhancement: .sh.regex.available_character_class
>> > >
>> > > What do you think about adding a .sh.regex.available_character_class
>> > > array variable which contains the list of available wctype character
>> > > classes for the current locale? I know there is no API to get a list
>> > > from the OS but libast could probe well-known names and put only those
>> > > in the array for which wctype() returned a non-0 value.
>
>> > IMO it's better to let "getconf" handle that job because these are
>> > locale properties which are not limited to the shell.
>> > AFAIK we need two different "getconf" properties - one for regex
>> > character classes and one for |wctrans()| transformations.
>> > I did some digging... and it seems Solaris 11 supports the following
>> > transformations (beyond POSIX ; these are locale-dependent):
>> > -- snip --
>> > tojhira
>> > tojisx0201
>> > tojisx0208
>> > tojkata
>> > tolower
>> > toupper
>> > -- snip --
>> > ... Linux adds "totitle".
>
>> > Character classes (beyond POSIX ; these are locale-dependent)
>> > supported by Solaris 11 are:
>> > -- snip --
>> > english
>> > gb
>> > ideogram
>> > jalpha
>> > jdigit
>> > jgen
>> > jgreek
>> > jhankana
>> > jhira
>> > jisx0201r
>> > jisx0208
>> > jisx0212
>> > jkanji
>> > jkata
>> > jparen
>> > jpunct
>> > jrussian
>> > jsci
>> > jspecial
>> > junit
>> > line
>> > number
>> > phonogram
>> > special
>> > wchar10
>> > wchar11
>> > wchar12
>> > wchar13
>> > wchar14
>> > wchar15
>> > wchar16
>> > wchar17
>> > wchar18
>> > wchar19
>> > wchar20
>> > wchar21
>> > wchar22
>> > wchar23
>> > wchar24
>> > wchar6
>> > wchar9
>> > -- snip --
>> > (note that some of these are erroneously prefixed with "is" in some
>> > older Solaris versions). FreeBSD/OSX and Illumos add "rune" as extra
>> > class here.
>
>> > Glenn: What do you think about the idea of using "getconf" for this ?
>> > If you think this is OK then I can provide code that can test these
>> > "well-known" names (erm... including the "is"-prefix for character
>> > classes) for both (note that we cannot cache the values because they
>> > depend on LANG/LC_CTYPE/LC_ALL and IMO it's cheaper to probe the
>> > values each time "getconf" is called than trying to add more code for
>> > caching and tracking of the values of LANG/LC_CTYPE/LC_ALL).
>
>> > ----
>
>> > Bye,
>> > Roland
>
>> > --
>> >   __ .  . __
>> >  (o.\ \/ /.o) roland.ma...@nrubsig.org
>> >   \__\/\/__/  MPEG specialist, C&&JAVA&&Sun&&Unix programmer
>> >   /O /==\ O\  TEL +49 641 3992797
>> >  (;O/ \/ \O;)
>
> _______________________________________________
> ast-users mailing list
> ast-us...@research.att.com
> https://mailman.research.att.com/mailman/listinfo/ast-users

-- 
      ,   _                                    _   ,
     { \/`o;====-    Olga Kryzhanovska   -====;o`\/ }
.----'-/`-/     olga.kryzhanov...@gmail.com   \-`\-'----.
 `'-..-| /       http://twitter.com/fleyta     \ |-..-'`
      /\/\     Solaris/BSD//C/C++ programmer   /\/\
      `--`                                      `--`
_______________________________________________
ast-developers mailing list
ast-developers@research.att.com
https://mailman.research.att.com/mailman/listinfo/ast-developers
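
For reference, the probing approach discussed in this thread (try each well-known name and keep those for which wctype() or wctrans() returns non-0) might look roughly like the C sketch below. It is only an illustration under stated assumptions: the function names class_available(), trans_available() and print_available() are hypothetical and are not part of libast or of any existing getconf implementation, and the candidate lists are a subset of the names quoted above. Nothing is cached, in line with Roland's point that the result depends on LANG/LC_CTYPE/LC_ALL.

```c
#include <stdio.h>
#include <wctype.h>

/* Candidate names: the POSIX classes plus a few of the Solaris/BSD
 * extras quoted in the thread. Illustrative subset, not exhaustive. */
static const char *const class_names[] = {
    "alnum", "alpha", "blank", "cntrl", "digit", "graph",
    "lower", "print", "punct", "space", "upper", "xdigit",
    "english", "gb", "ideogram", "jalpha", "jdigit", "jhira",
    "jkanji", "jkata", "line", "number", "phonogram", "special",
    "rune", NULL
};

/* Candidate wctrans() mappings named in the thread. */
static const char *const trans_names[] = {
    "tolower", "toupper", "tojhira", "tojisx0201", "tojisx0208",
    "tojkata", "totitle", NULL
};

/* Hypothetical helper: 1 if the current LC_CTYPE locale knows the class.
 * Also tries the "is"-prefixed spelling mentioned for older Solaris. */
int class_available(const char *name)
{
    char prefixed[64];

    if (wctype(name) != (wctype_t)0)
        return 1;
    snprintf(prefixed, sizeof prefixed, "is%s", name);
    return wctype(prefixed) != (wctype_t)0;
}

/* Hypothetical helper: 1 if the current locale knows the mapping. */
int trans_available(const char *name)
{
    return wctrans(name) != (wctrans_t)0;
}

/* Probe every candidate on each call (no caching) and print the hits.
 * Returns the number of names found. */
int print_available(FILE *out)
{
    int found = 0;

    for (const char *const *p = class_names; *p != NULL; p++) {
        if (class_available(*p)) {
            fprintf(out, "class %s\n", *p);
            found++;
        }
    }
    for (const char *const *p = trans_names; *p != NULL; p++) {
        if (trans_available(*p)) {
            fprintf(out, "trans %s\n", *p);
            found++;
        }
    }
    return found;
}
```

A caller would normally run setlocale(LC_ALL, "") from <locale.h> first and then call print_available(stdout); without that the probe reflects the "C" locale, where only the POSIX names are expected to show up.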