On Fri, Oct 19, 2012 at 3:38 PM, Cedric Blancher
<[email protected]> wrote:
> Request for enhancement: .sh.regex.available_character_class
>
> What do you think about adding a  .sh.regex.available_character_class
> array variable which contains the list of available wctype character
> classes for the current locale? I know there is no API to get a list
> from the OS but libast could probe well-known names and put only those
> in the array for which wctype() turned a non-0 value.

IMO it's better to let "getconf" handle that job because these are
locale properties which are not limited to the shell.
AFAIK we need two different "getconf" properties - one for regex
character classes and one for |wctrans()| transformations.
I did some digging... and it seems Solaris 11 supports the following
transformations (beyond POSIX  ; these are locale-dependant):
-- snip --
tojhira
tojisx0201
tojisx0208
tojkata
tolower
toupper
-- snip --
... Linux adds "totitle".

Character classes (beyond POSIX  ; these are locale-dependant)
supported by Solaris 11 are:
-- snip --
english
gb
ideogram
jalpha
jdigit
jgen
jgreek
jhankana
jhira
jisx0201r
jisx0208
jisx0212
jkanji
jkata
jparen
jpunct
jrussian
jsci
jspecial
junit
line
number
phonogram
special
wchar10
wchar11
wchar12
wchar13
wchar14
wchar15
wchar16
wchar17
wchar18
wchar19
wchar20
wchar21
wchar22
wchar23
wchar24
wchar6
wchar9
-- snip --
(note that some of these are errornously prefixed with "is" in some
older Solaris versions). FreeBSD/OSX and Illumos add "rune" as extra
class here.

Glenn: What do you think about the idea of using "getconf" for this ?
If you think this is OK then I can provide code who can test these
"well-known" names (erm... including the "is"-prefix for character
classes) for both (note that we cannot cache the values because they
depend on LANG/LC_CTYPE/LC_ALL and IMO it's cheaper to probe the
values each time "getconf" is called than trying to add more code for
caching and tracking of the values of LANG/LC_CTYPE/LC_ALL).

----

Bye,
Roland

-- 
  __ .  . __
 (o.\ \/ /.o) [email protected]
  \__\/\/__/  MPEG specialist, C&&JAVA&&Sun&&Unix programmer
  /O /==\ O\  TEL +49 641 3992797
 (;O/ \/ \O;)
_______________________________________________
ast-users mailing list
[email protected]
https://mailman.research.att.com/mailman/listinfo/ast-users

Reply via email to