On 10 June 2013 03:50, Glenn Fowler <[email protected]> wrote:
>
> On Mon, 10 Jun 2013 03:47:08 +0200 Roland Mainz wrote:
>> On Sun, Jun 9, 2013 at 4:44 AM, Glenn Fowler <[email protected]> wrote:
>> > I knew I would get into semantic trouble here
>> > I'm not complaining/deriding the efficacy of iswrune()
>> > only that it has no bearing on any posix compliant utility
>
>> OK... here is the question which bothers me:
>> tr -C does require to sort characters, right ? How do we sort
>> characters which do not have an assigned meaning ?
>
> strcoll()
>
>> > if anyone wants to start a discussion about new utility option(s)
>> > that rely on iswrune() and what ast utilities should be affected, great
>> >
>> > for systems that do not supply iswrune() portability remains a big issue,
>> > current practice notwithstanding -- it will always be an
>> > iffe|config game of catchup vs. the iw*() collection du jour
>
>> BTW: re |iswrune()| emulation... perl has the perl regex match
>> \p{Unassigned} ... which creates the same matches as this script
>> (assuming LC_ALL='en_US.UTF-8' and locales Unicode version matches the
>> perl unicode version):
>> -- snip --
>> set -o nounset
>
>> typeset -i16 i
>
>> for (( i=0 ; i < 0x10FFFF ; i++ )) ; do
>>       ch="${ printf "\u[${i/~(El)16#/}]" ; }"
>
>>       if [[ "$ch" !=
>> ~(Elr)[[:alpha:][:alnum:][:digit:][:print:][:cntrl:][:space:][:blank:][:punct:]]
>> ]] ; then
>>               printf "# match found: %q\n" "${i}"
>>       fi
>> done
>
>> print '# done.'
>> -- snip --
>
>> |iswrune()| or not... IMO it would be nice to have something like
>> \p{Unassigned} in normal egrep/xgrep regex, e.g. something like a
>> [:_unassigned:] character class...
>
> [:rune:] would be a fine name for that class

There's still no [:rune:] emulation in libast :(

Ced
-- 
Cedric Blancher <[email protected]>
Institute Pasteur
_______________________________________________
ast-developers mailing list
[email protected]
http://lists.research.att.com/mailman/listinfo/ast-developers

Reply via email to