On 6 August 2010 17:26, Glenn Fowler <[email protected]> wrote:
>
> tr is on the ast l18n todo list
> what locale is your example?

en_US.UTF-8

> btw, I tried this on linux and solaris
>
> LC_ALL=de_DE.UTF-8 /usr/bin/tr '[:lower:]' '[:upper:]' <<<$'a\303\274z'

This works with AST tr, too. But my test case is backwards, upper to lower:
ksh93 -c "LC_ALL=de_DE.UTF-8 ./arch/sol11.i386/bin/tr '[:upper:]'
'[:lower:]' <<<$'aÄÄz'" "
<trash>

ksh93 -c "LC_ALL=de_DE.UTF-8 ./arch/sol11.i386/bin/tr '[:upper:]'
'[:lower:]' <<<$'a\303\274z'"
<trash>

> ($'\303\274' is UTF-8 lower case u-umlaut)
> and got lower case u-umlaut on the output

/usr/bin/tr is buggy and is NOT aware of multibyte characters. Use
/usr/xpg6/bin/tr or /usr/xpg4/bin/tr instead.

Ced
-- 
Cedric Blancher <[email protected]>
Institute Pasteur

_______________________________________________
ast-developers mailing list
[email protected]
https://mailman.research.att.com/mailman/listinfo/ast-developers

Reply via email to