On 6 August 2010 17:26, Glenn Fowler <[email protected]> wrote: > > tr is on the ast l18n todo list > what locale is your example?
en_US.UTF-8 > btw, I tried this on linux and solaris > > LC_ALL=de_DE.UTF-8 /usr/bin/tr '[:lower:]' '[:upper:]' <<<$'a\303\274z' This works with AST tr, too. But my test case is backwards, upper to lower: ksh93 -c "LC_ALL=de_DE.UTF-8 ./arch/sol11.i386/bin/tr '[:upper:]' '[:lower:]' <<<$'aÄÄz'" " <trash> ksh93 -c "LC_ALL=de_DE.UTF-8 ./arch/sol11.i386/bin/tr '[:upper:]' '[:lower:]' <<<$'a\303\274z'" <trash> > ($'\303\274' is UTF-8 lower case u-umlaut) > and got lower case u-umlaut on the output /usr/bin/tr is buggy and is NOT aware of multibyte characters. Use /usr/xpg6/bin/tr or /usr/xpg4/bin/tr instead. Ced -- Cedric Blancher <[email protected]> Institute Pasteur _______________________________________________ ast-developers mailing list [email protected] https://mailman.research.att.com/mailman/listinfo/ast-developers
