On Mon, Sep 2, 2013 at 1:06 AM, Roland Mainz <[email protected]> wrote: > On Mon, Sep 2, 2013 at 12:36 AM, Roland Mainz <[email protected]> > wrote: >> On Mon, Aug 5, 2013 at 5:01 AM, Roland Mainz <[email protected]> >> wrote: >>> On Mon, Aug 5, 2013 at 4:13 AM, Roland Mainz <[email protected]> >>> wrote: [snip] > ** More notes: > 1. $ ksh -c 'export LC_ALL=en_US.ISO8859-15 ; printf "x\u[20ac]x\n" | > iconv -f ISO8859-15 -t UTF-8' # now works... it the correct outpput is > "x€x" > 2. The reason why this didn't work in the *002* patch was that the > original code in ast-ksh.2013-08-29 used |wc2utf8()| on an "extended > single-byte locale" like "en_US.ISO8859-15" ... this can **never** > work because the locale is not UTF-8 based > > Glenn/David: What do you think about the patch ?
I forgot one note: - The patch _explicitly_ uses |iconv()| even for UTF-8 locales to weed-out unassigned codepoints to fullfit the unicode requirement that no unassigned codepoints should be accessible. ---- Bye, Roland -- __ . . __ (o.\ \/ /.o) [email protected] \__\/\/__/ MPEG specialist, C&&JAVA&&Sun&&Unix programmer /O /==\ O\ TEL +49 641 3992797 (;O/ \/ \O;) _______________________________________________ ast-developers mailing list [email protected] http://lists.research.att.com/mailman/listinfo/ast-developers
