On 29 June 2012 16:41, Irek Szczesniak <[email protected]> wrote:
> On Fri, Jun 29, 2012 at 4:16 PM, Irek Szczesniak <[email protected]> 
> wrote:
>> On Fri, Jun 29, 2012 at 2:25 PM, Lionel Cons
>> <[email protected]> wrote:
>>> On 29 June 2012 07:29, Glenn Fowler <[email protected]> wrote:
>>>>
>>>> the AT&T Software Technology ast 2012-06-28 source release
>>>> has been posted to the download site
>>>>        http://www.research.att.com/sw/download/
>>>> the notes and changes link has details on the release
>>>>
>>>> the git source repository will be updated later today
>>>>        http://www.research.att.com/sw/gitweb/
>>>
>>> Can anyone confirm that his release is broken? We've been unable to
>>> produce a ksh binary from this on Fedora and Suse which can be used to
>>> process autoconf scripts or any other production script. This is
>>> really bad, and if there are confirmations then please WITHDRAW that
>>> release.
>>
>> With less emphasis on strong language, and only speaking for Solaris
>> 9/10/11 on x86-64 (Intel Xenon) and SPARC64, both build with Sun
>> Studio 12, I can say that something is wrong. We've upgraded from the
>> last non-beta release to this beta and *lots* of things broke down
>> immediately. Analysis is still in progress but it's Friday afternoon
>> so a more detailed report may come Monday.
>
> David, I think ksh suffers from a data corruption problem in en_US.UTF-8.
>
> On Solaris 10/x86-64 (Intel Xenon) build with Sun Studio 12 we see this:
>
> #OK: input file
> md5sum /usr/pub/UTF-8
> 05f555672fd120af5b633f5bc89b3938  /usr/pub/UTF-8
>
> #OK: bash4.0 passes it through correctly:
> LC_ALL=en_US.UTF-8 bash -c 'sl=$( /bin/cat /usr/pub/UTF-8 ) ; md5sum
> <<<"$sl" ; true'
> 05f555672fd120af5b633f5bc89b3938  -
>
> #OK: dash (with patch for <<< syntax) passes it through correctly:
> LC_ALL=en_US.UTF-8 bash -c 'sl=$( /bin/cat /usr/pub/UTF-8 ) ; md5sum
> <<<"$sl" ; true'
> 05f555672fd120af5b633f5bc89b3938  -
>
> #FAIL: but astksh20120628 **CORRUPTS** the data in the en_US.UTF-8 locale:
> LC_ALL=en_US.UTF-8 ~/bin/ksh -c 'sl=$( /bin/cat /usr/pub/UTF-8 ) ;
> md5sum <<<"$sl" ; true'
> 756e8851f95e59b7a0bed28e20b72d50  -
>
> #OK: same ksh93 binary with C locale passes data through OK:
> LC_ALL=C ~/bin/ksh -c 'sl=$( /bin/cat /usr/pub/UTF-8 ) ; md5sum
> <<<"$sl" ; true'
> 05f555672fd120af5b633f5bc89b3938  -

Hurrah, my Friday afternoon is ruined.

So with ast-ksh 2012-06-28 I get a warning about a broken multibyte
character (UTF-8 locale):
kshbroken20120628 -c 'builtin wc ; wc -m -w -l <<< "$( cat
/usr/pub/UTF-8 )" ; true'
wc: line 2146: warning: invalid multibyte character
    2889   43115  183566

With ast-ksh 2011-02-08 (delivered with Illumos) this works as expected:
/bin/ksh -c 'builtin wc ; wc -m -w -l <<< "$( cat /usr/pub/UTF-8 )" ;
true'
   24576  258437 1619916

What was the last known working release?

Lionel

_______________________________________________
ast-users mailing list
[email protected]
https://mailman.research.att.com/mailman/listinfo/ast-users

Reply via email to