On Thu, 9 Aug 2018 15:14:21 -0400, Tony Harminc wrote:
>
>>>It's worth getting a copy of the SHARE ÆCS report to see what the
>>>state of character encoding and standardization was like in 1989.
>>>
>On 8 August 2018 at 13:03, Paul Gilmartin wrote:
>> Is it available?
>
>I thought I had seen it on Bitsavers, but though there are references
>to it I don't see it there, or indeed anywhere else. Well I have an
>original paper copy, so time to get scanning...
> 
Thanks.  Will it go to Bitsavers?  They do an incredible job of some sort
of 2-layer PDFs which are simultaneously images (even with fingerprints)
and searchable text.  I've found:
    https://en.wikipedia.org/wiki/EBCDIC_037
    https://en.wikipedia.org/wiki/EBCDIC_037-2
    https://en.wikipedia.org/wiki/EBCDIC_1047
Differences in a handful of code points; not enough to bring world
peace or solve climate change.

On Linux, the script below compares the output of "dd conv={ebcdic|ibm}"
to pages 037, 500, and 1047.  The best match seems to be "conv-ibm"
to IBM-1047.

I don't believe that the "dd" utility per se motivated a serious requirement,
but does "dd conv=ibm" reflect otherwise prevalent practice?
(And we still have the LF-NL irritant.)

# #################################
#! /bin/sh -x

S=$( awk 'BEGIN {
        for ( I=32; I<128; I++ ) printf( "%c", I ) }'; )

around() {
    echo; echo; echo EBCDIC "$1"
    printf %s     "$S" | dd conv=ebcdic |
    iconv -f "$1" -t ISO8859-1

    echo; echo IBM    "$1"
    printf '%s\n' "$S"
    printf %s     "$S" | dd conv=ibm    |
    iconv -f "$1" -t ISO8859-1
}

around CSIBM037
around CSIBM500
around IBM-1047
echo
exit
# #################################

Thanks again,
gil

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to lists...@listserv.ua.edu with the message: INFO IBM-MAIN

Reply via email to