On Thu, 9 Aug 2018 15:14:21 -0400, Tony Harminc wrote:
>
>>>It's worth getting a copy of the SHARE ÆCS report to see what the
>>>state of character encoding and standardization was like in 1989.
>>>
>On 8 August 2018 at 13:03, Paul Gilmartin wrote:
>> Is it available?
>
>I thought I had seen it on Bitsavers, but though there are references
>to it I don't see it there, or indeed anywhere else. Well I have an
>original paper copy, so time to get scanning...
>
Thanks. Will it go to Bitsavers? They do an incredible job of some sort
of 2-layer PDFs which are simultaneously images (even with fingerprints)
and searchable text. I've found:
https://en.wikipedia.org/wiki/EBCDIC_037
https://en.wikipedia.org/wiki/EBCDIC_037-2
https://en.wikipedia.org/wiki/EBCDIC_1047
Differences in a handful of code points; not enough to bring world
peace or solve climate change.
On Linux, the script below compares the output of "dd conv={ebcdic|ibm}"
to pages 037, 500, and 1047. The best match seems to be "conv-ibm"
to IBM-1047.
I don't believe that the "dd" utility per se motivated a serious requirement,
but does "dd conv=ibm" reflect otherwise prevalent practice?
(And we still have the LF-NL irritant.)
# #################################
#! /bin/sh -x
S=$( awk 'BEGIN {
for ( I=32; I<128; I++ ) printf( "%c", I ) }'; )
around() {
echo; echo; echo EBCDIC "$1"
printf %s "$S" | dd conv=ebcdic |
iconv -f "$1" -t ISO8859-1
echo; echo IBM "$1"
printf '%s\n' "$S"
printf %s "$S" | dd conv=ibm |
iconv -f "$1" -t ISO8859-1
}
around CSIBM037
around CSIBM500
around IBM-1047
echo
exit
# #################################
Thanks again,
gil
----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [email protected] with the message: INFO IBM-MAIN