On Wed, 23 May 2012 06:43:53 -0700, Charles Mills wrote:

>I don't understand what I am seeing from Unicode Services translation.
>
>I specify translation from 1047 (Encoding scheme 1100 - EBCDIC, SBCS; Name
>LATIN 1 / OPEN SYSTEM) to 1252 (Encoding scheme 4105 - ASCII, SBCS; Name
>MS-WIN LATIN-1).
>
>As both CCSIDs are SBCS I would expect that any "common" EBCDIC character
>would get translated into a single ASCII byte. But for an input byte of
>X'B0' (logical not in 1047) I am seeing translation to the 2-byte sequence
>C2AC. AC is by my reading correct: it's 1252 logical not. But what the heck
>is that C2 about (C2 is A with an acute accent in 1252).
>
>FWIW technique E, substitution 1A.
> 
That appears to be a variable-length encoding, such as UTF-8.  For example:

    387 $ awk 'BEGIN { printf( "%c", 16*11 ) }' |
        iconv -f IBM-1047 -t UTF-8 | od -x
0000000 acc2

(John M. was lately ranting on another forum about the richness of the
POSIX shell command structure compared to TSO.  I can only agree.
What would be required to accomplish the same from the TSO "READY"
prompt?)

-- gil

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to lists...@bama.ua.edu with the message: INFO IBM-MAIN

Reply via email to