I have a product that uses IBM Unicode Services to translate from 1047 to
UTF-8. When (due to an error or confused input) it translates non-printable
1047 data, I know that Unicode Services reports that it could not convert
all of the characters.

Well, two quibbly points:

- As Bill Clinton never said, it depends on what you mean by "character."
Every 8-bit code point is apparently not translatable from 1047 to UTF-8.
You could argue that X'01' is not a "character." I think every reasonable
glyph known to man has a UTF-8 code point, so in that sense, every
*character* is translatable.
- I suppose you could argue that every code point is translatable -- it's
just that some get translated to SUB.

Charles

-----Original Message-----
From: IBM Mainframe Discussion List [mailto:[email protected]] On
Behalf Of Shmuel Metz (Seymour J.)
Sent: Saturday, September 27, 2014 6:33 PM
To: [email protected]
Subject: Re: FTP Question

In <[email protected]>, on
09/25/2014
   at 11:15 AM, Paul Gilmartin
<[email protected]> said:

>There's *no* character that can't be converted from IBM-1047 to UTF-8.

While <https://www.iana.org/assignments/charset-reg/IBM1047> shows
translations for all code points, <http://en.wikipedia.org/wiki/EBCDIC_1047>
shows a bunch that allegedly have no equivalent. 

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [email protected] with the message: INFO IBM-MAIN

Reply via email to