I have a product that uses IBM Unicode Services to translate from 1047 to UTF-8. When (due to an error or confused input) it translates non-printable 1047 data, I know that Unicode Services reports that it could not convert all of the characters.
Well, two quibbly points: - As Bill Clinton never said, it depends on what you mean by "character." Every 8-bit code point is apparently not translatable from 1047 to UTF-8. You could argue that X'01' is not a "character." I think every reasonable glyph known to man has a UTF-8 code point, so in that sense, every *character* is translatable. - I suppose you could argue that every code point is translatable -- it's just that some get translated to SUB. Charles -----Original Message----- From: IBM Mainframe Discussion List [mailto:[email protected]] On Behalf Of Shmuel Metz (Seymour J.) Sent: Saturday, September 27, 2014 6:33 PM To: [email protected] Subject: Re: FTP Question In <[email protected]>, on 09/25/2014 at 11:15 AM, Paul Gilmartin <[email protected]> said: >There's *no* character that can't be converted from IBM-1047 to UTF-8. While <https://www.iana.org/assignments/charset-reg/IBM1047> shows translations for all code points, <http://en.wikipedia.org/wiki/EBCDIC_1047> shows a bunch that allegedly have no equivalent. ---------------------------------------------------------------------- For IBM-MAIN subscribe / signoff / archive access instructions, send email to [email protected] with the message: INFO IBM-MAIN
