On Tue, 27 Mar 2012 13:03:14 -0500, Paul Gilmartin wrote: >ftp> quote site sbdataconn=(IBM-424,UTF-8) >200-Some characters cannot be translated between UTF-8 and IBM-424 >200 SITE command was accepted >ftp> get TEST.TESTPRT(TESTPRT) >local: TEST.TESTPRT(TESTPRT) remote: TEST.TESTPRT(TESTPRT) >229 Entering Extended Passive Mode (|||25580|) >125 Sending data set SPPG.TEST.TESTPRT(TESTPRT) FIXrecfm 80 > 0 0.00 KiB/s >557 Data contains codepoints that cannot be translated >ftp> > ... > >WTF!? Didn't Shmuel tell us that UTF-8 contains all of Unicode? >(And all EBCDIC code points are defined in IBM-1047.) I gotta try >this on 1.13 and submit a PMR. Or am I missing something?
UTF-8 is a variable-width encoding (1 to 4 Bytes/"octets" per character), it's not a single byte character set. "sbdataconn" specifies single byte encoding. Use "site encoding=mbcs" and "site mbdataconn=(IBM-424,UTF-8)" to specify multibyte encoding. > >The Roman characters in the file as transferred with >sbdataconn=(IBM-1047,ISO8859-1) appear plausible. > IBM-1047 and ISO8859-1 are both single byte character sets. Norbert Friemel ---------------------------------------------------------------------- For IBM-MAIN subscribe / signoff / archive access instructions, send email to lists...@bama.ua.edu with the message: INFO IBM-MAIN