Re: UTF-8 woes on z/OS, a solution - comments invited

Charles Mills Mon, 04 Sep 2017 13:18:28 -0700

After I read @Robert's reply to my note I was mentally composing more or less 
what @Gil writes below.


Paraphrasing the vulgar cliché, you can't put 20 bits of data in an 8-bit byte. 
Ultimately, EBCDIC is what it is, and it ain't UTF-8.

I suppose you might be able to create a custom EBCDIC code page that included 
"your" European characters -- assuming no more than thirty or so, and then 
configure z Unicode Services to handle it. Otherwise, to invoke another vulgar 
cliché, you are indeed SOL (without your homegrown, um, solution).

Charles


-----Original Message-----
From: IBM Mainframe Discussion List [mailto:[email protected]] On Behalf 
Of Paul Gilmartin
Sent: Monday, September 4, 2017 12:26 PM
To: [email protected]
Subject: Re: UTF-8 woes on z/OS, a solution - comments invited

On Mon, 4 Sep 2017 20:59:12 +0000, Robert Prins wrote
>
>I can probably find a set of code-pages that correctly translate the 
>two byte
>UTF-8 "ü" character to a one byte EBCDIC "ü" character, but how would 
>those same two code-pages translate the Polish "ł", the Danish "ø", the 
>Baltic "ė", and the Greek "Θ", which appear in the same PC-side file to 
>one single character... And back to the correct UTF-8 character...
>
>That makes the problem maybe more understandable?
> 
If SBCS is a requirement, then if there is an EBCDIC SBCS code page that 
contains "ü", "ł", "ø", "ė", and "Θ", iconv can probably translate UTF-8 to 
that code page.  Otherwise, you're SOL.

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [email protected] with the message: INFO IBM-MAIN

Re: UTF-8 woes on z/OS, a solution - comments invited

Reply via email to