After I read @Robert's reply to my note I was mentally composing more or less what @Gil writes below.
Paraphrasing the vulgar cliché, you can't put 20 bits of data in an 8-bit byte. Ultimately, EBCDIC is what it is, and it ain't UTF-8. I suppose you might be able to create a custom EBCDIC code page that included "your" European characters -- assuming no more than thirty or so, and then configure z Unicode Services to handle it. Otherwise, to invoke another vulgar cliché, you are indeed SOL (without your homegrown, um, solution). Charles -----Original Message----- From: IBM Mainframe Discussion List [mailto:[email protected]] On Behalf Of Paul Gilmartin Sent: Monday, September 4, 2017 12:26 PM To: [email protected] Subject: Re: UTF-8 woes on z/OS, a solution - comments invited On Mon, 4 Sep 2017 20:59:12 +0000, Robert Prins wrote > >I can probably find a set of code-pages that correctly translate the >two byte >UTF-8 "ü" character to a one byte EBCDIC "ü" character, but how would >those same two code-pages translate the Polish "ł", the Danish "ø", the >Baltic "ė", and the Greek "Θ", which appear in the same PC-side file to >one single character... And back to the correct UTF-8 character... > >That makes the problem maybe more understandable? > If SBCS is a requirement, then if there is an EBCDIC SBCS code page that contains "ü", "ł", "ø", "ė", and "Θ", iconv can probably translate UTF-8 to that code page. Otherwise, you're SOL. ---------------------------------------------------------------------- For IBM-MAIN subscribe / signoff / archive access instructions, send email to [email protected] with the message: INFO IBM-MAIN
