UTF-8 is just a transform of Unicode, and the Unicode code point is AC. The string C2AC is just a way of encoding AC.
________________________________________ From: IBM Mainframe Discussion List <[email protected]> on behalf of Paul Gilmartin <[email protected]> Sent: Sunday, May 7, 2023 11:14 AM To: [email protected] Subject: Re: Logical Nor (¬) in ASCII-based code pages? On Sun, 7 May 2023 13:27:16 +0000, Seymour J Metz wrote: >I've seen Logical Not (¬) at AA and at AC. Are there and ASCII-based code >pages that have it at a third position? Put another way, is there a third code >point that ooRexx and Regina should recognize as ¬? > C2AC in UTF-8, which I consider the most important because it dominates my desktop, and the one I copied/pasted from regina.pdf.. Otherwise, for z/OS UNIX, AC in ISO8859-1 (matching Windows-1252.) Does Rexx use any other non-ASCII delimiters? But let EBCDIC be a warning. From the ISPF Edit Macros Ref.: Regular expressions (string, string1) ISPF queries the host code page defined for your TN3270 session. If the code page is one of the following: [list of 52] .... This is done so that the special symbols (such as square brackets) within the regular expression are correctly interpreted when the regcomp function is used to compile the regular expression. (In response to my RCF): in an Edit macro that is called from a batch Edit session (where no terminal is attached), code page 1047 is used. ... so copies of Edit macros for all code pages are required! The PCRE document mentions this, not kindly. The exec or Edit macro should reside in a UNIX directory, tagged with CCSID. -- gil ---------------------------------------------------------------------- For IBM-MAIN subscribe / signoff / archive access instructions, send email to [email protected] with the message: INFO IBM-MAIN ---------------------------------------------------------------------- For IBM-MAIN subscribe / signoff / archive access instructions, send email to [email protected] with the message: INFO IBM-MAIN
