Yes, it's true.  Marco had sent me his UTF-Morse proposal just
yesterday, along with a suggestion that I put together an implementation
for April Fool's Day.  And darned if I wasn't really going to do it.  As
a JOKE.

But Marco, you need to check your invented sequences again.  The leading
and trailing Morse code units for the (non-ASCII) multi-Morse characters
conflict with some of the single-unit characters.  For example,
U+002D -....- looks like a leading unit, and U+0023 .-.-.. looks like a
trailing unit.

(It's only a JOKE, guys.  Take a breath.)

-Doug Ewell
 Fullerton, California

----- Original Message -----
From: "Marco Cimarosti" <[EMAIL PROTECTED]>
To: "'Carl W. Brown'" <[EMAIL PROTECTED]>; <[EMAIL PROTECTED]>
Sent: Thursday, November 21, 2002 1:22 am
Subject: UTF-Morse (was RE: Morse coded Unicode(was: Morse code))


> Carl W. Brown wrote:
> > I think that the bigger issue might be how do you extend Morse code
to
> > incorporate the Unicode character set.
> > [...]
>
> Carl, this is unfair!! You spoiled my April 1st joke in mid November!
>
> Ciao.
> Marco :-)
>
>
>
> ----------------------------------------------------------------------
> UTF-Morse - "Bringing Unicode in the telegraph age!"
>
>
> 1. Unicode characters U+0020..U+007E are encoded according to the
> following table:
>
> Code:  UTF-Morse:  Character name:
> ------ ----------- --------------------------
> U+0020 /           SPACE
> U+0021 -----.      EXCLAMATION MARK [1]
> U+0022 .-..-.      QUOTATION MARK
> U+0023 .-.-..      NUMBER SIGN [1]
> U+0024 ..-...      DOLLAR SIGN [1]
> U+0025 ..-..-      PERCENT SIGN [1]
> U+0026 ..-.-.      AMPERSAND [1]
> U+0027 .----.      APOSTROPHE
> U+0028 -.--.-      LEFT PARENTHESIS
> U+0029 -.---.      RIGHT PARENTHESIS [1]
> U+002A -.----      ASTERISK [1]
> U+002B --....      PLUS SIGN [1]
> U+002C --..--      COMMA
> U+002D -....-      HYPHEN-MINUS
> U+002E .-.-.-      FULL STOP
> U+002F -..-.       SOLIDUS [1]
> U+0030 -----       DIGIT ZERO
> U+0031 .----       DIGIT ONE
> U+0032 ..---       DIGIT TWO
> U+0033 ...--       DIGIT THREE
> U+0034 ....-       DIGIT FOUR
> U+0035 .....       DIGIT FIVE
> U+0036 -....       DIGIT SIX
> U+0037 --...       DIGIT SEVEN
> U+0038 ---..       DIGIT EIGHT
> U+0039 ----.       DIGIT NINE
> U+003A ---...      COLON
> U+003B ---..-      SEMICOLON [1]
> U+003C ---.-.      LESS-THAN SIGN [1]
> U+003D ----..      EQUALS SIGN [1]
> U+003E ---.--      GREATER-THAN SIGN [1]
> U+003F ..--..      QUESTION MARK
> U+0040 -.-.-.      COMMERCIAL AT [1]
> U+0041 ..-- .-     LATIN CAPITAL LETTER A [2]
> U+0042 ..-- -...   LATIN CAPITAL LETTER B [2]
> U+0043 ..-- -.-.   LATIN CAPITAL LETTER C [2]
> U+0044 ..-- -..    LATIN CAPITAL LETTER D [2]
> U+0045 ..-- .      LATIN CAPITAL LETTER E [2]
> U+0046 ..-- ..-.   LATIN CAPITAL LETTER F [2]
> U+0047 ..-- --.    LATIN CAPITAL LETTER G [2]
> U+0048 ..-- ....   LATIN CAPITAL LETTER H [2]
> U+0049 ..-- ..     LATIN CAPITAL LETTER I [2]
> U+004A ..-- .---   LATIN CAPITAL LETTER J [2]
> U+004B ..-- -.-    LATIN CAPITAL LETTER K [2]
> U+004C ..-- .-..   LATIN CAPITAL LETTER L [2]
> U+004D ..-- --     LATIN CAPITAL LETTER M [2]
> U+004E ..-- -.     LATIN CAPITAL LETTER N [2]
> U+004F ..-- ---    LATIN CAPITAL LETTER O [2]
> U+0050 ..-- .--.   LATIN CAPITAL LETTER P [2]
> U+0051 ..-- --.-   LATIN CAPITAL LETTER Q [2]
> U+0052 ..-- .-.    LATIN CAPITAL LETTER R [2]
> U+0053 ..-- ...    LATIN CAPITAL LETTER S [2]
> U+0054 ..-- -      LATIN CAPITAL LETTER T [2]
> U+0055 ..-- ..-    LATIN CAPITAL LETTER U [2]
> U+0056 ..-- ...-   LATIN CAPITAL LETTER V [2]
> U+0057 ..-- .--    LATIN CAPITAL LETTER W [2]
> U+0058 ..-- -..-   LATIN CAPITAL LETTER X [2]
> U+0059 ..-- -.--   LATIN CAPITAL LETTER Y [2]
> U+005A ..-- --..   LATIN CAPITAL LETTER Z [2]
> U+005B ..---.      LEFT SQUARE BRACKET [1]
> U+005C .-....      REVERSE SOLIDUS [1]
> U+005D ..----      RIGHT SQUARE BRACKET [1]
> U+005E .-...-      CIRCUMFLEX ACCENT [1]
> U+005F ------      LOW LINE [1]
> U+0060 ...---      GRAVE ACCENT [1]
> U+0061 .-          LATIN SMALL LETTER A
> U+0062 -...        LATIN SMALL LETTER B
> U+0063 -.-.        LATIN SMALL LETTER C
> U+0064 -..         LATIN SMALL LETTER D
> U+0065 .           LATIN SMALL LETTER E
> U+0066 ..-.        LATIN SMALL LETTER F
> U+0067 --.         LATIN SMALL LETTER G
> U+0068 ....        LATIN SMALL LETTER H
> U+0069 ..          LATIN SMALL LETTER I
> U+006A .---        LATIN SMALL LETTER J
> U+006B -.-         LATIN SMALL LETTER K
> U+006C .-..        LATIN SMALL LETTER L
> U+006D --          LATIN SMALL LETTER M
> U+006E -.          LATIN SMALL LETTER N
> U+006F ---         LATIN SMALL LETTER O
> U+0070 .--.        LATIN SMALL LETTER P
> U+0071 --.-        LATIN SMALL LETTER Q
> U+0072 .-.         LATIN SMALL LETTER R
> U+0073 ...         LATIN SMALL LETTER S
> U+0074 -           LATIN SMALL LETTER T
> U+0075 ..-         LATIN SMALL LETTER U
> U+0076 ...-        LATIN SMALL LETTER V
> U+0077 .--         LATIN SMALL LETTER W
> U+0078 -..-        LATIN SMALL LETTER X
> U+0079 -.--        LATIN SMALL LETTER Y
> U+007A --..        LATIN SMALL LETTER Z
> U+007B --.-..      LEFT CURLY BRACKET [1]
> U+007C --.--.      VERTICAL LINE [1]
> U+007D --.-.-      RIGHT CURLY BRACKET [1]
> U+007E --.---      TILDE [1]
>
>
> 2. All other Unicode characters are encoded with one of seven
> multi-Morse schemes:
>
> Code range:        Scheme
> -----------------  ------
> U+0000..U+0007     1
> U+0008..U+001F     2
> U+007F..U+01FF     3
> U+0200..U+0FFF     4
> U+1000..U+7FFF     5
> U+8000..U+3FFFF    6
> U+40000..U+10FFFF  7
>
> Each scheme uses a Morse sequence of the form ".-.yyy", possibly
> preceded by one or more Morse sequences in the form ".-.yyy":
>
> Scheme Bits (x: 0 or 1):     UTF-Morse (y: "." if x is 0, "-" if x is
1):
> ------ --------------------
> ------------------------------------------------
> 1      00000000000000000xxx  .-.yyy
> 2      00000000000000xxxxxx  -..yyy .-.yyy
> 3      00000000000xxxxxxxxx  -..yyy -..yyy .-.yyy
> 4      00000000xxxxxxxxxxxx  -..yyy -..yyy -..yyy .-.yyy
> 5      000000xxxxxxxxxxxxxx  -..yyy -..yyy -..yyy -..yyy .-.yyy
> 6      000xxxxxxxxxxxxxxxxx  -..yyy -..yyy -..yyy -..yyy -..yyy .-.yyy
> 7      xxxxxxxxxxxxxxxxxxxx  -..yyy -..yyy -..yyy -..yyy -..yyy -..yyy
> .-.yyy
>
>
> 3. Notes
>
> [1]: Some sequences are unique to UTF-Morse, and are unknown in
>      traditional Morse code.
>
> [2]: Capital letters use the same code as small letter, preceded by
>      sequence "..--" (which is unique to UTF-Morse).
>
> ----------------------------------------------------------------------
------
> -
>


Reply via email to