Am 06.11.2006 um 12:51 schrieb Kenichi Handa:
In article <[EMAIL PROTECTED]>, Peter
Dyballa <[EMAIL PROTECTED]> writes:
The problems with CP1250 and CP1251 are solved with this patch. No
other failure could be found, except that there is just one
difference, with and without the patch: a thai-tis620 encoded buffer
(without NO BREAK SPACE) shows Thai glyphs, but when the same
contents is reverted to iso-8859-11 (which is thai-tis620 with NO
BREAK SPACE) only empty boxes are shown.
Again, as I can't reproduce this, please send me the exact
file (it is usually a good idea to post the exact file to
reproduce a bug).
By the way, the current implementation of thai-tis620 allows
all bytes in 0xA1..0x7E, but iso-8859-11 allows
0xA0..0xDA,0xDF..0xFB. Some of emply boxes shown in
thai-tis620 are shown in octal form when reverted to
iso-8859-11. This is not a good behavior. But as fixing it
is not trivial and it is anyway not that serious problem,
I'd like to leave it as is. It is fixed in emacs-unicode-2.
I can see this in both GNU Emacsen 22.0.50 and 22.0.90. When it comes
to loading in TIS620 encoding the TIS620 encoded file these lines are
written into *Messages* buffer:
Loading view...done
Loading thai-util...
Loading mule-util...done
Loading thai-util...done
Loading tex-mode...done
In the case of the ISO 8859-11 encoded only this line is added into
*Messages* buffer:
Loading view...done
I launch both Emacsen with -Q. Then: C-x d <my tests dir> RET. Some
cursor movements to position the cursor on the file. Then: C-x RET c
tis620 RET v, or C-x RET c iso-8859-11 RET v.
These are my proper encoding files. The test files were generated by
removing the first lines in vi(m) and saving under the same name in a
"puristic" directory inside my test cases directory:
;;; -*- mode: Text; coding: iso-8859-10; -*-
;
; Time-stamp: <2005-07-14 12:17:18 pete>
;
; New Nordic Glyphs: Saami, Inuit, Icelandic (Latin 6)
;
; oct dec hex UCS2 UTF-8
;=====================================
= 240 = 160 = A0 = U+00A0 = C2 A0 : NO-BREAK SPACE
¡ = 241 = 161 = A1 = U+0104 = C4 84 : LATIN CAPITAL LETTER A WITH OGONEK
¢ = 242 = 162 = A2 = U+0112 = C4 92 : LATIN CAPITAL LETTER E WITH MACRON
£ = 243 = 163 = A3 = U+0122 = C4 A2 : LATIN CAPITAL LETTER G WITH CEDILLA
¤ = 244 = 164 = A4 = U+012A = C4 AA : LATIN CAPITAL LETTER I WITH MACRON
¥ = 245 = 165 = A5 = U+0128 = C4 A8 : LATIN CAPITAL LETTER I WITH TILDE
¦ = 246 = 166 = A6 = U+0136 = C4 B6 : LATIN CAPITAL LETTER K WITH CEDILLA
§ = 247 = 167 = A7 = U+00A7 = C2 A7 : SECTION SIGN
¨ = 250 = 168 = A8 = U+013B = C4 BB : LATIN CAPITAL LETTER L WITH CEDILLA
© = 251 = 169 = A9 = U+0110 = C4 90 : LATIN CAPITAL LETTER D WITH STROKE
ª = 252 = 170 = AA = U+0160 = C5 A0 : LATIN CAPITAL LETTER S WITH CARON
« = 253 = 171 = AB = U+0166 = C5 A6 : LATIN CAPITAL LETTER T WITH STROKE
¬ = 254 = 172 = AC = U+017D = C5 BD : LATIN CAPITAL LETTER Z WITH CARON
= 255 = 173 = AD = U+00AD = C2 AD : HYPHEN-MINUS
® = 256 = 174 = AE = U+016A = C5 AA : LATIN CAPITAL LETTER U WITH MACRON
¯ = 257 = 175 = AF = U+014A = C5 8A : LATIN CAPITAL LETTER ENG
° = 260 = 176 = B0 = U+00B0 = C2 B0 : DEGREE SIGN
± = 261 = 177 = B1 = U+0105 = C4 85 : LATIN SMALL LETTER A WITH OGONEK
² = 262 = 178 = B2 = U+0113 = C4 93 : LATIN SMALL LETTER E WITH MACRON
³ = 263 = 179 = B3 = U+0123 = C4 A3 : LATIN SMALL LETTER G WITH CEDILLA
´ = 264 = 180 = B4 = U+012B = C4 AB : LATIN SMALL LETTER I WITH MACRON
µ = 265 = 181 = B5 = U+0129 = C4 A9 : LATIN SMALL LETTER I WITH TILDE
¶ = 266 = 182 = B6 = U+0137 = C4 B7 : LATIN SMALL LETTER K WITH CEDILLA
· = 267 = 183 = B7 = U+00B7 = C2 B7 : MIDDLE DOT
¸ = 270 = 184 = B8 = U+013C = C4 BC : LATIN SMALL LETTER L WITH CEDILLA
¹ = 271 = 185 = B9 = U+0111 = C4 91 : LATIN SMALL LETTER D WITH STROKE
º = 272 = 186 = BA = U+0161 = C5 A1 : LATIN SMALL LETTER S WITH CARON
» = 273 = 187 = BB = U+0167 = C5 A7 : LATIN SMALL LETTER T WITH STROKE
¼ = 274 = 188 = BC = U+017E = C5 BE : LATIN SMALL LETTER Z WITH CARON
½ = 275 = 189 = BD = U+2015 = E2 80 95 : HORIZONTAL BAR
¾ = 276 = 190 = BE = U+016B = C5 AB : LATIN SMALL LETTER U WITH MACRON
¿ = 277 = 191 = BF = U+014B = C5 8B : LATIN SMALL LETTER ENG
À = 300 = 192 = C0 = U+0100 = C4 80 : LATIN CAPITAL LETTER A WITH MACRON
Á = 301 = 193 = C1 = U+00C1 = C3 81 : LATIN CAPITAL LETTER A WITH ACUTE
 = 302 = 194 = C2 = U+00C2 = C3 82 : LATIN CAPITAL LETTER A WITH CIRCUMFLEX
à = 303 = 195 = C3 = U+00C3 = C3 83 : LATIN CAPITAL LETTER A WITH TILDE
Ä = 304 = 196 = C4 = U+00C4 = C3 84 : LATIN CAPITAL LETTER A WITH DIAERESIS
Å = 305 = 197 = C5 = U+00C5 = C3 85 : LATIN CAPITAL LETTER A WITH RING ABOVE
Æ = 306 = 198 = C6 = U+00C6 = C3 86 : LATIN CAPITAL LETTER AE
Ç = 307 = 199 = C7 = U+012E = C4 AE : LATIN CAPITAL LETTER I WITH OGONEK
È = 310 = 200 = C8 = U+010C = C4 8C : LATIN CAPITAL LETTER C WITH CARON
É = 311 = 201 = C9 = U+00C9 = C3 89 : LATIN CAPITAL LETTER E WITH ACUTE
Ê = 312 = 202 = CA = U+0118 = C4 98 : LATIN CAPITAL LETTER E WITH OGONEK
Ë = 313 = 203 = CB = U+00CB = C3 8B : LATIN CAPITAL LETTER E WITH DIAERESIS
Ì = 314 = 204 = CC = U+0116 = C4 96 : LATIN CAPITAL LETTER E WITH DOT ABOVE
Í = 315 = 205 = CD = U+00CD = C3 8D : LATIN CAPITAL LETTER I WITH ACUTE
Î = 316 = 206 = CE = U+00CE = C3 8E : LATIN CAPITAL LETTER I WITH CIRCUMFLEX
Ï = 317 = 207 = CF = U+00CF = C3 8F : LATIN CAPITAL LETTER I WITH DIAERESIS
Ð = 320 = 208 = D0 = U+00D0 = C3 90 : LATIN CAPITAL LETTER ETH
Ñ = 321 = 209 = D1 = U+0145 = C5 85 : LATIN CAPITAL LETTER N WITH CEDILLA
Ò = 322 = 210 = D2 = U+014C = C5 8C : LATIN CAPITAL LETTER O WITH MACRON
Ó = 323 = 211 = D3 = U+00D3 = C3 93 : LATIN CAPITAL LETTER O WITH ACUTE
Ô = 324 = 212 = D4 = U+00D4 = C3 94 : LATIN CAPITAL LETTER O WITH CIRCUMFLEX
Õ = 325 = 213 = D5 = U+00D5 = C3 95 : LATIN CAPITAL LETTER O WITH TILDE
Ö = 326 = 214 = D6 = U+00D6 = C3 96 : LATIN CAPITAL LETTER O WITH DIAERESIS
× = 327 = 215 = D7 = U+0168 = C5 A8 : LATIN CAPITAL LETTER U WITH TILDE
Ø = 330 = 216 = D8 = U+00D8 = C3 98 : LATIN CAPITAL LETTER O WITH STROKE
Ù = 331 = 217 = D9 = U+0172 = C5 B2 : LATIN CAPITAL LETTER U WITH OGONEK
Ú = 332 = 218 = DA = U+00DA = C3 9A : LATIN CAPITAL LETTER U WITH ACUTE
Û = 333 = 219 = DB = U+00DB = C3 9B : LATIN CAPITAL LETTER U WITH CIRCUMFLEX
Ü = 334 = 220 = DC = U+00DC = C3 9C : LATIN CAPITAL LETTER U WITH DIAERESIS
Ý = 335 = 221 = DD = U+00DD = C3 9D : LATIN CAPITAL LETTER Y WITH ACUTE
Þ = 336 = 222 = DE = U+00DE = C3 9E : LATIN CAPITAL LETTER THORN
ß = 337 = 223 = DF = U+00DF = C3 9F : LATIN SMALL LETTER SHARP S
à = 340 = 224 = E0 = U+0101 = C4 81 : LATIN SMALL LETTER A WITH MACRON
á = 341 = 225 = E1 = U+00E1 = C3 A1 : LATIN SMALL LETTER A WITH ACUTE
â = 342 = 226 = E2 = U+00E2 = C3 A2 : LATIN SMALL LETTER A WITH CIRCUMFLEX
ã = 343 = 227 = E3 = U+00E3 = C3 A3 : LATIN SMALL LETTER A WITH TILDE
ä = 344 = 228 = E4 = U+00E4 = C3 A4 : LATIN SMALL LETTER A WITH DIAERESIS
å = 345 = 229 = E5 = U+00E5 = C3 A5 : LATIN SMALL LETTER A WITH RING ABOVE
æ = 346 = 230 = E6 = U+00E6 = C3 A6 : LATIN SMALL LETTER AE
ç = 347 = 231 = E7 = U+012F = C4 AF : LATIN SMALL LETTER I WITH OGONEK
è = 350 = 232 = E8 = U+010D = C4 8D : LATIN SMALL LETTER C WITH CARON
é = 351 = 233 = E9 = U+00E9 = C3 A9 : LATIN SMALL LETTER E WITH ACUTE
ê = 352 = 234 = EA = U+0119 = C4 99 : LATIN SMALL LETTER E WITH OGONEK
ë = 353 = 235 = EB = U+00EB = C3 AB : LATIN SMALL LETTER E WITH DIAERESIS
ì = 354 = 236 = EC = U+0117 = C4 97 : LATIN SMALL LETTER E WITH DOT ABOVE
í = 355 = 237 = ED = U+00ED = C3 AD : LATIN SMALL LETTER I WITH ACUTE
î = 356 = 238 = EE = U+00EE = C3 AE : LATIN SMALL LETTER I WITH CIRCUMFLEX
ï = 357 = 239 = EF = U+00EF = C3 AF : LATIN SMALL LETTER I WITH DIAERESIS
ð = 360 = 240 = F0 = U+00F0 = C3 B0 : LATIN SMALL LETTER ETH
ñ = 361 = 241 = F1 = U+0146 = C5 86 : LATIN SMALL LETTER N WITH CEDILLA
ò = 362 = 242 = F2 = U+014D = C5 8D : LATIN SMALL LETTER O WITH MACRON
ó = 363 = 243 = F3 = U+00F3 = C3 B3 : LATIN SMALL LETTER O WITH ACUTE
ô = 364 = 244 = F4 = U+00F4 = C3 B4 : LATIN SMALL LETTER O WITH CIRCUMFLEX
õ = 365 = 245 = F5 = U+00F5 = C3 B5 : LATIN SMALL LETTER O WITH TILDE
ö = 366 = 246 = F6 = U+00F6 = C3 B6 : LATIN SMALL LETTER O WITH DIAERESIS
÷ = 367 = 247 = F7 = U+0169 = C5 A9 : LATIN SMALL LETTER U WITH TILDE
ø = 370 = 248 = F8 = U+00F8 = C3 B8 : LATIN SMALL LETTER O WITH STROKE
ù = 371 = 249 = F9 = U+0173 = C5 B3 : LATIN SMALL LETTER U WITH OGONEK
ú = 372 = 250 = FA = U+00FA = C3 BA : LATIN SMALL LETTER U WITH ACUTE
û = 373 = 251 = FB = U+00FB = C3 BB : LATIN SMALL LETTER U WITH CIRCUMFLEX
ü = 374 = 252 = FC = U+00FC = C3 BC : LATIN SMALL LETTER U WITH DIAERESIS
ý = 375 = 253 = FD = U+00FD = C3 BD : LATIN SMALL LETTER Y WITH ACUTE
þ = 376 = 254 = FE = U+00FE = C3 BE : LATIN SMALL LETTER THORN
ÿ = 377 = 255 = FF = U+0138 = C4 B8 : LATIN SMALL LETTER KRA
;;; -*- mode: Text; coding: thai-tis620; -*-
;
; Time-stamp: <2005-07-14 12:13:25 pete>
;
; Thai Glyphs
;
; oct dec hex UCS2 UTF-8
;=====================================
¡ = 241 = 161 = A1 = U+0E01 = E0 B8 81 : THAI CHARACTER KO KAI
¢ = 242 = 162 = A2 = U+0E02 = E0 B8 82 : THAI CHARACTER KHO KHAI
£ = 243 = 163 = A3 = U+0E03 = E0 B8 83 : THAI CHARACTER KHO KHUAT
¤ = 244 = 164 = A4 = U+0E04 = E0 B8 84 : THAI CHARACTER KHO KHWAI
¥ = 245 = 165 = A5 = U+0E05 = E0 B8 85 : THAI CHARACTER KHO KHON
¦ = 246 = 166 = A6 = U+0E06 = E0 B8 86 : THAI CHARACTER KHO RAKHANG
§ = 247 = 167 = A7 = U+0E07 = E0 B8 87 : THAI CHARACTER NGO NGU
¨ = 250 = 168 = A8 = U+0E08 = E0 B8 88 : THAI CHARACTER CHO CHAN
© = 251 = 169 = A9 = U+0E09 = E0 B8 89 : THAI CHARACTER CHO CHING
ª = 252 = 170 = AA = U+0E0A = E0 B8 8A : THAI CHARACTER CHO CHANG
« = 253 = 171 = AB = U+0E0B = E0 B8 8B : THAI CHARACTER SO SO
¬ = 254 = 172 = AC = U+0E0C = E0 B8 8C : THAI CHARACTER CHO CHOE
= 255 = 173 = AD = U+0E0D = E0 B8 8D : THAI CHARACTER YO YING
® = 256 = 174 = AE = U+0E0E = E0 B8 8E : THAI CHARACTER DO CHADA
¯ = 257 = 175 = AF = U+0E0F = E0 B8 8F : THAI CHARACTER TO PATAK
° = 260 = 176 = B0 = U+0E10 = E0 B8 90 : THAI CHARACTER THO THAN
± = 261 = 177 = B1 = U+0E11 = E0 B8 91 : THAI CHARACTER THO NANGMONTHO
² = 262 = 178 = B2 = U+0E12 = E0 B8 92 : THAI CHARACTER THO PHUTHAO
³ = 263 = 179 = B3 = U+0E13 = E0 B8 93 : THAI CHARACTER NO NEN
´ = 264 = 180 = B4 = U+0E14 = E0 B8 94 : THAI CHARACTER DO DEK
µ = 265 = 181 = B5 = U+0E15 = E0 B8 95 : THAI CHARACTER TO TAO
¶ = 266 = 182 = B6 = U+0E16 = E0 B8 96 : THAI CHARACTER THO THUNG
· = 267 = 183 = B7 = U+0E17 = E0 B8 97 : THAI CHARACTER THO THAHAN
¸ = 270 = 184 = B8 = U+0E18 = E0 B8 98 : THAI CHARACTER THO THONG
¹ = 271 = 185 = B9 = U+0E19 = E0 B8 99 : THAI CHARACTER NO NU
º = 272 = 186 = BA = U+0E1A = E0 B8 9A : THAI CHARACTER BO BAIMAI
» = 273 = 187 = BB = U+0E1B = E0 B8 9B : THAI CHARACTER PO PLA
¼ = 274 = 188 = BC = U+0E1C = E0 B8 9C : THAI CHARACTER PHO PHUNG
½ = 275 = 189 = BD = U+0E1D = E0 B8 9D : THAI CHARACTER FO FA
¾ = 276 = 190 = BE = U+0E1E = E0 B8 9E : THAI CHARACTER PHO PHAN
¿ = 277 = 191 = BF = U+0E1F = E0 B8 9F : THAI CHARACTER FO FAN
À = 300 = 192 = C0 = U+0E20 = E0 B8 A0 : THAI CHARACTER PHO SAMPHAO
Á = 301 = 193 = C1 = U+0E21 = E0 B8 A1 : THAI CHARACTER MO MA
 = 302 = 194 = C2 = U+0E22 = E0 B8 A2 : THAI CHARACTER YO YAK
à = 303 = 195 = C3 = U+0E23 = E0 B8 A3 : THAI CHARACTER RO RUA
Ä = 304 = 196 = C4 = U+0E24 = E0 B8 A4 : THAI CHARACTER RU
Å = 305 = 197 = C5 = U+0E25 = E0 B8 A5 : THAI CHARACTER LO LING
Æ = 306 = 198 = C6 = U+0E26 = E0 B8 A6 : THAI CHARACTER LU
Ç = 307 = 199 = C7 = U+0E27 = E0 B8 A7 : THAI CHARACTER WO WAEN
È = 310 = 200 = C8 = U+0E28 = E0 B8 A8 : THAI CHARACTER SO SALA
É = 311 = 201 = C9 = U+0E29 = E0 B8 A9 : THAI CHARACTER SO RUSI
Ê = 312 = 202 = CA = U+0E2A = E0 B8 AA : THAI CHARACTER SO SUA
Ë = 313 = 203 = CB = U+0E2B = E0 B8 AB : THAI CHARACTER HO HIP
Ì = 314 = 204 = CC = U+0E2C = E0 B8 AC : THAI CHARACTER LO CHULA
Í = 315 = 205 = CD = U+0E2D = E0 B8 AD : THAI CHARACTER O ANG
Î = 316 = 206 = CE = U+0E2E = E0 B8 AE : THAI CHARACTER HO NOKHUK
Ï = 317 = 207 = CF = U+0E2F = E0 B8 AF : THAI CHARACTER PAIYANNOI
Ð = 320 = 208 = D0 = U+0E30 = E0 B8 B0 : THAI CHARACTER SARA A
Ñ = 321 = 209 = D1 = U+0E31 = E0 B8 B1 : THAI CHARACTER MAI HAN-AKAT
Ò = 322 = 210 = D2 = U+0E32 = E0 B8 B2 : THAI CHARACTER SARA AA
Ó = 323 = 211 = D3 = U+0E33 = E0 B8 B3 : THAI CHARACTER SARA AM
Ô = 324 = 212 = D4 = U+0E34 = E0 B8 B4 : THAI CHARACTER SARA I
Õ = 325 = 213 = D5 = U+0E35 = E0 B8 B5 : THAI CHARACTER SARA II
Ö = 326 = 214 = D6 = U+0E36 = E0 B8 B6 : THAI CHARACTER SARA UE
× = 327 = 215 = D7 = U+0E37 = E0 B8 B7 : THAI CHARACTER SARA UEE
Ø = 330 = 216 = D8 = U+0E38 = E0 B8 B8 : THAI CHARACTER SARA U
Ù = 331 = 217 = D9 = U+0E39 = E0 B8 B9 : THAI CHARACTER SARA UU
Ú = 332 = 218 = DA = U+0E3A = E0 B8 BA : THAI CHARACTER PHINTHU
ß = 337 = 223 = DF = U+0E3F = E0 B8 BF : THAI CURRENCY SYMBOL BAHT
à = 340 = 224 = E0 = U+0E40 = E0 B9 80 : THAI CHARACTER SARA E
á = 341 = 225 = E1 = U+0E41 = E0 B9 81 : THAI CHARACTER SARA AE
â = 342 = 226 = E2 = U+0E42 = E0 B9 82 : THAI CHARACTER SARA O
ã = 343 = 227 = E3 = U+0E43 = E0 B9 83 : THAI CHARACTER SARA AI MAIMUAN
ä = 344 = 228 = E4 = U+0E44 = E0 B9 84 : THAI CHARACTER SARA AI MAIMALAI
å = 345 = 229 = E5 = U+0E45 = E0 B9 85 : THAI CHARACTER LAKKHANGYAO
æ = 346 = 230 = E6 = U+0E46 = E0 B9 86 : THAI CHARACTER MAIYAMOK
ç = 347 = 231 = E7 = U+0E47 = E0 B9 87 : THAI CHARACTER MAITAIKHU
è = 350 = 232 = E8 = U+0E48 = E0 B9 88 : THAI CHARACTER MAI EK
é = 351 = 233 = E9 = U+0E49 = E0 B9 89 : THAI CHARACTER MAI THO
ê = 352 = 234 = EA = U+0E4A = E0 B9 8A : THAI CHARACTER MAI TRI
ë = 353 = 235 = EB = U+0E4B = E0 B9 8B : THAI CHARACTER MAI CHATTAWA
ì = 354 = 236 = EC = U+0E4C = E0 B9 8C : THAI CHARACTER THANTHAKHAT
í = 355 = 237 = ED = U+0E4D = E0 B9 8D : THAI CHARACTER NIKHAHIT
î = 356 = 238 = EE = U+0E4E = E0 B9 8E : THAI CHARACTER YAMAKKAN
ï = 357 = 239 = EF = U+0E4F = E0 B9 8F : THAI CHARACTER FONGMAN
ð = 360 = 240 = F0 = U+0E50 = E0 B9 90 : THAI DIGIT ZERO
ñ = 361 = 241 = F1 = U+0E51 = E0 B9 91 : THAI DIGIT ONE
ò = 362 = 242 = F2 = U+0E52 = E0 B9 92 : THAI DIGIT TWO
ó = 363 = 243 = F3 = U+0E53 = E0 B9 93 : THAI DIGIT THREE
ô = 364 = 244 = F4 = U+0E54 = E0 B9 94 : THAI DIGIT FOUR
õ = 365 = 245 = F5 = U+0E55 = E0 B9 95 : THAI DIGIT FIVE
ö = 366 = 246 = F6 = U+0E56 = E0 B9 96 : THAI DIGIT SIX
÷ = 367 = 247 = F7 = U+0E57 = E0 B9 97 : THAI DIGIT SEVEN
ø = 370 = 248 = F8 = U+0E58 = E0 B9 98 : THAI DIGIT EIGHT
ù = 371 = 249 = F9 = U+0E59 = E0 B9 99 : THAI DIGIT NINE
ú = 372 = 250 = FA = U+0E5A = E0 B9 9A : THAI CHARACTER ANGKHANKHU
û = 373 = 251 = FB = U+0E5B = E0 B9 9B : THAI CHARACTER KHOMUT
--
Greetings
Pete
"A designer knows he has arrived at perfection not when there is no
longer anything to add, but when there is no longer anything to take
away."
-- Antoine de Saint-Exupéry
_______________________________________________
emacs-pretest-bug mailing list
[email protected]
http://lists.gnu.org/mailman/listinfo/emacs-pretest-bug