Re: Fontsets and ispell

Kenichi Handa Sun, 20 Feb 2005 18:37:57 -0800

In article <[EMAIL PROTECTED]>, Peter Dyballa <[EMAIL PROTECTED]> writes:


> Hello!
> I tried to spell-check a LaTeX file in an AUCTeX (version: 11.55) 
> buffer. When ispell hit the word sérifs (of sans-sérifs) this error was 
> reported:

>       Debugger entered--Lisp error: (error "Ispell misalignment: word 
> `s\x8e9rifs' point 1574; probably incompatible versions")
[...]
> So obviously the é was passed as something else but itself. C-u C-x = 
> reports this:

>         character: é (07551, 3945, 0xf69, U+00E9)
>           charset: latin-iso8859-15 (Right-Hand Part of Latin Alphabet 9 
> (ISO/IEC 8859-15): ISO-IR-203.)
[...]
> Although it's stated 'buffer code: 0x8E 0xE9' the buffer's modeline 
> starts with "-0:--", i.e. ISO Latin-15. ispell is set up to work like 
> this customization:

>   '(ispell-local-dictionary-alist (quote (("english" "[a-zA-Z]" 
> "[^a-zA-Z]" "[']" t ("-C") "~tex" iso-8859-1) (nil 
> "[a-zA-Z�������������������������������������������]" 
> "[^a-zA-Z�������������������������������������������]" "" t ("-d" 
> "german") "~latin1" iso-8859-1))))

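Emacs distinguishes latin-1 and latin-15 characters, so the
latin-15 é in your buffer does not match the latin-1
characters listed in your dictionary entry.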

Please specify CASECHARS and NOT-CASECHARS as unibyte strings
(i.e. write each non-ASCII iso-8859-1 byte in octal form),
without using '-' between eight-bit characters (see
lisp/textmodes/ispell.el).  The latest Emacs then generates a
proper multibyte string that covers all equivalent
characters in CASECHARS and NOT-CASECHARS.
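
For illustration, an entry written that way might look roughly
like the sketch below.  The set of accented letters here is only
an assumption for the example (it is not your original list);
each non-ASCII letter is given as its iso-8859-1 byte in octal
and listed individually, with no range between eight-bit bytes:

    ;; Illustrative only: \304=Ä \326=Ö \334=Ü \344=ä \351=é
    ;; \366=ö \337=ß \374=ü (iso-8859-1 byte values in octal).
    ;; The ASCII ranges a-z and A-Z are still fine.
    (setq ispell-local-dictionary-alist
          '((nil
             "[a-zA-Z\304\326\334\344\351\366\337\374]"
             "[^a-zA-Z\304\326\334\344\351\366\337\374]"
             "" t ("-d" "german") "~latin1" iso-8859-1)))

A string literal that contains only ASCII characters and octal
escapes like these is read as a unibyte string.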

I've just installed this change in the docstring of
ispell-dictionary-alist, but it seems the English is not
good.   Could someone please improve it?

*** ispell.el   04 Feb 2005 09:28:49 +0900      1.156
--- ispell.el   21 Feb 2005 11:13:16 +0900      
***************
*** 671,676 ****
--- 671,682 ----
  If you want OTHERCHARS to be empty, use the empty string.
  Hint: regexp syntax requires the hyphen to be declared first here.
  
+ CASECHARS, NOT-CASECHARS, and OTHERCHARS must be unibyte strings
+ containing bytes of CHARACTER-SET.  In addition, if they contain
+ a non-ASCII byte, the regular expression must be a single
+ `character set' construct that doesn't specify a character range
+ for non-ASCII bytes.
+ 
  MANY-OTHERCHARS-P is non-nil when multiple OTHERCHARS are allowed in a word.
  Otherwise only a single OTHERCHARS character is allowed to be part of any
  single word.

---
Ken'ichi HANDA
[EMAIL PROTECTED]


_______________________________________________
Emacs-pretest-bug mailing list
[email protected]
http://lists.gnu.org/mailman/listinfo/emacs-pretest-bug
