Hi,

It seems the sv.wl file is broken, because many of the words are missing
spaces. The errors reported are for many words that lack spaces between them.
There are also a lot of shorter chains of words that lack spaces. What can
have gone wrong?
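
One rough way to see how widespread the problem is: run-together words show up as unusually long entries, so listing the longest lines of the word list gives a quick overview. The sketch below assumes one entry per line and a single-byte encoding such as ISO 8859-1 (so bytes equal characters). The function name long_words and the default threshold of 30 characters are assumptions made for illustration, not anything the builder uses.

```shell
# Hypothetical helper: print "length<TAB>line" for every line of a word list
# whose length is at least MIN characters, longest first.
# The default MIN of 30 is an arbitrary, assumed threshold.
long_words() {
    file=$1
    min=${2:-30}
    # LC_ALL=C makes awk count bytes, which equals characters
    # for single-byte encodings such as ISO 8859-1.
    LC_ALL=C awk -v min="$min" \
        'length($0) >= min { printf "%d\t%s\n", length($0), $0 }' "$file" |
        sort -rn
}
```

For example, "long_words sv.wl 30 | head" would show the ten longest suspicious entries.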

Best regards,

/Kenneth

--------------------------------------------------------
Kenneth Johansson       Technical Documentation Manager
Sectra Imtec AB
Teknikringen 20         E-mail: ke-joh AT sectra DOT se
SE-583 30 Linköping     Phone:  +46 13 23 52 00
SWEDEN                  Web:    http://www.sectra.se 

> -----Original Message-----
> From: Hussein Shafie [mailto:hussein at pixware.fr] 
> Sent: den 16 april 2004 13:17
> To: Kenneth Johansson
> Cc: xmleditor-support at xmlmind.com
> Subject: Re: [XXE] Error when building dictionary for XXE
> 
> * "word rejected" error messages are due to the lack of a .hints file.
> 
> * Exception "java.lang.ArrayIndexOutOfBoundsException: 100" 
> is clearly a bug. It is triggered by words which are more 
> than 100 characters long. 
> We'll fix this bug in the next release of XMLmind Spell-Checker.
> 
> However, I easily managed to generate a Swedish dictionary 
> using the word list you sent me. Here's what I did:
> 
> [1] I downloaded the Dictionary Builder from 
> http://www.xmlmind.com/spellchecker/dictbuilder.shtml
> 
> [2] I copied your word list sv.wl into dictkits/sv_src/, after 
> renaming it to sv.wl.ORI
> 
> [3] I ran the attached Tcl script:
> 
> ---
> #!/bin/sh
> #
> # Next line is both a Tcl comment and a shell command \
> exec tclsh "$0" "$@"
> 
> set wl [open sv.wl.ORI r]
> set words [read $wl]
> close $wl
> 
> foreach word $words {
>      if {[string length $word] >= 100} {
>       puts stderr $word
>      } else {
>       puts $word
>      }
> }
> ---
> 
> ---
> $ prune.tcl > sv.wl
> ---
> 
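
For anyone without tclsh at hand, the filtering step quoted above can be approximated in plain shell. This is only a sketch, not the script that was actually run: like the Tcl version it treats the input as whitespace-separated words, sends words of 100 or more characters to stderr and everything else to stdout. The function name prune_words and its optional length argument are assumptions, and a single-byte encoding such as ISO 8859-1 is assumed so that bytes equal characters.

```shell
# Hypothetical shell approximation of the quoted Tcl filter.
# Reads whitespace-separated words from stdin; words whose length is
# at least MAX (default 100) go to stderr, all others to stdout.
prune_words() {
    max=${1:-100}
    # LC_ALL=C makes awk count bytes, which equals characters
    # for single-byte encodings such as ISO 8859-1.
    LC_ALL=C awk -v max="$max" '
        {
            for (i = 1; i <= NF; i++) {
                if (length($i) >= max) {
                    print $i > "/dev/stderr"
                } else {
                    print $i
                }
            }
        }
    '
}
```

It would be used in the same way as the Tcl script, for example "prune_words < sv.wl.ORI > sv.wl".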
> The script rejected the following few words (what do they mean? :-):
> 
> ---
> d?ngtd?rd?rand?ravd?rbortad?refterd?remelland?remotd?restd?rf?
> rd?rhemmad?rh?nd?rid?riblandd?rifr?nd?rigenomd?rind?rinned?rj?
> mted?rmedd?rnered?rn?std?romd?romkringd?rovand?rp?d?rst?desd?r
> tilld?runderd?rupped?rupp?d?rurd?rutavd?ruted?rutf?rd?rutid?ru
> tifr?nd?rutinnand?rut?verd?rvidd?rvidlagd?r?verd?r?td?st
> femtedelarsfemtifemtiofemtioenfemtioettfemtiofemfemtiofyrafemt
> iondefemtioniofemtiosexfemtiosjufemtiotal
> fyrsiffrigtfyrtifyrtiofyrtioenfyrtioettfyrtiofemfyrtiofyrafyrt
> iondefyrtioniofyrtiosexfyrtiosjufyrtiotal
> nitratetnittinittionittioennittioettnittiofemnittiofyranittion
> denittionionittiosexnittiosjunittiotal
> sexigtsexrumssextisextiosextioensextioettsextiofaltsextiofemse
> xtiofyrasextiondesextioniosextiosexsextiosjusextiotal
> sjunketsjunkersjunkitsjurumssjuttisjuttiosjuttioensjuttioettsj
> uttiofemsjuttiofyrasjuttiondesjuttioniosjuttiosexsjuttiosjusjuttiotal
> sjuttiotalistisktsjuttiotresjuttiotv?sjuttio?rssjuttio?ttasjut
> tonsjuttondesjuttonhundrasjuttonhundratal
> tresiffrigttrettitrettiotrettioentrettioetttrettiofalttrettiof
> emtrettiofyratrettiondetrettioniotrettiosextrettiosjutrettiotal
> ---
> 
> [3] In dictkits/sv_src/, which already contains proper 
> sv.hints and sv.freq files, I ran "make", which reported:
> 
> ---
> $ make
> ../../builder -cs ISO8859_1 -hints sv.hints -freq sv.freq   sv.wl -o ../sv/base.cdi
> word rejected: mobiliseringensMo?ambiquemod
> file sv.wl read
> 375409 nodes
> cd ..; jar cf sv.dar sv
> ---
> 
> [4] I got sv.dar which I copied to ~/.xxe2/spell/.
> 
> (Starting with XXE V2.5p3, spell-checker dictionaries, that is, .dar 
> files, are searched for not only in XXE_install_dir/spell/, but also in
> XXE_user_preferences_dir/spell/.)
> 
> [5] I was able to spell-check some Swedish with XXE.
> 
> 
> 
> 
> Kenneth Johansson wrote:
> > 
> > I've built an ISpell dictionary in order to build an XXE dictionary.
> > When I try to use the builder to build the output .cdi file, it fails.
> > What am I doing wrong? The .wl file looks fine to me.
> > 
> > ke-joh:31> builder.bat -cs ISO8859_2 svenska.wl -o sv.cdi
> > .
> > Lots of errors like below.
> > .
> > word rejected: fels?gningar
> > word rejected: fels?gnings
> > word rejected: fels?gningarnas
> > word rejected: fels?gningens
> > word rejected: fels?gningarsfeltolka
> > word rejected: femfaldigtfemhundrafemh?rning
> > word rejected: femh?rningarna
> > word rejected: femh?rningen
> > word rejected: femh?rningar
> > word rejected: femh?rnings
> > word rejected: femh?rningarnas
> > word rejected: femh?rningens
> > word rejected: femh?rningarsfeminin
> > java.lang.ArrayIndexOutOfBoundsException: 100
> >         at azcheck.engine.CompiledTLexBuilder.addWord(Unknown Source)
> >         at azcheck.engine.CompiledTLexBuilder.addWord(Unknown Source)
> >         at azcheck.engine.CompiledTLexBuilder.processTextFile(Unknown Source)
> >         at azcheck.engine.CompiledTLexBuilder.main(Unknown Source)
> > ** java.lang.ArrayIndexOutOfBoundsException: 100
> > 
> 

