Hi, It seems the sv.wl file is broken, because many of the word have missing spaces. The errors reported are many words which lacks spaces between them. There are a lot of smaller chains of words which also lacks spaces. What can have gone wrong?
Best regards, /Kenneth -------------------------------------------------------- Kenneth Johansson Technical Documentation Manager Sectra Imtec AB Teknikringen 20 E-mail: ke-joh AT sectra DOT se SE-583 30 Link?ping Phone: +46 13 23 52 00 SWEDEN Web: http://www.sectra.se > -----Original Message----- > From: Hussein Shafie [mailto:hussein at pixware.fr] > Sent: den 16 april 2004 13:17 > To: Kenneth Johansson > Cc: xmleditor-support at xmlmind.com > Subject: Re: [XXE] Error when building dictionary for XXE > > * "word rejected" error messages are due to the lack of .hints file. > > * Exception "java.lang.ArrayIndexOutOfBoundsException: 100" > is clearly a bug. It is triggerred by words which are more > than 100 characters long. > We'll fix this bug in next release of XMLmind Spell-Checker. > > However, I easily managed to generate a Swedish dictionary > using the word list you sent me. Here's what I did: > > [1] I downloaded the Dictionary Builder from > http://www.xmlmind.com/spellchecker/dictbuilder.shtml > > [2] I copied your word list sv.wl in dictkits/sv_src/, after > renaming it to sv.wl.ORI > > [3] I ran the attached Tcl script: > > --- > #!/bin/sh > # > # Next line is both a Tcl comment and a shell command \ exec > tclsh "$0" "$@" > > set wl [open sv.wl.ORI r] > set words [read $wl] > close $wl > > foreach word $words { > if {[string length $word] >= 100} { > puts stderr $word > } else { > puts $word > } > } > --- > > --- > $ prune.tcl > sv.wl > --- > > The script rejected the following few words (what do they mean? :-): > > --- > d?ngtd?rd?rand?ravd?rbortad?refterd?remelland?remotd?restd?rf? > rd?rhemmad?rh?nd?rid?riblandd?rifr?nd?rigenomd?rind?rinned?rj? > mted?rmedd?rnered?rn?std?romd?romkringd?rovand?rp?d?rst?desd?r > tilld?runderd?rupped?rupp?d?rurd?rutavd?ruted?rutf?rd?rutid?ru > tifr?nd?rutinnand?rut?verd?rvidd?rvidlagd?r?verd?r?td?st > femtedelarsfemtifemtiofemtioenfemtioettfemtiofemfemtiofyrafemt > iondefemtioniofemtiosexfemtiosjufemtiotal > fyrsiffrigtfyrtifyrtiofyrtioenfyrtioettfyrtiofemfyrtiofyrafyrt > iondefyrtioniofyrtiosexfyrtiosjufyrtiotal > nitratetnittinittionittioennittioettnittiofemnittiofyranittion > denittionionittiosexnittiosjunittiotal > sexigtsexrumssextisextiosextioensextioettsextiofaltsextiofemse > xtiofyrasextiondesextioniosextiosexsextiosjusextiotal > sjunketsjunkersjunkitsjurumssjuttisjuttiosjuttioensjuttioettsj > uttiofemsjuttiofyrasjuttiondesjuttioniosjuttiosexsjuttiosjusjuttiotal > sjuttiotalistisktsjuttiotresjuttiotv?sjuttio?rssjuttio?ttasjut > tonsjuttondesjuttonhundrasjuttonhundratal > tresiffrigttrettitrettiotrettioentrettioetttrettiofalttrettiof > emtrettiofyratrettiondetrettioniotrettiosextrettiosjutrettiotal > --- > > [3] In dictkits/sv_src/ which already contains proper > sv.hints and sv.freq, I ran "make" which reported: > > --- > $ make > ../../builder -cs ISO8859_1 -hints sv.hints -freq sv.freq sv.wl -o > ../sv/base.cdi > word rejected: mobiliseringensMo?ambiquemod file sv.wl read > 375409 nodes > cd ..; jar cf sv.dar sv > --- > > [4] I got sv.dar which I copied to ~/.xxe2/spell/. > > (from XXE V2.5p3, spell-checker dictionaries, that is .dar > files, are searched, not only in XXE_install_dir/spell/, but also in > XXE_user_preferences_dir/spell/.) > > [5] I was able to spell-check some swedish with XXE. > > > > > Kenneth Johansson wrote: > > > > I've built an ISpell dictionary to build an XXE dictionary. > When I try > > to use the builder to build the output .cdi file it fails. > What am I > > doing wrong. The .wl file looks fine to me. > > > > ke-joh:31> builder.bat -cs ISO8859_2 svenska.wl -o sv.cdi . > > Lots of errors like below. > > . > > word rejected: fels?gningar > > word rejected: fels?gnings > > word rejected: fels?gningarnas > > word rejected: fels?gningens > > word rejected: fels?gningarsfeltolka > > word rejected: femfaldigtfemhundrafemh?rning word rejected: > > femh?rningarna word rejected: femh?rningen word rejected: > femh?rningar > > word rejected: femh?rnings word rejected: femh?rningarnas word > > rejected: femh?rningens word rejected: femh?rningarsfeminin > > java.lang.ArrayIndexOutOfBoundsException: 100 > > at > azcheck.engine.CompiledTLexBuilder.addWord(Unknown Source) > > at > azcheck.engine.CompiledTLexBuilder.addWord(Unknown Source) > > at > azcheck.engine.CompiledTLexBuilder.processTextFile(Unknown > > Source) > > at azcheck.engine.CompiledTLexBuilder.main(Unknown Source) > > ** java.lang.ArrayIndexOutOfBoundsException: 100 > > >

