On Fri, 9 Jun 2000, Alexey Rodriguez wrote:

>       Good morning everyone, I finally managed to get some time to look
> at my problem. I discovered that htfuzzy has a small bug while parsing
> *.aff files. If you have the following rule:
> 
>       Z               >        -Z, CES          # audaz audaces
>                                     ^
>                                     .
>                                     .
>                                     |
>                                     htfuzzy will stop parsing the line
> after this space, so it strips the word ending (Z) but never appends the
> replacement (CES). This caused many duplicates among the generated words.
>       I fixed the problem with a lazy script that removed spaces after
> the comma. Even the "DB2 problem..." messages stopped appearing.
>       Maybe this is an issue that has already been addressed. IMHO it is
> not a good idea to (only) strip the spaces from the aff file; it would be
> better to fix the parsing code in EndingsDB.cc so that people with similar

Hehe, the file which needs some fixing is SuffixEntry.cc, not EndingsDB.cc.
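
For illustration only, here is a minimal sketch of whitespace-tolerant
parsing for a rule like the one quoted above. It is not the actual code in
SuffixEntry.cc: ParsedRule and parseRule are hypothetical names, and the
sketch assumes that '#' always starts a comment and that whitespace is never
significant inside a rule.

```cpp
#include <algorithm>
#include <cctype>
#include <string>

// Hypothetical holder for one parsed suffix rule (not ht://Dig's SuffixEntry).
struct ParsedRule {
    std::string condition;  // text left of '>', e.g. "Z"
    std::string strip;      // characters removed from the word ending, e.g. "Z"
    std::string add;        // characters appended afterwards, e.g. "CES"
    bool ok = false;        // true only if the line parsed as a rule
};

// Parse a line such as "Z  >  -Z, CES   # audaz audaces".
// Assumption: whitespace anywhere in the rule, including after the comma,
// carries no meaning and can be dropped before splitting.
ParsedRule parseRule(std::string line) {
    ParsedRule r;

    // Drop a trailing comment, if any.
    const std::string::size_type hash = line.find('#');
    if (hash != std::string::npos) line.erase(hash);

    // Remove all whitespace so "-Z, CES" and "-Z,CES" parse identically.
    line.erase(std::remove_if(line.begin(), line.end(),
                              [](unsigned char c) { return std::isspace(c) != 0; }),
               line.end());

    const std::string::size_type gt = line.find('>');
    if (gt == std::string::npos) return r;  // not a rule line

    r.condition = line.substr(0, gt);
    const std::string rhs = line.substr(gt + 1);

    if (!rhs.empty() && rhs[0] == '-') {
        // Form "-STRIP,ADD": strip part sits between '-' and ','.
        const std::string::size_type comma = rhs.find(',');
        if (comma == std::string::npos) return r;  // treated as malformed here
        r.strip = rhs.substr(1, comma - 1);
        r.add = rhs.substr(comma + 1);
    } else {
        // Form "ADD": nothing is stripped.
        r.add = rhs;
    }
    r.ok = true;
    return r;
}
```

With this approach the rule parses the same with or without the space after
the comma, so the aff file would not need to be preprocessed.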

> aff files won't have that problem. I can make the patch if you consider
> that necessary (Gilles? Geoff?).
>       Another issue that I encountered is that mungeWord doesn't handle
> accented words ('abaco -> ábaco). Is this normal, or must I fix the aff
> file (or the source, for instance)?
>       Thanks for reading.
>                                               Alexey
> 
> 
> 
> ------------------------------------
> To unsubscribe from the htdig mailing list, send a message to
> [EMAIL PROTECTED]
> You will receive a message to confirm this.
> 

