:
Hi Russell,
Thanks for the answer. Now, what you mean by 'munched'?
  1.. If you mean format of the .dic and .aff files (there are ASCII files):

Format is documented and rather easy to understand. Each basic form is in
its own line, along with several flags. So it is very easy to remove word
from .dic file. But I cannot see result of this.

2.  If you mean translating ASCII file to some internal format:

Yes, I would like to know when it is done. If after each modification I have
to reinstall dictionary again. Is it straightforward process or I need to
know something.




Regards

Kris

I am not sure of the details of the process, I have a list I am about to deal with, but there is lots of information here:

http://lingucomponent.openoffice.org/

IIUC "munching" uses the word list and the affix list to make a usable dictionary under hunspell. I have done it personally only once, but the list runs happily (it is a medical word list). I'll have to go back to the hunspell documentation again.

It isn't trivial, but not a very complex task, either. It helps if you have access to a linux machine, though you could probably use it under cygwin.

I have just been having a look and there are hunspell-unmunch and hunspell-munch functions. I haven't used the former at all.

Searching the lingucomponent mail list archives will probably be helpful to you..

Russell


--
You have posted to an OpenOffice.org mailing list. See:
http://www.openoffice.org/mail_list.html for details on how to subscribe
so that you can see responses provided by other users.
Please reply *only* to [email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to