:
Hi Russell,
Thanks for the answer. Now, what you mean by 'munched'?
1.. If you mean format of the .dic and .aff files (there are ASCII files):
Format is documented and rather easy to understand. Each basic form is in
its own line, along with several flags. So it is very easy to remove word
from .dic file. But I cannot see result of this.
2. If you mean translating ASCII file to some internal format:
Yes, I would like to know when it is done. If after each modification I have
to reinstall dictionary again. Is it straightforward process or I need to
know something.
Regards
Kris
I am not sure of the details of the process, I have a list I am about to
deal with, but there is lots of information here:
http://lingucomponent.openoffice.org/
IIUC "munching" uses the word list and the affix list to make a usable
dictionary under hunspell. I have done it personally only once, but the
list runs happily (it is a medical word list). I'll have to go back to
the hunspell documentation again.
It isn't trivial, but not a very complex task, either. It helps if you
have access to a linux machine, though you could probably use it under
cygwin.
I have just been having a look and there are hunspell-unmunch and
hunspell-munch functions. I haven't used the former at all.
Searching the lingucomponent mail list archives will probably be helpful
to you..
Russell
--
You have posted to an OpenOffice.org mailing list. See:
http://www.openoffice.org/mail_list.html for details on how to subscribe
so that you can see responses provided by other users.
Please reply *only* to [email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]