>1. a) would it make sense to split dictionary by functionality (for >example, base, computer terms, human names etc) ?
In certain cases yes. >b) what are the benefits and drawbacks of such an approach ? Benefit: Avoid seldom used special words in general dictionary, that could cause overseeing real errors. Drawbacks: If you leave out a necessary ones, good words will be marked as bad. >c) if a dictionary is to be split this way, does hyphenator component >also have to be split accordingly ? Do not know. >2. a) what are the chances of adding a dictionary to official oo.org >distribution if it is completely developed, tested and a company has >undertaken it's supporting for several years ? Good chances. It has to be copied to the official sources. >b) does it matter in such a case what language the dictionary is >(population size etc) ? No. >c) what steps are required, how long does such a process usually take >and how closely must developers of such a dictionary work - and with >whom - from oo.org community ? Check the new dictionary very throughoutly, especially the affix file. Copy the dictionary to a well known address. I think, max 1 week. They must contact the dictionary maintainer, R. Holt. >of course, required process regarding native-lang projects should be >complied with, but i'm sure there's more to it :) >3. at the page http://lingucomponent.openoffice.org/, there is text : >"MySpell is used to support spell checking in OpenOffice.org 1.x. It is >planned to replace MySpell with hunspell, which builds on MySpell but >supports Unicode and adds several other useful features." >what is the current status of spellcheck component ? are there still >plans to replace it ? will replacing invalidate existing dictionaries ? Since hunspell is 100% compatible to myspell, it is in fact a superset of it, all old dictionaries remain valid. >4. there must be other things that are important to achieve this goal - >there probably have been cases that we could learn from (both positive >and negative). what are major obstacles and common mistakes ? what >important principles must be considered ? I think, it is a not very difficult to do. Most of the errors are in the affix (*.aff) file, therefore it must be evaluated throughoutly before adding to Oo. The other problems are with unknown character sets, like African ones and the like. Regards: Eleonora --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
