>1. a) would it make sense to split dictionary by functionality (for
>example, base, computer terms, human names etc) ?

In certain cases yes.

>b) what are the benefits and drawbacks of such an approach ?

Benefit: Avoid seldom used special words in general dictionary, 
that could cause overseeing real errors.
Drawbacks: If you leave out a necessary ones, good words 
will be marked as bad.

>c) if a dictionary is to be split this way, does hyphenator component
>also have to be split accordingly ?

 Do not know.

>2. a) what are the chances of adding a dictionary to official oo.org
>distribution if it is completely developed, tested and a company has
>undertaken it's supporting for several years ?

Good chances. It has to be copied to the official sources.

>b) does it matter in such a case what language the dictionary is
>(population size etc) ?

No.

>c) what steps are required, how long does such a process usually take
>and how closely must developers of such a dictionary work - and with
>whom - from oo.org community ?

Check the new dictionary very throughoutly, especially the affix file.
Copy the dictionary to a well known address.
I think, max 1 week.
They must contact the dictionary maintainer, R. Holt.

>of course, required process regarding native-lang projects should be
>complied with, but i'm sure there's more to it :)

>3. at the page http://lingucomponent.openoffice.org/, there is text :

>"MySpell is used to support spell checking in OpenOffice.org 1.x. It is 
>planned to replace MySpell with hunspell, which builds on MySpell but 
>supports Unicode and adds several other useful features."

>what is the current status of spellcheck component ? are there still 
>plans to replace it ? will replacing invalidate existing dictionaries ?

Since hunspell is 100% compatible to myspell, it is in fact a 
superset of it, all old dictionaries remain valid.

>4. there must be other things that are important to achieve this goal -
>there probably have been cases that we could learn from (both positive
>and negative). what are major obstacles and common mistakes ? what
>important principles must be considered ?

I think, it is a not very difficult to do.

Most of the errors are in the affix (*.aff) file, therefore 
it must be evaluated throughoutly before adding to Oo.

The other problems are with unknown character sets, like
African ones and the like.

Regards: Eleonora


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to