I need some "guidance" with the extent of my dictionary files for
LibreOffice and OOo.
My largest dictionaries have about 638,000 words in the spelling
.dic file. I need to know how large is too large.
I found out this morning that if I compare that word list with a
combined list of chemical and medical words, over 98,000 words from
that combined list are not in the current .dic word list[s].
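
For anyone who wants to repeat that kind of comparison, here is a rough
Python sketch. It is not the exact procedure I used. It assumes the usual
Hunspell-style .dic layout (an entry count on the first line, then one
entry per line, optionally followed by "/FLAGS") and compares words
case-sensitively. The function names and the sample words are made up for
illustration.

```python
def read_dic_words(lines):
    """Return the set of bare words from Hunspell-style .dic lines."""
    words = set()
    for i, line in enumerate(lines):
        entry = line.strip()
        if not entry:
            continue
        # The first line of a .dic file is normally the entry count, not a word.
        if i == 0 and entry.isdigit():
            continue
        # Drop affix flags ("word/FLAGS") and anything after a tab.
        # Assumption: no entry contains an escaped slash ("\/").
        word = entry.split("\t", 1)[0].split("/", 1)[0].strip()
        if word:
            words.add(word)
    return words


def missing_words(dic_lines, extra_lines):
    """Words in the extra list that are absent from the .dic list, sorted."""
    return sorted(read_dic_words(extra_lines) - read_dic_words(dic_lines))


# Tiny made-up sample; real lists would be read from files instead.
sample_dic = ["3", "aspirin/S", "benzene", "table/SM"]
sample_extra = ["aspirin", "ibuprofen", "benzene", "acetaminophen"]
print(missing_words(sample_dic, sample_extra))
```

With real files, the lines could come from something like
open(path, encoding="utf-8"), assuming the lists are stored as UTF-8.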
Now here is the issue: how far should I take this project?
I am going to add all the "missing" words that are part of the
open-source community's lexicon that are not in the current lists, but
where do I stop, and how should I format the "finalized" files?
Should there be one super large list, or should I break it up into
sub-lists? Should the "standard" words go into one .dic file, while
medical, chemistry, and computer/tech words each have their own .dic
file within the .oxt file?
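
If the sub-list route turns out to be the better one, the split itself
could be scripted. The sketch below is only an illustration under my own
assumptions: each word is kept in exactly one list (the standard list
wins, then the categories in the order given), and each list is written
out as plain .dic text with the entry count on the first line. The names
split_into_sublists and format_dic are hypothetical, and affix flags and
the matching .aff files are left out entirely.

```python
def split_into_sublists(standard, categories):
    """Assign each word to exactly one list.

    The standard list takes priority; after that, each category keeps
    only the words that no earlier list has already claimed.
    """
    seen = set(standard)
    result = {"standard": sorted(seen)}
    for name, words in categories.items():
        unique = sorted(set(words) - seen)
        result[name] = unique
        seen.update(unique)
    return result


def format_dic(words):
    """Render words as .dic text: entry count first, then one word per line."""
    return "\n".join([str(len(words)), *words]) + "\n"


# Made-up sample data for illustration only.
sublists = split_into_sublists(
    ["cat", "dog"],
    {"medical": ["aspirin", "dog"], "chemistry": ["benzene", "aspirin"]},
)
for name, words in sublists.items():
    print(name)
    print(format_dic(words), end="")
```

Keeping each word in a single list would avoid carrying duplicates
across the .dic files, at the cost of making the category lists depend
on the standard list.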
Right now, there is an English dictionary [default one?] that includes
US, British, Canadian, and some other versions of English put together
as one .oxt file, but with separate .dic files. I was wondering if that
is the route I should take with my super-size dictionaries.
To be honest, 20 years ago the spelling dictionary project I was working
on had about 177,000 words, and I was told that the English language had
about 250,000 words. Now I have looked at a combined word list and it
has about 737K words in it, and there are more words/terms still needing
to be checked. The largest book-style dictionary now has 25+ volumes,
when it had only 15 about 15-20 years ago. So I really think the
final super-sized dictionary word list could go over one million words
in the next year or two. I just have to figure out if it is worth
building a list of that size for LO.
Your input would help me make the best US, British, and Canadian English
dictionaries out there for LibreOffice. This is for our users to use,
so it would be nice for users to let me know what they think.