Re: [libreoffice-users] calling all users - need missing terms for spelling dictionaries

2012-06-27 Thread toki
On 06/24/2012 05:01 PM, webmaster-Kracked_P_P wrote:

 BUT, I would like to make sure all of my .oxt dictionaries have the 
 words/terms we use every day in articles and email support for LibreOffice 
 and other open source related items.

If you have the disk capacity, then:
* Download the Wikipedia article database;
* Run a script that writes each word it finds into a file;
* Manually go through the list, to pick up misspellings;
* Merge the correct words list into your existing wordlist;
* Merge the known misspelling list into the autocorrect list;

Two potential issues with this approach:
* Names of individuals, organizations, and things are included;
* Foreign words are included;

Whilst there are ways to eliminate both of those problems, the usual
result, when doing so using scripts, is that legitimate words in the
target language are removed, along with the foreign word, or nouns.  As
one example, the Afrikaans dictionary omitted the word die for several
years, because the script that was used to eliminate non-Afrikaans
words, read that word as the English die.

jonathon

-- 
For unsubscribe instructions e-mail to: users+h...@global.libreoffice.org
Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot be deleted



[libreoffice-users] calling all users - need missing terms for spelling dictionaries

2012-06-24 Thread webmaster-Kracked_P_P


I know that there are words and terms that are not included in many 
spelling dictionaries.  I even found a spelling word list that someone 
forgot to add dictionary to.  BUT, I would like to make sure all of my 
.oxt dictionaries have the words/terms we use every day in articles and 
email support for LibreOffice and other open source related items.


I want to make sure these newer terms are included.

So I am asking for people to send to me lists of correctly spelled words 
and terms - Off the List - in a text file with one word per line [if 
there are a lot of them].  Then I will merge the lists and make sure 
they are in the .oxt English dictionaries I will be updating sometime soon.


I could do this alone, but many of LO's users have been working with 
open source [and FOSS] longer than I have and you could/should know some 
words and terms that are missed in other spell checking systems.


I know there are missing words and terms in this field of computing, but 
I would take any word that is missing.  I would love to get a hold of a 
good list of geology and other scientific terms that may be missed.  I 
would love to create a specific dictionary to add on to any others with 
these specific words and terms in the science and technology field.  I 
was working on creating a script that would search documents for words 
that I do not currently have, but you must trust the spelling of the 
documents first.  Any word that is sent to me must be spell correctly to 
be accepted.  Any word/term I do not know will be checked with online 
sources for proper spelling.


So would any user like to help with creating a large list of science and 
technical words/terms that could be used to create a separate tech 
dictionary for LO?  I was working on it last year, but medical issues 
put a hold in that project.  I hope to restart it soon.  I would love to 
have 10 to 20 thousand words/terms in such a dictionary.  These words 
will be generic English terms that are used in English 
documents/emails/publications/etc. from the California Coast to the 
streets of London.  How many of the current products out there have 
FOSS or other term used today and not 10 or more years ago.


So, any takers?
Do you like the idea of a separate tech dictionary [or sets of] for LO?


--
For unsubscribe instructions e-mail to: users+h...@global.libreoffice.org
Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot be deleted



Re: [libreoffice-users] calling all users - need missing terms for spelling dictionaries

2012-06-24 Thread Libre User
Here is a link to the Geology dictionary at 
geology.com.  http://geology.com/dictionary/glossary-a.shtml  It has 
1 page per letter of the alphabet and is a true dictionary with 
definition, not just a list of terms.


I hope this helps.

Jerry


At 10:01 AM 6/24/2012, you wrote:

I know that there are words and terms that are not included in many 
spelling dictionaries.  I even found a spelling word list that 
someone forgot to add dictionary to.  BUT, I would like to make 
sure all of my .oxt dictionaries have the words/terms we use every 
day in articles and email support for LibreOffice and other open 
source related items.


I want to make sure these newer terms are included.

So I am asking for people to send to me lists of correctly spelled 
words and terms - Off the List - in a text file with one word per 
line [if there are a lot of them].  Then I will merge the lists and 
make sure they are in the .oxt English dictionaries I will be 
updating sometime soon.


I could do this alone, but many of LO's users have been working with 
open source [and FOSS] longer than I have and you could/should know 
some words and terms that are missed in other spell checking systems.


I know there are missing words and terms in this field of computing, 
but I would take any word that is missing.  I would love to get a 
hold of a good list of geology and other scientific terms that may 
be missed.  I would love to create a specific dictionary to add on 
to any others with these specific words and terms in the science and 
technology field.  I was working on creating a script that would 
search documents for words that I do not currently have, but you 
must trust the spelling of the documents first.  Any word that is 
sent to me must be spell correctly to be accepted.  Any word/term I 
do not know will be checked with online sources for proper spelling.


So would any user like to help with creating a large list of science 
and technical words/terms that could be used to create a separate 
tech dictionary for LO?  I was working on it last year, but 
medical issues put a hold in that project.  I hope to restart it 
soon.  I would love to have 10 to 20 thousand words/terms in such a 
dictionary.  These words will be generic English terms that are 
used in English documents/emails/publications/etc. from the 
California Coast to the streets of London.  How many of the current 
products out there have FOSS or other term used today and not 10 
or more years ago.


So, any takers?
Do you like the idea of a separate tech dictionary [or sets of] for LO?


--
For unsubscribe instructions e-mail to: users+h...@global.libreoffice.org
Problems? 
http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/

Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot be deleted


--
For unsubscribe instructions e-mail to: users+h...@global.libreoffice.org
Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot be deleted



Re: [libreoffice-users] calling all users - need missing terms for spelling dictionaries

2012-06-24 Thread webmaster-Kracked_P_P


What I was hoping to do would be create one for LO, like you can find 
for medical and chemical terms.


I know that there was at least one user who asked if we know of where 
he/she could find one for she/her field of science.


There are a lot of fields of education that have their own terms that 
people in the educational field - teacher and student - could use spell 
checking dictionaries for the terms used in their fiend of study or 
teaching.  I sure could have used one when I had to type up my notes for 
my classes in college.  Back then MSO did not have a lot of the terms in 
its spelling system.



On 06/24/2012 06:18 PM, Libre User wrote:
Here is a link to the Geology dictionary at geology.com.  
http://geology.com/dictionary/glossary-a.shtml  It has 1 page per 
letter of the alphabet and is a true dictionary with definition, not 
just a list of terms.


I hope this helps.

Jerry


At 10:01 AM 6/24/2012, you wrote:

I know that there are words and terms that are not included in many 
spelling dictionaries.  I even found a spelling word list that 
someone forgot to add dictionary to. BUT, I would like to make sure 
all of my .oxt dictionaries have the words/terms we use every day in 
articles and email support for LibreOffice and other open source 
related items.


I want to make sure these newer terms are included.

So I am asking for people to send to me lists of correctly spelled 
words and terms - Off the List - in a text file with one word per 
line [if there are a lot of them].  Then I will merge the lists and 
make sure they are in the .oxt English dictionaries I will be 
updating sometime soon.


I could do this alone, but many of LO's users have been working with 
open source [and FOSS] longer than I have and you could/should know 
some words and terms that are missed in other spell checking systems.


I know there are missing words and terms in this field of computing, 
but I would take any word that is missing.  I would love to get a 
hold of a good list of geology and other scientific terms that may be 
missed.  I would love to create a specific dictionary to add on to 
any others with these specific words and terms in the science and 
technology field.  I was working on creating a script that would 
search documents for words that I do not currently have, but you must 
trust the spelling of the documents first.  Any word that is sent to 
me must be spell correctly to be accepted.  Any word/term I do not 
know will be checked with online sources for proper spelling.


So would any user like to help with creating a large list of science 
and technical words/terms that could be used to create a separate 
tech dictionary for LO?  I was working on it last year, but medical 
issues put a hold in that project.  I hope to restart it soon.  I 
would love to have 10 to 20 thousand words/terms in such a 
dictionary.  These words will be generic English terms that are 
used in English documents/emails/publications/etc. from the 
California Coast to the streets of London.  How many of the current 
products out there have FOSS or other term used today and not 10 or 
more years ago.


So, any takers?
Do you like the idea of a separate tech dictionary [or sets of] for 
LO?



--
For unsubscribe instructions e-mail to: 
users+h...@global.libreoffice.org
Problems? 
http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/

Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot 
be deleted






--
For unsubscribe instructions e-mail to: users+h...@global.libreoffice.org
Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot be deleted