Hi Clytie, I would like to talk with your colleague to see if we can
join our efforts to have a better dictionary. Ask him to drop me an
email with his progress and IM contact.
1. Currently my Vietnamese Ooo dictionary it's the normal Free
Vietnamese Dictionary Project but reformated for Ooo. I couldn't make a
more complex dictionary because of my lack of knowledge about the
language, but I think Duy's idea about recognition of compound words is
great, but difficult, he may need help from me in technical aspects.
2. I remember having forward to the Dicooo team the Dicooo translation
you sent to me but I never got answer thought.
3. For the Ooo 3.x dictionary packaging that would be very easy, because
it's a issue of packaging as an extension, the dictionary file itself
won't change, for packaging, I will just take the packaging script from
any other OOo 3.x dictionary and use it for the vietnamese.
4. My vietnamese wife and me are doing good in Singapore, however, we
feel observed and controlled by the Big Brother :-) , this country still
lacks of Freedom of Speech. We are returning to Spain at the end of the
year :-) after spending 2 month in Vietnam from August.
Best Regards.
Ivan.
Clytie Siddall wrote:
Hi again :)
I've chased up a new Vietnamese dictionary being developed by Nguyễn
Thái Ngọc Duy, a colleague of mine. He has been working on improving
recognition of compound words. Where English adds syllables to create
more complex words, Vietnamese adds single-syllable words. Even our
simpler 'words' are usually more than word one long, e.g.
screen
màn hình
Spellchecking is difficult in our language anyway, since there are so
many viable single-syllable words. It becomes even more complex when a
single "word" is expressed by more than one single-syllable word. So
if I type
màu hình
for "screen", typing a "u" instead of the final "n" in the first word,
a spellchecker will not flag that as an error, because "màu" on its
own is a valid word in Vietnamese (it means "colour").
So Duy has been working on better recognition of compound words in our
language. Unfortunately, he has run into some nasty technical
difficulties, and has not had time to solve them. He has pointed me to
his word list [1], which may be more current than the standard
Hunspell or Aspell Vietnamese wordlist, and has invited us to use it
in any way that would benefit OpenOffice.org.
This word list is based on the 2007 list from the Free Vietnamese
Dictionary Project [2]. Wiktionary is also based on this project: it
is our best dictionary resource.
Ivan, would this newer word list be useful in your dictionary? Could
you use it to update the current word list?
I'm keen to help with making sure we have a viable spellchecker in
OpenOffice.org 3.0 Vietnamese, but I'm not sure where to start. :S
from Clytie
Vietnamese Free Software Translation Team
http://vnoss.net/dokuwiki/doku.php?id=projects:l10n
[1] http://dev.gentoo.org/~pclouds/files/words-vi-0.1.tar.gz
[2] http://www.informatik.uni-leipzig.de/~duc/software/misc/vn_words.zip
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]