RE: [native-lang] Status update season!

2006-12-20 Thread Meelad Zakaria
On Wed, 2006-12-20 at 10:11 +0500, Murod Latifov wrote:
 Hello Milad,

Regards :)
 
 Persian and Tajik have many similarities, e.g. in grammar; the main
 difference is the alphabet. Persian uses Arabic letters, Tajik uses
 Cyrillic. I divided words into groups by the type of prefixes and
 suffixes they accept (say, nouns, and within nouns special sub-rules
 for particular types of nouns). I have made some progress with Tajik
 grammar and it works fine; we now need to add dictionary entries. I can
 read and write Tajik in Persian. I also created a small OOo Writer
 script that converts Tajik words into Persian, which means the letters
 are converted to Arabic and the word can be read by a Persian speaker.

As you know, in Persian some sounds are associated with more than one
letter. For example, the s sound can be written with 3 different
letters, and replacing one of these letters with another sometimes
produces a different word with a different meaning. So unless the
converter you wrote uses a database with a one-to-one relation between
Cyrillic and Persian words, the result may be readable for Persian
readers but is not necessarily correct.
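
To make the ambiguity concrete, here is a minimal Python sketch. It is not
the OOo Writer script mentioned above: the letter table, the function names
and the sample wordlists are all assumptions made up for illustration, and
the table covers only three letters. It shows that a letter-by-letter
conversion yields several candidate spellings for one Tajik word, and that
a wordlist can narrow them down but cannot always pick a single one.

```python
from itertools import product

# Illustrative, incomplete letter table (an assumption, not real project data).
# Each Cyrillic letter maps to every Perso-Arabic letter that can spell its sound.
CANDIDATES = {
    "с": ["س", "ص", "ث"],  # the "s" sound: three possible letters
    "д": ["د"],
    "а": [""],             # simplification: short vowel left unwritten
}

def spellings(tajik_word):
    """Return every Perso-Arabic spelling the letter table allows."""
    # Letters missing from the table are passed through unchanged.
    options = [CANDIDATES.get(ch, [ch]) for ch in tajik_word]
    return ["".join(combo) for combo in product(*options)]

def convert(tajik_word, lexicon):
    """Keep only the candidate spellings found in a Persian wordlist."""
    return [w for w in spellings(tajik_word) if w in lexicon]

all_forms = spellings("сад")            # three candidates, one per "s" letter
one_match = convert("сад", {"صد"})       # wordlist resolves the ambiguity
two_matches = convert("сад", {"سد", "صد"})  # wordlist alone cannot decide
```

When more than one candidate is a valid Persian word, as in the last call,
only a word-to-word mapping between the two languages can choose the
intended spelling.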

 This is a very simple conversion, with no proper logic for how words
 are spelled in either language. I think our collaboration will give
 more results than doing it separately.
 Also, we lack computer terminology; it would be great if we could have
 some sort of discussion in this regard.

There are some efforts in this regard by the Academy of Persian Language
and other sources, and some books have been published on the subject,
though none of them is considered final yet.

  You can call me on
 Skype [EMAIL PROTECTED] and we can speak Persian.
 
 I wrote to you once before but there was no reply. 
 
I'm really sorry about that, I must have messed something up. I'm
looking forward to that...

Luck,
-mee

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: [native-lang] Status update season!

2006-12-20 Thread Meelad Zakaria
On Wed, 2006-12-20 at 01:48 +0100, Lars Aronsson wrote:

 OpenOffice.org currently uses Hunspell.  Its dictionary format is 
 compatible with the Myspell used earlier, and very similar to the 
 older Aspell and Ispell.  There is a fa-demo to download from 
 http://wiki.services.openoffice.org/wiki/Dictionary
 that contains 332554 words in Persian.  According to the included 
 README file, this is the same dictionary as is listed on 
 www.aspell.net for GNU Aspell 0.60, maintained by Babak Mahmoudi 
 and Edwin Hakopian.

That dictionary is pretty outdated, and it contains several wrong
spellings and even wrong words.

  The included fa.aff file is empty.  Perhaps 
 affixes are not applicable to Persian?  I have no idea.

Affixes do apply in Persian, but some combinations, though correct by
grammatical logic, are considered wrong because they have never been
used in Persian texts. So auto-generated lists don't work, and we should
use a flat wordlist with all non-eligible words excluded.
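
As a rough illustration of that point, the Python sketch below expands a
few stems with a few suffixes and then filters the result against a set of
attested forms. The stems, suffixes (in Latin transliteration), the
attested set and the function names are all hypothetical examples, not
data from the Academy list or from any existing dictionary.

```python
def expand(stems, suffixes):
    """Generate every stem and stem+suffix form, as a naive affix rule would."""
    forms = set(stems)
    for stem in stems:
        for suffix in suffixes:
            forms.add(stem + suffix)
    return forms

def flat_wordlist(generated, attested):
    """Keep only the generated forms that occur in real texts."""
    return sorted(generated & attested)

# Hypothetical sample data in Latin transliteration.
stems = ["ketab", "derakht"]
suffixes = ["ha", "an"]
attested = {"ketab", "ketabha", "derakht", "derakhtha", "derakhtan"}

generated = expand(stems, suffixes)         # includes the unattested "ketaban"
wordlist = flat_wordlist(generated, attested)
```

The affix expansion over-generates ("ketaban" is produced but not
attested), so the spellchecker would be fed the filtered flat list rather
than the rules themselves.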
 
 For most languages, there is only one dictionary, maintained by a 
 single person or a small team, who produces packages adapted for 
 each spell checking software.
 
In Persian, there have been different approaches to writing, and the
form of some words has changed considerably in the last 10 years, so
people mostly follow their own taste in choosing the written form.
The Academy of Persian Language is considered the governing body for
such issues, and it has been working on generating a unified wordlist
of acceptable words. There are also some new laws obliging official
documents to use this version, so the thing to do is to use this list
for the spellchecker. My dear friend Roozbeh Pournader is working on a
spellchecker for aspell, based on the Academy list and checked
manually, and I hope to use that for OOo as well.

Thanks,
-mee
