---- On Tue, 23 Nov 2010 19:29:32 -0800 Kenneth Gonsalves  wrote ---- 

>On Wed, 2010-11-24 at 08:50 +0530, Kenneth Gonsalves wrote: 
>> in context of the discussion on counting tamil characters, here is one 
>> solution: 
> 
>if this is not readable in your mail client, the code is here: 
> 
>http://bitbucket.org/lawgon/tamtrans/src/21197e0f1388/syllcount.py 

The number of syllables is not the number of characters excluding vowels.
For example, மக்கள் has only 3 syllables, not 4:
ம, க்க, ள்

ஆண்டவரின் has 5 syllables: ஆ, ண்ட, வ, ரி, ன்

A syllable is a unit of pronunciation: http://en.wikipedia.org/wiki/Syllable

See the syllabalize_ta method in
https://github.com/santhoshtr/silpa/blob/master/src/silpa/modules/syllabalizer/syllabalizer.py
or try the online tool at http://silpa.smc.org.in/Syllabalize
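
For illustration only, here is a rough Python sketch of the grouping rule that the two examples above follow. It is not the SILPA code: the function name split_units_ta and the rule itself are assumptions inferred from those two words. The assumed rule is that vowel signs and the pulli stay with the preceding letter, and a letter that comes right after a pulli joins the same unit.

```python
import unicodedata

PULLI = "\u0bcd"  # Tamil virama (pulli); it suppresses the inherent vowel


def split_units_ta(word):
    """Split one Tamil word into syllable-like units (hypothetical helper).

    Assumed rule, inferred from the two examples in this message:
    - a combining mark (vowel sign, pulli) attaches to the current unit;
    - a letter directly after a pulli also attaches to the current unit;
    - any other letter starts a new unit.
    """
    units = []
    prev = ""
    for ch in word:
        # Unicode categories Mn/Mc cover Tamil vowel signs and the pulli.
        is_mark = unicodedata.category(ch).startswith("M")
        if units and (is_mark or prev == PULLI):
            units[-1] += ch
        else:
            units.append(ch)
        prev = ch
    return units


if __name__ == "__main__":
    for w in ("மக்கள்", "ஆண்டவரின்"):
        units = split_units_ta(w)
        print(w, len(units), units)
```

The sketch expects a single word with no spaces or punctuation, and it has only been checked against the two words above, so treat the result as an approximation.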

Thanks
Santhosh
_______________________________________________
ILUGC Mailing List:
http://www.ae.iitm.ac.in/mailman/listinfo/ilugc
