On Fri, Jun 25, 2010 at 7:33 PM, Kenneth Gonsalves wrote:
> what is so big about tamil mails? it is just unicode which is not much
> bigger than ascii
>
I did not mean that Unicode itself takes up the space. In fact, I assumed
English mails were also encoded in UTF-8.
Tamil mails are bigger because:
* Tamil sentences tend to be longer (have more words) than their English
equivalents.
Translate any English passage and you will see what I mean.
* Words written in Tamil usually take up more characters than in English
(or the same number; the reverse is rare).
Examples:
GNU - குனு [3 in English - 4 in Tamil]
Linux - லினக்ஸ் [5 - 7]
Python - பைத்தான் [6 - 8]
*All acronyms*
And if, as you say, English mails are in ASCII and only mails in other
languages get UTF-8 encoding in Mailman, then there is a three-way increase
in size, isn't there?
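
To check the numbers, here is a minimal sketch in Python. It assumes the
counts above are Unicode code points and that the mail body is sent as raw
UTF-8 (a base64 or quoted-printable transfer encoding would change the byte
counts). The helper name sizes() is only for illustration.

```python
# Compare character (code point) counts and UTF-8 byte sizes
# for the English/Tamil word pairs listed in the examples.
pairs = [("GNU", "குனு"), ("Linux", "லினக்ஸ்"), ("Python", "பைத்தான்")]


def sizes(text):
    """Return (number of code points, number of UTF-8 bytes) for text."""
    return len(text), len(text.encode("utf-8"))


for english, tamil in pairs:
    # Print both words with their (code points, bytes) pairs side by side.
    print(english, sizes(english), tamil, sizes(tamil))
```

Every code point in the Tamil block (U+0B80 to U+0BFF) takes 3 bytes in
UTF-8, against 1 byte for an ASCII letter, so the byte count grows faster
than the character count.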
PS: I'm not arguing that the size limit should be increased, or that Tamil
mails always go missing.
Rather, my concern is that legitimate Tamil mails can end up blocked.
--
அகிலன்(Akilan R)
(http://www.coding-aviator.blogspot.com)
"I should have no use for a paradise in which I should be deprived of the
right to prefer hell."
--Jean Rostand
_______________________________________________
ILUGC Mailing List:
http://www.ae.iitm.ac.in/mailman/listinfo/ilugc