On 2/14/07, Karsten Düsterloh <[EMAIL PROTECTED]> wrote:
> Well, how should this work?
> If all characters are below 0x80, it's most probably(!) ASCII, and else?
Can Javascript do this type of comparison? I'd assume you'd have to
compare each char to do so... but still this wouldn't be reliable.

> What makes 0xA4 be a euro sign instead of mere currency symbol?

I'm not sure if these are rhetorical questions or not. :(

I have found some interesting things though:
Universal Encoding Detector written in Python
http://chardet.feedparser.org/docs/how-it-works.html

which linked to this article,
http://www.mozilla.org/projects/intl/UniversalCharsetDetection.html

Haven't read them yet, but seems promising. I know Mozilla has some
detection built in.. but no scriptable interfaces, it would seem...
_______________________________________________
Project_owners mailing list
[email protected]
http://mozdev.org/mailman/listinfo/project_owners

Reply via email to