Guido van Rossum writes: > To the contrary, an encoding-guessing module is often needed, and > guessing can be done with a pretty high success rate. Other Unicode > libraries (e.g. ICU) contain guessing modules. I suppose the API could > return two values: the guessed encoding and a confidence indicator. > Note that the locale settings might figure in the guess.
Not locale settings, but user configuration. A Bayesian detector (CodeBayes? hi, Skip!) might be a good way to go for servers, while a simple language preference might really up the probability for user agents. _______________________________________________ Python-Dev mailing list Python-Dev@python.org http://mail.python.org/mailman/listinfo/python-dev Unsubscribe: http://mail.python.org/mailman/options/python-dev/archive%40mail-archive.com