Status: Untriaged
Owner: ----
CC: [email protected]
Labels: Type-Bug Pri-3 OS-All Area-WebKit Size-Medium I18N
New issue 15898 by [email protected]: Improve encoding detection for very short text, which may be useful for query display
http://code.google.com/p/chromium/issues/detail?id=15898

This is b/1082767.

ICU's encoding detector uses byte trigrams with binary values (good or bad), and the encoding with the most 'good' votes wins. This works reasonably well for reasonably long chunks of data (say, 200 bytes), but very poorly for short chunks, because a short chunk contains too few trigrams to produce a meaningful vote. Firefox, by contrast, uses byte unigrams and bigrams with weights.
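
To make the difference between the two approaches concrete, here is a minimal sketch in Python. It is not ICU's or Firefox's actual code: the function names (ngrams, trigram_votes, weighted_score, detect) and any model tables passed to them are hypothetical, chosen only to illustrate binary trigram voting versus weighted unigram/bigram scoring.

```python
from typing import Callable, Dict, List, Set


def ngrams(data: bytes, n: int) -> List[bytes]:
    """Return every contiguous n-byte window in data (empty if data is shorter than n)."""
    return [data[i:i + n] for i in range(len(data) - n + 1)]


def trigram_votes(data: bytes, good: Set[bytes]) -> int:
    """Binary voting: each trigram found in the 'good' set counts as one vote.

    'good' is a hypothetical per-encoding table of plausible trigrams.
    """
    return sum(1 for g in ngrams(data, 3) if g in good)


def weighted_score(data: bytes, weights: Dict[bytes, float]) -> float:
    """Weighted scoring: sum the weights of all unigrams and bigrams.

    'weights' is a hypothetical per-encoding table; unknown n-grams score 0.
    """
    return sum(weights.get(g, 0.0) for n in (1, 2) for g in ngrams(data, n))


def detect(data: bytes, scorers: Dict[str, Callable[[bytes], float]]) -> str:
    """Pick the encoding name whose scorer gives the highest score.

    Ties go to whichever name comes first in the dict.
    """
    return max(scorers, key=lambda name: scorers[name](data))
```

With a two-byte input, ngrams(data, 3) is empty, so every encoding receives zero trigram votes and the winner is decided by tie-breaking alone. The weighted scorer still sees two unigrams and one bigram, so it has some signal to rank encodings even for very short text.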
