All,

I ran our three encoding detectors on the text/html/xml files in our regression corpus. Results are here: http://162.242.228.174/encoding_detection/

I haven't had a chance to do any analysis yet. Let me know if you find anything of interest.

Best,
Tim
