> 2) Scan all the strings in the current document for non-latin-1 (e.g. UTF-8) characters
I must have misunderstood something here. A string of octets may simultaneously be valid Latin-1 text and valid UTF-8 text. For example, 0xce 0xa3 is UTF-8 for Greek capital sigma, U+03A3, but is also Latin-1 for the two-character sequence capital-I-circumflex pound-sign. Or does the "current document" being scanned store text in some way which does not have this ambiguity?

der Mouse
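To make the ambiguity concrete, here is a minimal Python sketch. It decodes the octet pair 0xce 0xa3 both ways; the helper name could_be_utf8 is hypothetical and is not taken from any gEDA code. It only illustrates why scanning octets cannot, by itself, tell the two encodings apart.

```python
# Illustrative sketch only: the same two octets decode cleanly under both encodings.
data = bytes([0xCE, 0xA3])

as_utf8 = data.decode("utf-8")      # one character: U+03A3, Greek capital sigma
as_latin1 = data.decode("latin-1")  # two characters: U+00CE, U+00A3


def could_be_utf8(octets: bytes) -> bool:
    """Hypothetical helper: True if the octets form well-formed UTF-8.

    A True result does not prove the text was written as UTF-8, since
    every octet string is also decodable as Latin-1.
    """
    try:
        octets.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


if __name__ == "__main__":
    print([f"U+{ord(c):04X}" for c in as_utf8])
    print([f"U+{ord(c):04X}" for c in as_latin1])
    print(could_be_utf8(data))
```

A check like this can only rule UTF-8 out (when the octets are malformed), never rule Latin-1 out, so the scan would need some other source of information about the encoding.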
