eryk sun <eryk...@gmail.com>: > Windows NTFS doesn't normalize names to a canonical form. It also > allows lone surrogate codes, which is invalid UTF-16.
Somewhat related, surrogate codes are invalid Unicode and shouldn't be allowed in Unicode strings. However, Python does allow them. Marko -- https://mail.python.org/mailman/listinfo/python-list