Hi Stephan,

> I would expect the positions of 'b' being 2 and 3, resp., as that are the
> lengths of the strings as returned by unicode.utf8.len. However,
> unicode.utf8.find seems to have another notion of the length of a string.

It is documented. (Well, sort of: you need to download the slunicode library and look into 'unittest'.)

-- NOTE: find positions are in bytes for all ctypes!
-- use ascii.sub to cut found ranges!
-- this is a) faster b) more reliable

Patrick
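
P.S. A minimal sketch of the byte/character mismatch, in plain Lua so that it runs without slunicode. The string "äb" and the helper utf8_len are made up for illustration and are not taken from your test case. Plain string.find and string.sub are byte-based; going by the note above, unicode.utf8.find should report the same byte offsets, and unicode.ascii.sub is the matching byte-based cut.

```lua
-- Hypothetical example string: "äb" in UTF-8. 'ä' is two bytes, 'b' is one.
local s = "\195\164b"

-- Illustrative helper (not part of slunicode): count characters by counting
-- every byte that is not a UTF-8 continuation byte (128..191).
local function utf8_len(str)
  local _, n = str:gsub("[^\128-\191]", "")
  return n
end

-- Plain find (4th argument true), so positions are byte offsets.
local from, to = string.find(s, "b", 1, true)

print(utf8_len(s))              -- 2: length in characters
print(from, to)                 -- 3 3: position of 'b' in bytes
print(string.sub(s, from, to))  -- b: byte offsets go with a byte-based sub
```

So 'b' sits at byte 3 although the string is only 2 characters long, which matches the kind of offset you are seeing. Feeding those offsets to a character-based sub would cut the wrong range.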
