Hi Stephan,

> I would expect the positions of 'b' being 2 and 3, resp., as that are the 
> lengths of the strings as returned by unicode.utf8.len.  However, 
> unicode.utf8.find seems to have another notion of the length of a string.

It is documented: (Well, sort of, you need to downlaod the slunicode library 
and look into 'unittest'.)

--      NOTE: find positions are in bytes for all ctypes!
--      use ascii.sub to cut found ranges!
--      this is a) faster b) more reliable

Patrick


Reply via email to