Ray Wurlod wrote:
Yes, and yet no.

With NLS enabled, internal storage uses an idiosyncratic variant of UTF8 (sometimes called 
UV-UTF8) in which the mark characters Char(248) through Char(255) are always preserved as 
single-byte characters, so that the usual occupants of these Unicode code points are re-mapped 
to the "user defined" area.

I did once publish (on this forum) the rules for UV-UTF8 encoding, so they should be in the archives. In particular, it's not two bytes per character; it's somewhere between one and four bytes per character.

So it sounds like I can reasonably expect any search string I pass to an LDAP server to be in UTF8, as long as I parse the user input with uniseq and unichar.
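
For anyone checking the "one to four bytes" point against what an LDAP server will expect, here is a minimal sketch in Python (not UniVerse BASIC) of the byte widths in standard UTF-8. It does not reproduce the UV-UTF8 rules, which are not given in this thread. The helper name utf8_len and the sample characters are made up for illustration. What it shows is that in standard UTF-8 the code points 248 through 255 each take two bytes, which is exactly where Ray says the internal variant differs.

```python
def utf8_len(ch: str) -> int:
    """Number of bytes one character occupies in standard UTF-8."""
    return len(ch.encode("utf-8"))


# Illustrative samples only: ASCII, two Latin-1 letters, the euro sign,
# and one character outside the Basic Multilingual Plane.
samples = ["A", "\u00e9", "\u00fe", "\u20ac", "\U0001F600"]
widths = {f"U+{ord(c):04X}": utf8_len(c) for c in samples}
print(widths)

# Code points 248..255 in standard UTF-8 (NOT the UV-UTF8 variant,
# which per the post above keeps these as single bytes internally).
mark_range = [utf8_len(chr(cp)) for cp in range(248, 256)]
print(mark_range)
```

So if a search string can contain characters in that range, it seems worth confirming that what leaves the application is standard UTF-8 rather than the internal form.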


Thanks again, Ray.


Craig
-------
u2-users mailing list
[EMAIL PROTECTED]
To unsubscribe please visit http://listserver.u2ug.org/
