Re: [sqlite] UTF support

jose isaias cabrera Tue, 07 Oct 2014 14:22:00 -0700


"J Decker" wrote...

So, I guess it is technically not allowed to encode 11 bit unicode
characters as 16.
the greek characters are 0x3XX  which is 10 bits... I checked what
WideCharToMultiByte was doing and found it was using 11 bit encodings...
fixed my encoder to use an appropriate size for what's required, added 11

bit decoding, and now in and out works for that and some chinesecharacters

which are more than 11 bits.

The 'unrecognized token' is 0xE0  ? ... although a thing could be 12 bits

exactly... so is it checking ( char[0] == 0xe0 ) && ( ( char[1] & 0xE0 )==

0x80 )?

as a side note.. using visual studio to mouse over the resulting char *
string with 11 bit encodings it shows bad characters, if encoded as (valid
but illegal) 16 bit it browses correctly.

J,

My suggestion is for you to read about ANSI, ASCII, UTF7, UTF8, UTF16 andUTF32 and understand the ins and outs of the various encoding. You may needto create your own wrapper to get things to work correctly.

What happens if you create a text file using notepad and make sure that yousave it as UTF8 and then read that file with the content that you want?Then write it to SQLite and get it back and write it back to another file?Does that work?

_______________________________________________
sqlite-users mailing list
sqlite-users@sqlite.org
http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users

Re: [sqlite] UTF support

Reply via email to