Re: Unicode again

Dirk Wed, 16 Aug 2006 18:03:22 -0700

Does Cyrillic, or any other codepage, use the low 32 code points (i.e., control characters) for language characters? As far as I can see, the error in the LOADVSSNAMES thread was due to that, not due to an incorrect choice of codepage.

I don't know, and you are right, the conversion does not prevent "bad" characters. There is one benefit, though: MultiByteToWideChar will convert invalid characters into the default character defined for the codepage. So if a character is extracted that makes no sense in the specified codepage, it will automatically be mapped to that default character.
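To illustrate the two behaviours described above, here is a minimal sketch in Python rather than the Win32 API. It is only an analogy: `to_unicode` is a hypothetical helper, Python's `errors="replace"` stands in for the codepage default character, and Python's codec tables are not guaranteed to match what MultiByteToWideChar does for the same codepage.

```python
def to_unicode(raw: bytes, codepage: str = "cp855") -> str:
    """Decode raw bytes, substituting U+FFFD for bytes the codepage does not define.

    Hypothetical helper: errors="replace" plays the role of the
    "default character" substitution described in the message.
    """
    return raw.decode(codepage, errors="replace")


# A control character (0x01) is a valid code point, so it survives
# the conversion unchanged. The conversion does not filter it.
with_control = to_unicode(b"A\x01B", "cp855")

# Byte 0x81 is undefined in Python's cp1252 table, so it is replaced.
with_invalid = to_unicode(b"A\x81B", "cp1252")

print(repr(with_control))
print(repr(with_invalid))
```

The first case is the one that matters for the LOADVSSNAMES problem: a conversion step alone leaves control characters in the output.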

It might work equally well to declare the output XML to be Windows-855, pass the VSS strings through unconverted, and let Perl do the conversion to Unicode within its XML parser.
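As a rough sketch of that idea (in Python rather than Perl, purely for illustration): the bytes are written out unconverted, the XML declaration names the codepage, and the parser does the decoding. The helper `parse_declared` and the `Comment` element are hypothetical, and `cp855` is Python's codec name, assumed here to correspond to the codepage called Windows-855 above.

```python
import xml.etree.ElementTree as ET


def parse_declared(payload: bytes, encoding: str = "cp855") -> ET.Element:
    """Prefix an XML declaration naming the codepage and let the parser decode.

    Hypothetical helper; the payload bytes are passed through untouched.
    """
    header = f'<?xml version="1.0" encoding="{encoding}"?>'.encode("ascii")
    return ET.fromstring(header + payload)


# Simulate a string as it would sit in the VSS database: raw codepage bytes.
raw = "Привет".encode("cp855")

root = parse_declared(b"<Comment>" + raw + b"</Comment>")
print(root.text)
```

This only covers well-formed input; a raw control character in the payload would still be rejected by the parser.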

Yes, this should also work, except for the discouraged characters. So it is possible that we would transfer junk that must be filtered on the vss2svn side before it is passed to the XML reader.
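Such a filter could look roughly like the following sketch (Python for illustration; the function name and the choice to drop rather than replace characters are assumptions, not vss2svn code). It removes the C0 control characters that XML 1.0 does not allow, keeping tab, line feed, and carriage return.

```python
import re

# C0 controls that are not legal in XML 1.0 text:
# everything below 0x20 except tab (0x09), LF (0x0A), and CR (0x0D).
_XML_ILLEGAL = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f]")


def strip_xml_illegal(text: str, replacement: str = "") -> str:
    """Remove (or replace) control characters an XML 1.0 reader would reject."""
    return _XML_ILLEGAL.sub(replacement, text)


cleaned = strip_xml_illegal("ab\x01c\td\n")
print(repr(cleaned))
```

Passing `replacement="?"` instead would keep a visible marker where junk was removed, which may be easier to debug.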

Just another question: do you know whether it is possible to specify the thread codepage? All the functions I can find specify the locale, and with it the codepage. But there seems to be no function to specify the codepage itself, something like SetACP or SetThreadACP.


Dirk
_______________________________________________
vss2svn-users mailing list
Project homepage:
http://www.pumacode.org/projects/vss2svn/
Subscribe/Unsubscribe/Admin:
http://lists.pumacode.org/mailman/listinfo/vss2svn-users-lists.pumacode.org
Mailing list web interface (with searchable archives):
http://dir.gmane.org/gmane.comp.version-control.subversion.vss2svn.user
