Does Cyrillic, or any other codepage, use the low 32 code points (ie. control characters) for language characters? As far as I can see, the error in the LOADVSSNAMES thread was due to that, not due to an incorrect choice of codepage.
I don't know, and you are right, the conversion does not prevent "bad" characters. So there is one benefit: MultiByteToWideChar will convert invalid characters into the defined default characters for the codepage. So if a character is extracted, that makes no sense in the specified codepage, it will be automatically mapped to the defined default character.
It might work equally well to declare the output XML to be Windows-855, pass the VSS strings through unconverted, and let Perl do the conversion to Unicode within its XML parser.
Yes. This should work also, except for the discouraged characters. So it might be possible, that we transfer junk which must be filtered on the vss2svn side before passing it to the XML reader.
Just another question: Do you know whether it is possible to specify the ThreadCodepage. All functions that I find specify the locale and with it the codepage. But there is no function to specify the codepage itself, like SetACP or SetThreadACP.
Dirk _______________________________________________ vss2svn-users mailing list Project homepage: http://www.pumacode.org/projects/vss2svn/ Subscribe/Unsubscribe/Admin: http://lists.pumacode.org/mailman/listinfo/vss2svn-users-lists.pumacode.org Mailing list web interface (with searchable archives): http://dir.gmane.org/gmane.comp.version-control.subversion.vss2svn.user