On Nov 11, 2009, at 8:25 AM, Karl Kleinpaste wrote:

> Jonathan Morgan <[email protected]> writes:
>> We had a similar problem in BPBible (and I think we would have been
>> using the same files).
> 
> XEmacs identifies it in our ui/languages file as U+00E5, which matches
> what's found in the GNOME character map application when searching it
> for "ring".  I still don't grasp how it could be a problem.

There is a difference between code points and their encoding.

U+00E5 is the unicode code point, not the encoding. In hex the utf-8 encoding 
would be C3 A5. In ISO-8859-1, it would be E5. 

Not in your case, but in decomposed form in utf-8 would be 61 CC 8A, which is 
an U+0061U+030A, where U+030A is a "Combining Ring Above".

So I'd suggest looking at a hex dump to see what the encoding is.

Regarding the handling of \u00e5 in vc++, it might be that it is taking that 
and interpreting it as a cp1252 encoded character. Maybe, one has to tell vc++ 
that the string is to be understood as a utf-8 string. I've run into something 
similar in perl and in native2ascii.

In Him,
        DM


_______________________________________________
sword-devel mailing list: [email protected]
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

Reply via email to