At 05:30 +0100 on 12/11/2008, Andrew Brown wrote about Re: Character coding:

>On 11 Dec 2008, at 02:44, Peter N Lewis wrote:
>
>>>  I am trying to sort out some files that contain a number of entities,
>>>  a large number of correctly-coded accented characters (é for e acute)
>>>  and a good many incorrectly-coded accented characters (È for e acute).

You are getting incorrect accented letters because the files with the 
bad letters were saved as ISO-8859-1/Windows-1252 rather than MacRoman. 
The byte that ISO-8859-1/Windows-1252 uses for é (0xE9) is the one 
that MacRoman uses for È. When you read an ISO-8859-1/Windows-1252 
encoded file and say it is MacRoman, the bytes in the 0x80-0xFF range 
get mapped to their MacRoman glyphs instead. If you reopen one of these 
files and tell BBEdit to use the ISO-8859-1/Windows-1252 mapping, it 
should display the correct glyphs.
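
If it helps to see the remapping outside BBEdit, here is a minimal 
Python sketch of the same effect. It is only an illustration, not 
anything BBEdit does internally: the sample bytes and the helper name 
repair_misread are made up for the example, and it assumes Python's 
standard "cp1252" and "mac_roman" codecs.

```python
# Byte 0xE9 is e-acute in ISO-8859-1/Windows-1252 but E-grave in MacRoman.
raw = b"caf\xe9"  # sample bytes, as a Windows-1252 editor would save "café"

as_cp1252 = raw.decode("cp1252")       # correct reading: "café"
as_macroman = raw.decode("mac_roman")  # wrong reading:   "cafÈ"


def repair_misread(text: str) -> str:
    """Undo a MacRoman misreading of Windows-1252 bytes.

    Hypothetical helper: re-encode the wrongly decoded text back to the
    original bytes, then decode those bytes with the encoding the file
    was really saved in.
    """
    return text.encode("mac_roman").decode("cp1252")


print(as_cp1252, as_macroman, repair_misread(as_macroman))
```

The round trip only applies to text that was misread in the first 
place; accented characters that were correctly coded should be left 
alone. A few byte values are undefined in Windows-1252, so the final 
decode can raise an error on unusual input.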
-- 


Robert A. Rosenberg
RAR Programming Systems Ltd.
(845)-357-0931 - Home
(646)-479-1984 - Cell Phone
(646)-349-4025 - Fax

