At 05:30 +0100 on 12/11/2008, Andrew Brown wrote about Re: Character coding:
>On 11 Dec 2008, at 02:44, Peter N Lewis wrote: > >>> I am trying to sort out some files that contain a number of entities, >>> a large number of correctly-coded accented characters (é for e >>> acute) > >> and a good many incorrectly-coded accented characters (È for e > >> acute). The reason why you are getting incorrect accented letters is due to you saving the files with the bad letters encoded as ISO-8859-1/Windows-1252 instead of MacRoman. The real codepoint for é is what MacRoman uses for È. When you read a ISO-8859-1/Windows-1252 encoded file and say it is MacRoman, the codepoints in the x80-xff range get remapped to their MacRoman Glyphs. If you try reading one of these files and telling BBEdit to use ISO-8859-1/Windows-1252 mapping, they should display the correct glyphs. -- Robert A. Rosenberg RAR Programming Systems Ltd. (845)-357-0931 - Home (646)-479-1984 - Cell Phone (646)-349-4025 - Fax --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "BBEdit Talk" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/bbedit?hl=en If you have a specific feature request or would like to report a suspected (or confirmed) problem with the software, please email to "[email protected]" rather than posting to the group. -~----------~----~----~----~------~----~------~--~---
