Us ASCII will always be us ASCII in both encodings. UTF-8 is what you want so convert any non-utf to utf and you'll be fine
Terry Sent from my iPhone On Oct 22, 2010, at 12:44 PM, JC Ahangama <[email protected]> wrote: > I agree, sort of. > > The question is still why does US-ASCII letters show inside an HTML file > declared as charset utf-8 and letters like ð,þ, á show as glyph not found. I > did not *convert* US-ASCII. You will understand the problem only if you open > the attached HTML files on 3 tabs and compare. > > I still believe that somebody forgot something somewhere about 2004. > > Thanks. > > JC > > On Fri, Oct 22, 2010 at 5:29 AM, Terry Riegel > <[email protected]>wrote: > >> >> >> I think you are confused about utf-8 >> >> declaring iso-8859-1 text to be utf-8 doesn't make it utf-8 >> >> If you want it to be utf-8 you need to convert it to that. >> >> Terry >> >> >> On Oct 21, 2010, at 11:53 PM, JC Ahangama >> <[email protected]<ahangama%40gmail.com>> >> wrote: >> >>> I believe I know what is going on. >>> >>> The treatment of ISO-8859-1 set by Unicode is the culprit, at least in >> the >>> Windows machines. Please check the three versions of an HTML file for the >>> same text given at the bottom of the page. Characters outside ASCII that >> are >>> still within ISO-8859-1 (codepoints 128 thru 255) are not included in the >>> Unicode repertoire (as the last sample illustrates). The HTML pages do >> not >>> declare a font, and therefore, uses the Last Resort font of the system, >>> which demonstrably *has* the letters that UTF-8 set thinks are missing. >>> >>> If you are working with people that want to use Indic, it is best that >> they >>> transliterate their languages to ISO-8859-1 and display them by means of >>> orthographic fonts. Here are two web sites that illustrates it (The >> language >>> is Sinhala): >>> http://www.ahangama.com/ -- My Wordpress blog has both English and >> Sinhala >>> http://www.lovatasinhala.com/ -- has only *one* graphic, that of the >> lion. >>> >>> Regards, >>> >>> JC >>> >>> >>> 1. No character set declaration. Shows the text correctly because >> ISO-8859-1 >>> is the default charset. >>> ====================================================================== >>> <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"> >>> <html> >>> <head> >>> <title>ISO-8859-1 but no character set declared</title> >>> </head> >>> <body> >>> ASCII lc:<br> >>> <span style="font-size:20px;letter-spacing:8px;"> >>> abcdefghijklmnopqrstuvwxyz >>> </span><br><br> >>> Some non-English letters:<br> >>> <span style="font-size:20px;letter-spacing:8px;"> >>> ðþææéúíóáðçøçµëûüïöäÐçôçñ >>> </span> >>> </body> >>> </html> >>> ====================================================================== >>> >>> 2. character declared as iso-8859-1. The text shows correctly >>> ====================================================================== >>> <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"> >>> <html> >>> <head> >>> <title>ISO-8859-1 charset explicitly declared</title> >>> <meta http-equiv="Content-Type" content="text/html; Charset=iso-8859-1"> >>> </head> >>> <body> >>> ASCII lc:<br> >>> <span style="font-size:20px;letter-spacing:8px;"> >>> abcdefghijklmnopqrstuvwxyz >>> </span><br><br> >>> Some non-English letters:<br> >>> <span style="font-size:20px;letter-spacing:8px;"> >>> ðþææéúíóáðçøçµëûüïöäÐçôçñ >>> </span> >>> </body> >>> </html> >>> ====================================================================== >>> >>> 3. characters set declared as UTF-8. No European characters! >>> ====================================================================== >>> <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"> >>> <html> >>> <head> >>> <title>ISO-8859-1 charset explicitly declared</title> >>> <meta http-equiv="Content-Type" content="text/html; Charset=utf-8"> >>> </head> >>> <body> >>> ASCII lc:<br> >>> <span style="font-size:20px;letter-spacing:8px;"> >>> abcdefghijklmnopqrstuvwxyz >>> </span><br><br> >>> Some non-English letters:<br> >>> <span style="font-size:20px;letter-spacing:8px;"> >>> ðþææéúíóáðçøçµëûüïöäÐçôçñ >>> </span> >>> </body> >>> </html> >>> =================================== >>> >>> >>> [Non-text portions of this message have been removed] >>> >>> >>> >>> ------------------------------------ >>> >>> ----- >>> To unsubscribe send a message to: >> [email protected]<svg-developers-unsubscribe%40yahoogroups.com> >>> -or- >>> visit http://groups.yahoo.com/group/svg-developers and click "edit my >> membership" >>> ----Yahoo! Groups Links >>> >>> >>> >> >> > > > [Non-text portions of this message have been removed] > > > > ------------------------------------ > > ----- > To unsubscribe send a message to: [email protected] > -or- > visit http://groups.yahoo.com/group/svg-developers and click "edit my > membership" > ----Yahoo! Groups Links > > > ------------------------------------ ----- To unsubscribe send a message to: [email protected] -or- visit http://groups.yahoo.com/group/svg-developers and click "edit my membership" ----Yahoo! Groups Links <*> To visit your group on the web, go to: http://groups.yahoo.com/group/svg-developers/ <*> Your email settings: Individual Email | Traditional <*> To change settings online go to: http://groups.yahoo.com/group/svg-developers/join (Yahoo! ID required) <*> To change settings via email: [email protected] [email protected] <*> To unsubscribe from this group, send an email to: [email protected] <*> Your use of Yahoo! Groups is subject to: http://docs.yahoo.com/info/terms/

