Us ASCII will always be us ASCII in both encodings.

UTF-8 is what you want so convert any non-utf to utf and you'll be fine

Terry

Sent from my iPhone

On Oct 22, 2010, at 12:44 PM, JC Ahangama <[email protected]> wrote:

> I agree, sort of.
> 
> The question is still why does US-ASCII letters show inside an HTML file
> declared as charset utf-8 and letters like ð,þ, á show as glyph not found. I
> did not *convert* US-ASCII. You will understand the problem only if you open
> the attached HTML files on 3 tabs and compare.
> 
> I still believe that somebody forgot something somewhere about 2004.
> 
> Thanks.
> 
> JC
> 
> On Fri, Oct 22, 2010 at 5:29 AM, Terry Riegel
> <[email protected]>wrote:
> 
>> 
>> 
>> I think you are confused about utf-8
>> 
>> declaring iso-8859-1 text to be utf-8 doesn't make it utf-8
>> 
>> If you want it to be utf-8 you need to convert it to that.
>> 
>> Terry
>> 
>> 
>> On Oct 21, 2010, at 11:53 PM, JC Ahangama 
>> <[email protected]<ahangama%40gmail.com>>
>> wrote:
>> 
>>> I believe I know what is going on.
>>> 
>>> The treatment of ISO-8859-1 set by Unicode is the culprit, at least in
>> the
>>> Windows machines. Please check the three versions of an HTML file for the
>>> same text given at the bottom of the page. Characters outside ASCII that
>> are
>>> still within ISO-8859-1 (codepoints 128 thru 255) are not included in the
>>> Unicode repertoire (as the last sample illustrates). The HTML pages do
>> not
>>> declare a font, and therefore, uses the Last Resort font of the system,
>>> which demonstrably *has* the letters that UTF-8 set thinks are missing.
>>> 
>>> If you are working with people that want to use Indic, it is best that
>> they
>>> transliterate their languages to ISO-8859-1 and display them by means of
>>> orthographic fonts. Here are two web sites that illustrates it (The
>> language
>>> is Sinhala):
>>> http://www.ahangama.com/ -- My Wordpress blog has both English and
>> Sinhala
>>> http://www.lovatasinhala.com/ -- has only *one* graphic, that of the
>> lion.
>>> 
>>> Regards,
>>> 
>>> JC
>>> 
>>> 
>>> 1. No character set declaration. Shows the text correctly because
>> ISO-8859-1
>>> is the default charset.
>>> ======================================================================
>>> <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
>>> <html>
>>> <head>
>>> <title>ISO-8859-1 but no character set declared</title>
>>> </head>
>>> <body>
>>> ASCII lc:<br>
>>> <span style="font-size:20px;letter-spacing:8px;">
>>> abcdefghijklmnopqrstuvwxyz
>>> </span><br><br>
>>> Some non-English letters:<br>
>>> <span style="font-size:20px;letter-spacing:8px;">
>>> ðþææéúíóáðçøçµëûüïöäÐçôçñ
>>> </span>
>>> </body>
>>> </html>
>>> ======================================================================
>>> 
>>> 2. character declared as iso-8859-1. The text shows correctly
>>> ======================================================================
>>> <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
>>> <html>
>>> <head>
>>> <title>ISO-8859-1 charset explicitly declared</title>
>>> <meta http-equiv="Content-Type" content="text/html; Charset=iso-8859-1">
>>> </head>
>>> <body>
>>> ASCII lc:<br>
>>> <span style="font-size:20px;letter-spacing:8px;">
>>> abcdefghijklmnopqrstuvwxyz
>>> </span><br><br>
>>> Some non-English letters:<br>
>>> <span style="font-size:20px;letter-spacing:8px;">
>>> ðþææéúíóáðçøçµëûüïöäÐçôçñ
>>> </span>
>>> </body>
>>> </html>
>>> ======================================================================
>>> 
>>> 3. characters set declared as UTF-8. No European characters!
>>> ======================================================================
>>> <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
>>> <html>
>>> <head>
>>> <title>ISO-8859-1 charset explicitly declared</title>
>>> <meta http-equiv="Content-Type" content="text/html; Charset=utf-8">
>>> </head>
>>> <body>
>>> ASCII lc:<br>
>>> <span style="font-size:20px;letter-spacing:8px;">
>>> abcdefghijklmnopqrstuvwxyz
>>> </span><br><br>
>>> Some non-English letters:<br>
>>> <span style="font-size:20px;letter-spacing:8px;">
>>> ðþææéúíóáðçøçµëûüïöäÐçôçñ
>>> </span>
>>> </body>
>>> </html>
>>> ===================================
>>> 
>>> 
>>> [Non-text portions of this message have been removed]
>>> 
>>> 
>>> 
>>> ------------------------------------
>>> 
>>> -----
>>> To unsubscribe send a message to:
>> [email protected]<svg-developers-unsubscribe%40yahoogroups.com>
>>> -or-
>>> visit http://groups.yahoo.com/group/svg-developers and click "edit my
>> membership"
>>> ----Yahoo! Groups Links
>>> 
>>> 
>>> 
>> 
>> 
> 
> 
> [Non-text portions of this message have been removed]
> 
> 
> 
> ------------------------------------
> 
> -----
> To unsubscribe send a message to: [email protected]
> -or-
> visit http://groups.yahoo.com/group/svg-developers and click "edit my 
> membership"
> ----Yahoo! Groups Links
> 
> 
> 


------------------------------------

-----
To unsubscribe send a message to: [email protected]
-or-
visit http://groups.yahoo.com/group/svg-developers and click "edit my 
membership"
----Yahoo! Groups Links

<*> To visit your group on the web, go to:
    http://groups.yahoo.com/group/svg-developers/

<*> Your email settings:
    Individual Email | Traditional

<*> To change settings online go to:
    http://groups.yahoo.com/group/svg-developers/join
    (Yahoo! ID required)

<*> To change settings via email:
    [email protected] 
    [email protected]

<*> To unsubscribe from this group, send an email to:
    [email protected]

<*> Your use of Yahoo! Groups is subject to:
    http://docs.yahoo.com/info/terms/

Reply via email to