    **********************************************
    **      THIS IS A WARNING MESSAGE ONLY      **
    **  YOU DO NOT NEED TO RESEND YOUR MESSAGE  **
    **********************************************

The original message was received at Tue, 20 Mar 2001 09:29:23 -0800 (PST)
from unicode2.apple.com [17.254.3.212]

   ----- The following addresses had transient non-fatal errors -----
<[EMAIL PROTECTED]>

   ----- Transcript of session follows -----
<[EMAIL PROTECTED]>... Deferred: Connection timed out with marril.com.
Warning: message still undelivered after 4 hours
Will keep trying until message is 4 days old


-- Attached file included as plaintext by Listar --

Reporting-MTA: dns; bz2.apple.com
Arrival-Date: Tue, 20 Mar 2001 09:29:23 -0800 (PST)

Final-Recipient: RFC822; [EMAIL PROTECTED]
Action: delayed
Status: 4.4.1
Remote-MTA: DNS; marril.com
Last-Attempt-Date: Tue, 20 Mar 2001 13:32:33 -0800 (PST)
Will-Retry-Until: Sat, 24 Mar 2001 09:29:23 -0800 (PST)


-- Attached file included as plaintext by Listar --

Return-Path: <[EMAIL PROTECTED]>
Received: from unicode.org (unicode2.apple.com [17.254.3.212])
        by bz2.apple.com (8.9.3/8.9.3) with ESMTP id JAA10090;
        Tue, 20 Mar 2001 09:29:23 -0800 (PST)
Received: (from agent@localhost)
        by unicode.org (8.9.3/8.9.3) id IAA23734;
        Tue, 20 Mar 2001 08:59:05 -0800 (GMT-0800)
Message-Id: <[EMAIL PROTECTED]>
Errors-To: [EMAIL PROTECTED]
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
X-UML-Sequence: 18856 (2001-03-20 16:56:05 GMT)
From: Markus Scherer <[EMAIL PROTECTED]>
To: "Unicode List" <[EMAIL PROTECTED]>
Date: Tue, 20 Mar 2001 08:56:04 -0800 (GMT-0800)
Subject: Re: Unicode encoding forms in web development

For HTML, use UTF-8. For XML, use UTF-8 or UTF-16.
US-ASCII and ISO 8859-1 are also acceptable, either if the characters you actually need 
are limited to their repertoires or if you use numeric character references for the rest.
If you know the sender and receiver and you have a low-bandwidth application, consider 
SCSU.
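
To illustrate the numeric character reference option above, here is a minimal Python sketch. It assumes Python 3 and uses a made-up sample string; the standard `xmlcharrefreplace` error handler turns every character outside US-ASCII into a decimal reference, and `html.unescape` is used only to show that the text survives the round trip.

```python
import html

# Hypothetical sample text: "Grüße, 世界"
text = "Gr\u00fc\u00dfe, \u4e16\u754c"

# Keep the byte stream pure US-ASCII; anything else becomes &#NNNN;
ascii_bytes = text.encode("ascii", errors="xmlcharrefreplace")
print(ascii_bytes)  # b'Gr&#252;&#223;e, &#19990;&#30028;'

# A receiver that resolves the references gets the original text back.
restored = html.unescape(ascii_bytes.decode("ascii"))
print(restored == text)  # True
```

The price is size and readability of the source: each non-ASCII character costs several bytes, so this suits documents that are mostly ASCII.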

See the Unicode FAQ with its recommendations.

See more specific comments below.

markus

Michel Paul wrote:
> 1- W3C recognized the benefits of Unicode character
> set by enforcing it HTML and XML. BUT they also did
> not enforce the Unicode encoding forms. Any character
> encoding form can be used.

Right, but:
For anything not Unicode/US-ASCII/ISO 8859-1, you will need character conversion 
tables. Such tables are poorly standardized, are a maintenance nightmare, and use a 
lot of space. You always have the danger of losing text because the table is not 
precisely the same in the sender and receiver processes, or because the encoding model 
is different and is not transformed by the conversion.
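
As a small illustration of the table-mismatch risk, the following Python sketch decodes the same two bytes with two codecs that are both commonly labelled "Shift-JIS". The choice of Python's `shift_jis` and `cp932` codecs as stand-ins for the sender's and receiver's tables is an assumption made for the example, not something from the original discussion.

```python
raw = b"\x81\x60"  # one double-byte character in a Shift-JIS byte stream

as_jis = raw.decode("shift_jis")  # JIS X 0208-based table
as_cp932 = raw.decode("cp932")    # Microsoft's variant of the table

# Same bytes, two different Unicode characters.
print(f"U+{ord(as_jis):04X}", f"U+{ord(as_cp932):04X}")  # U+301C U+FF5E
print(as_jis == as_cp932)  # False
```

If the sender encodes with one table and the receiver decodes with the other, the character silently changes identity, which is the kind of loss described above.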

> 2- Since there is more than one Unicode encoding form,
> its declaration/identification (charset, BOM, ...) is
> still compulsory. Then why not using any other
> character encoding form?

See above. Conversion between any of the Unicode encoding forms (and to or from 
US-ASCII/ISO 8859-1) is simple, fast, and algorithmic (without tables).
Which of the dozen or so Shift-JIS tables in the industry are you using?
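
To show what "algorithmic, without tables" means in practice, here is a rough Python sketch that converts UTF-16 code units to UTF-8 using only comparisons, shifts, and masks. The function names are made up for this example, and the sketch leaves out byte-order handling; real code would normally just use the built-in codecs.

```python
def utf16_units_to_code_points(units):
    """Combine 16-bit UTF-16 code units into code points (surrogate arithmetic only)."""
    it = iter(units)
    for u in it:
        if 0xD800 <= u <= 0xDBFF:  # lead surrogate: must be followed by a trail
            trail = next(it, None)
            if trail is None or not 0xDC00 <= trail <= 0xDFFF:
                raise ValueError("unpaired lead surrogate")
            yield 0x10000 + ((u - 0xD800) << 10) + (trail - 0xDC00)
        elif 0xDC00 <= u <= 0xDFFF:
            raise ValueError("unpaired trail surrogate")
        else:
            yield u


def code_point_to_utf8(cp):
    """Encode one code point as 1 to 4 UTF-8 bytes."""
    if cp < 0x80:
        return bytes([cp])
    if cp < 0x800:
        return bytes([0xC0 | (cp >> 6), 0x80 | (cp & 0x3F)])
    if cp < 0x10000:
        return bytes([0xE0 | (cp >> 12),
                      0x80 | ((cp >> 6) & 0x3F),
                      0x80 | (cp & 0x3F)])
    return bytes([0xF0 | (cp >> 18),
                  0x80 | ((cp >> 12) & 0x3F),
                  0x80 | ((cp >> 6) & 0x3F),
                  0x80 | (cp & 0x3F)])


def utf16_to_utf8(units):
    """Convert a sequence of UTF-16 code units to UTF-8 bytes."""
    return b"".join(code_point_to_utf8(cp)
                    for cp in utf16_units_to_code_points(units))


# Check against the built-in codecs with a sample covering all four UTF-8 lengths.
sample = "A\u00e9\u20ac\U0001d11e"
be = sample.encode("utf-16-be")
units = [int.from_bytes(be[i:i + 2], "big") for i in range(0, len(be), 2)]
print(utf16_to_utf8(units) == sample.encode("utf-8"))  # True
```

The whole conversion fits in a few dozen lines and needs no mapping data, which is the contrast with legacy charsets such as Shift-JIS.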

> 3- Authoring and development tools have a better
> support of "local" character encoding forms (non
> Unicode ones). That is why the vast majority of web
> pages do not use Utf-8, 16 or 32.

_Useful_ authoring tools will at least support UTF-8 and UTF-16.


