On Thu, 3 Oct 2002, Reuven M. Lerner wrote:

> >>>>> "Tzafrir" == Tzafrir Cohen <[EMAIL PROTECTED]> writes:
>
>   Tzafrir> There are a number of incompatible ways to encode Hebrew
>   Tzafrir> (e.g: ISO-8859-8/visual, cp1255/logical and UTF-8). So
>   Tzafrir> simply saying that the language is Hebrew is not enough
>   Tzafrir> for the browser.
>
> No one is suggesting that the HTTP Content-type header should indicate
> a language.Rather, the Content-type header indicates a character
> set, and in some places an encoding as well.For example:
>
>   Content-type: text/html; charset=utf-8        # Unicode, UTF-8
>   Content-type: text/html; charset=iso-8859-8   # Hebrew + English
>   Content-type: text/html; charset=windows-1255 # Heb/Eng Windows encoding
>
> A browser receiving the above headers knows not only that the content
> is in Hebrew and English (or in Unicode), but also what the encoding
> is (and thus how to display it).
>
> Naming the encoding explicitly would only be a problem on a site that
> uses multiple encodings.For example, if your site has some pages in
> UTF-8 and others in Latin-1 and still others in ISO-8859-8, then you
> would be in trouble.But in such a (rare) case, you can remove the
> default encoding and allow people to use meta tags.

Quite rare indeed. The default configuration of apache (or at least: the
one that comes with Mandrake):

http://www.gadot.org.il/manual/

http://www.gadot.org.il/manual/index.html.en
http://www.gadot.org.il/manual/index.html.ja.jis

Not to mention all sorts of sites with multi-charset content.

>
> On a site that has a single, consistent encoding, it's nice to have
> the server take care of such things for you, avoiding the need for
> meta tags.

(you mean: set everything to UTF-8)

>
>   Tzafrir> Setting a server-wide default to a certain charset is
>   Tzafrir> certainly not a wise default. I would consider it a
>   Tzafrir> misconfiguration of RH's side.
>
> Given that this is the W3C's preferred way of doing things, it strikes
> me as a pretty reasonable approach, actually.The implied default of
> Latin-1 strikes me as pretty short-sighted in a world where a large
> (and growing) number of Internet users come from outside of the US.

So you have my opinion about this standard, and how predictable it leaves
things

-- 
Tzafrir Cohen
mailto:[EMAIL PROTECTED]
http://www.technion.ac.il/~tzafrir



=================================================================
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word "unsubscribe" in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]

Reply via email to