I'll have a go at providing you with the info:
>> "How did u create the file?" Do you mean the html file? I just >> edited in vi and saved it in the html folder. >> >> Do you want the entire apache httpd.conf file? > > > the output from > > wget --spider -S http://localhost > (or whatever your webserver is found with) > wget --spider -S http://www.joffer.net --00:14:15-- http://www.joffer.net/ => `index.html' Resolving www.joffer.net... 10.10.20.2 Connecting to www.joffer.net|10.10.20.2|:80... connected. HTTP request sent, awaiting response... HTTP/1.1 200 OK Date: Fri, 21 Oct 2005 22:14:15 GMT Server: Apache Last-Modified: Fri, 14 Oct 2005 00:16:40 GMT ETag: "1aa6a9-6d4-c5820a00" Accept-Ranges: bytes Content-Length: 1748 Keep-Alive: timeout=15, max=100 Connection: Keep-Alive Content-Type: text/html; charset=UTF-8 Length: 1,748 (1.7K) [text/html] 200 OK > > and maybe the info what kind of encoding you think it is, and if it > was a remote vi, so when you type a ö (as I do in german Ä,ß etc.) or > some other local char, then the encoding of the remote (probably ssh) > machine gets transferred in what the terminal program thinks is > right, then written to disk on the server, and then played out by > apache, with whatever apache thinks what kind of encoding it is. > so please include the part of the html file outputted by hexdump, but > please excuse that I don't have the hexdump commandline ready to > display the decimal or hex value next to every char/line of output. > then we know for sure what kind of encoding has been written to the > file. > Well. My special norwegian charactes (æ ø å Æ Ø Å) shows up just fine on the command line using putty, but not when I'm editing a file using mc (midnight commander). If I understand you correct, you are trying to say that my system probably stores files as ISO8859-1 format? Where can I see what my system uses? Here is a picture of what a simple page look like when I use UTF-8 in apache config: http://www.joffer.net/webshare/images/apache-utf8-with-opera-with-encoding-set-to-auto.jpg "http://www.joffer.net/testutf8.html" should have read: <snip> Can I write my norwegian characters: æ ø å Æ Ø Å </snip> Hexdump on the file: $ hexdump testutf8.html 0000000 213c 6f64 7463 7079 2065 7468 6c6d 7020 0000010 6275 696c 2063 2d22 2f2f 3357 2f43 442f 0000020 4454 4820 4d54 204c 2e34 2030 7254 6e61 0000030 6973 6974 6e6f 6c61 2f2f 4e45 3e22 3c0a 0000040 7468 6c6d 0a3e 683c 6165 3e64 3c0a 656d 0000050 6174 6820 7474 2d70 7165 6975 3d76 4322 0000060 6e6f 6574 746e 542d 7079 2265 6320 6e6f 0000070 6574 746e 223d 6574 7478 682f 6d74 3b6c 0000080 6320 6168 7372 7465 553d 4654 382d 3e22 0000090 3c0a 6974 6c74 3e65 6574 7473 7520 6674 00000a0 3c38 742f 7469 656c 0a3e 2f3c 6568 6461 00000b0 0a3e 623c 646f 3e79 430a 6e61 4920 7720 00000c0 6972 6574 6d20 2079 6f6e 7772 6765 6169 00000d0 206e 6863 7261 6361 6574 7372 203a 20f8 00000e0 20e6 20e5 20c6 20d8 20c5 3c0a 622f 646f 00000f0 3e79 3c0a 682f 6d74 3e6c 000a 00000fb $ cat testutf8.html <!doctype html public "-//W3C//DTD HTML 4.0 Transitional//EN"> <html> <head> <meta http-equiv="Content-Type" content="text/html; charset=UTF-8"> <title>test utf8</title> </head> <body> Can I write my norwegian characters: ø æ å Æ Ø Å </body> </html> Some info I've gathered: Running 'locale' gives me this: ~$ locale LANG= LC_CTYPE="POSIX" LC_NUMERIC="POSIX" LC_TIME="POSIX" LC_COLLATE="POSIX" LC_MONETARY="POSIX" LC_MESSAGES="POSIX" LC_PAPER="POSIX" LC_NAME="POSIX" LC_ADDRESS="POSIX" LC_TELEPHONE="POSIX" LC_MEASUREMENT="POSIX" LC_IDENTIFICATION="POSIX" LC_ALL= "locale -a" gives me a long list, and I've got (what is important to me): .. en_US.iso88591 en_US.utf8 .. nb_NO.iso88591 nb_NO.utf8 .. en_US and nb_NO is the only one with ".utf8" I also used the "Validate" function in opera (it uses http://validator.w3.org/check# ...) and got this result: <snip> Result: Failed validation, File: opr0ALOR.htm Encoding: utf-8 Doctype: "Sorry, I am unable to validate this document because on line 24 it contained one or more bytes that I cannot interpret as utf-8 (in other words, the bytes found are not valid values in the specified Character Encoding). Please check both the content of the file and the character encoding indication." </snip> Hope some of the info was usefull.. I'm quite lost on using utf-8 obviously.. /Christopher _______________________________________________ tsl-discuss mailing list [email protected] http://lists.trustix.org/mailman/listinfo/tsl-discuss
