Looks like I am stuck with charsets... 
-------------------------------------------------------------
        ASPseek v.1.2.10 is now configured as follows:

C++ compiler:             GNU c++ v.2.95.4
Compilation flags:        -g -O2 -D_REENTRANT
Installation path:        /usr/local/aspseek
Supported database(s):    MySQL
Features enabled:
 * Limiting clones by site
 * Fast clones lookup
 * Support for external mime-type converter programs
 * Store data in Unicode format
 * Support for https:// secure protocol
-------------------------------------------------------------

My Server sends the correct Charset-Header:

~/> telnet localhost 80
Trying 127.0.0.1...
Connected to localhost.
Escape character is '^]'.
GET / HTTP/1.1
Host: www2.jugendpolitik.net

HTTP/1.1 200 OK
Date: Mon, 09 Dec 2002 13:12:01 GMT
Server: Apache/1.3.26 (Unix) Debian GNU/Linux PHP/4.1.2 mod_ssl/2.8.9
OpenSSL/0.9.6g
X-Powered-By: PHP/4.1.2
Transfer-Encoding: chunked
Content-Type: text/html; charset=iso-8859-1

However - if I search e.g. for "F�rdermittel" only the "F" gets marked
in the result - and it finds all pages including just an "F":
http://www2.jugendpolitik.net/cgi-bin/jugendpolitik.net-search.cgi?q=F%F6rdermittel

So I'd searched the docs, the lists, and the "knowledge" I collected more
than a year ago ;) and got reminded of "cs=":
http://www2.jugendpolitik.net/cgi-bin/jugendpolitik.net-search.cgi?q=F%F6rdermittel&cs=iso-8859-1

Then however the Umlauts that are included in the page don't get
displayed correctly (They are displayed as "?"). To explain what I am
talking of, I've done a little screenshot (temporary):
http://www.b-a-l-u.de/tmp/aspseek.jpg

I've also tried to modify etc/ucharset.conf, commented all
CharsetTableU1-lines and added "CharsetTableU1 iso-8859-1 de
tables/iso8859-1.txt"

Which didn't help either. Any suggestions where to look? Do I have to
re-index the pages or similar?

     Balu
PS: Please send a Cc:, I am not subscribed to the list

Reply via email to