Looks like I am stuck with charsets...
-------------------------------------------------------------
ASPseek v.1.2.10 is now configured as follows:
C++ compiler: GNU c++ v.2.95.4
Compilation flags: -g -O2 -D_REENTRANT
Installation path: /usr/local/aspseek
Supported database(s): MySQL
Features enabled:
* Limiting clones by site
* Fast clones lookup
* Support for external mime-type converter programs
* Store data in Unicode format
* Support for https:// secure protocol
-------------------------------------------------------------
My Server sends the correct Charset-Header:
~/> telnet localhost 80
Trying 127.0.0.1...
Connected to localhost.
Escape character is '^]'.
GET / HTTP/1.1
Host: www2.jugendpolitik.net
HTTP/1.1 200 OK
Date: Mon, 09 Dec 2002 13:12:01 GMT
Server: Apache/1.3.26 (Unix) Debian GNU/Linux PHP/4.1.2 mod_ssl/2.8.9
OpenSSL/0.9.6g
X-Powered-By: PHP/4.1.2
Transfer-Encoding: chunked
Content-Type: text/html; charset=iso-8859-1
However - if I search e.g. for "F�rdermittel" only the "F" gets marked
in the result - and it finds all pages including just an "F":
http://www2.jugendpolitik.net/cgi-bin/jugendpolitik.net-search.cgi?q=F%F6rdermittel
So I'd searched the docs, the lists, and the "knowledge" I collected more
than a year ago ;) and got reminded of "cs=":
http://www2.jugendpolitik.net/cgi-bin/jugendpolitik.net-search.cgi?q=F%F6rdermittel&cs=iso-8859-1
Then however the Umlauts that are included in the page don't get
displayed correctly (They are displayed as "?"). To explain what I am
talking of, I've done a little screenshot (temporary):
http://www.b-a-l-u.de/tmp/aspseek.jpg
I've also tried to modify etc/ucharset.conf, commented all
CharsetTableU1-lines and added "CharsetTableU1 iso-8859-1 de
tables/iso8859-1.txt"
Which didn't help either. Any suggestions where to look? Do I have to
re-index the pages or similar?
Balu
PS: Please send a Cc:, I am not subscribed to the list