Thanks, Alexander and Kir, things are clearer now. I think that keeping
a separate entry page and using s.cgi only for the results is not the
best solution in the long term. Are there any pitfalls in simply modifying
the s.cgi code to work with Unicode, if I need a multi-language site?

     Gregory



-----Original Message-----
From: Kir Kolyshkin [mailto:[EMAIL PROTECTED]]
Sent: Monday, 3 June 2002 14:40
To: [EMAIL PROTECTED]
Subject: Re: [aseek-users] Search with umlauts


Gregory Kozlovsky wrote:
> 
> I don't quite understand. The search page and the results page are
> generated by s.cgi, not by me, aren't they?

Al is talking about the initial search page, which is quite often static
(for example, see http://apache.aspseek.org/).

> The s.cgi script sets the HTTP header in which it specifies
>     Content-Type: text/html; charset=iso-8859-1

If your request contains the 'cs=utf-8' parameter, the reply will also
be in the utf-8 charset, with the appropriate header. I have just checked
this on the www.aspseek.com site (my typing is quoted):

> telnet www.aspseek.com 80
Trying 128.241.21.30...
Connected to www.aspseek.com.
Escape character is '^]'.
> GET /cgi-bin/s.cgi?q=words&cs=utf-8 HTTP/1.0
> Site: www.aspseek.com
HTTP/1.1 200 OK
Date: Mon, 03 Jun 2002 12:30:32 GMT
Server: Apache/1.3.14 (Unix) mod_perl/1.24_01
Connection: close
Content-Type: text/html; charset=utf-8

<META HTTP-EQUIV="Content-type" CONTENT="text/html; charset=utf-8">
<HTML>
<HEAD>
 <TITLE>ASPseek: words</TITLE>
[....skipped for clarity....]
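
The same check can be scripted. Below is a minimal Python sketch, assuming
only what the session above shows: the 'q' and 'cs' parameters and the
/cgi-bin/s.cgi path. The helper names build_search_url and
charset_from_content_type are hypothetical, made up for this illustration,
and are not part of ASPseek.

```python
from email.message import Message
from urllib.parse import urlencode


def build_search_url(base_url, query, charset="utf-8"):
    """Build an s.cgi URL whose query is percent-encoded in `charset`.

    Hypothetical helper: 'q' and 'cs' are the only parameters taken
    from the session above.
    """
    params = urlencode({"q": query, "cs": charset}, encoding=charset)
    return f"{base_url}?{params}"


def charset_from_content_type(header_value):
    """Return the charset named in a Content-Type header value, or None."""
    msg = Message()
    msg["Content-Type"] = header_value
    return msg.get_content_charset()


if __name__ == "__main__":
    # The umlaut is sent as UTF-8 bytes, matching cs=utf-8.
    print(build_search_url("http://www.aspseek.com/cgi-bin/s.cgi", "Müller"))
    # Header value copied from the reply shown above.
    print(charset_from_content_type("text/html; charset=utf-8"))
```

The point of encoding the query in the same charset that 'cs' names is that
the form page and the request then agree on how umlauts are represented.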

> Is it possible to force s.cgi to change this behaviour via the config file

The default charset can probably also be set by specifying

LocalCharset utf-8

in the "variables" section of s.htm (but I haven't checked this).

> or
> will I need to rewrite s.cgi to search in many languages at once?
> 
>      Gregory
> 
> -----Original Message-----
> From: Alexander F Avdonkin [mailto:[EMAIL PROTECTED]]
> Sent: Monday, 3 June 2002 14:05
> To: [EMAIL PROTECTED]
> Subject: Re: [aseek-users] Search with umlauts
> 
> In this case you should use "cs=utf-8" and make the page with the search
> form in the utf-8 charset.

-- [EMAIL PROTECTED] ICQ UIN 7551596 Phone +7 903 6722750 --
   Guinness a Day Keeps a Doctor Away (people's wisdom)
