Jon,

You'll need to set encoding to UTF-8. 
We don't use the default Nutch JSP pages, so I'm not sure if they have it or
not, but here's the simplified process.

1. make sure your JSP files have the something like this on top
<%@ page contentType="text/html; charset=utf-8" pageEncoding="utf-8"  

2. Your tomcat server.xml should have this line (URIEncoding="UTF-8")
     <Connector port="80"
               maxThreads="250" minSpareThreads="25" maxSpareThreads="75"
               enableLookups="false" redirectPort="8443" acceptCount="100"
               connectionTimeout="15000" disableUploadTimeout="180000"
URIEncoding="UTF-8" useBodyEncodingForURI="false" />

This should take care of it. 

Regards,
CC

--------------------------------------------
Filangy, Inc.
Interested in Improving Search? Join our Team!
http://filangy.com/jointheteam.jsp 



-----Original Message-----
From: J B [mailto:[EMAIL PROTECTED] 
Sent: Monday, May 30, 2005 1:46 PM
To: [email protected]
Subject: Searching with � and �?

Hello,

Is there anyone who can help me configure Nutch so that I can use it for
Swedics or German websites containing characters like "�" and "�"? Crawling
and indexing seems to work fine, it's just the searching that goes wrong. 
When I enter a searchstring like "K�ln", knowing that it appears in the
text, the resultpage says that there are no matching results, and the "�" is
replaced by random characters...

I have searched the docs and the web, but I can't find the answer to my
problem.

Best regards,

Jon

P.S. Sorry if two versions of this message reached the list, I am quite new
to this...

_________________________________________________________________
Chat: Ha en fest p� Habbo Hotel
http://habbohotel.msn.se/habbo/sv/channelizer Checka in h�r!



Reply via email to