Hi,
Does anybody know how to set a character encoding
other than UTF-8, which seems to be the default in
Nutch 0.8.1 on Tomcat 5? (Ubuntu 6.10 / Tomcat 5.0)
What I have tried:
In <tomcat_root>/conf/web.xml
(in the jsp servlet section),
I added:
<init-param>
  <param-name>javaEncoding</param-name>
  <param-value>ISO-8859-1</param-value>
</init-param>
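For reference, here is a minimal sketch of where that parameter would sit inside the jsp servlet definition. The surrounding elements (servlet class, load-on-startup value) are assumed from a stock Tomcat 5 conf/web.xml and may differ on a given install. As far as the Tomcat documentation describes it, javaEncoding is the encoding Jasper uses for the Java source files it generates from JSPs, so it may not change the charset the browser sees.

```xml
<!-- Sketch only: surrounding elements assumed from a stock Tomcat 5 conf/web.xml -->
<servlet>
  <servlet-name>jsp</servlet-name>
  <servlet-class>org.apache.jasper.servlet.JspServlet</servlet-class>
  <!-- the parameter added above -->
  <init-param>
    <param-name>javaEncoding</param-name>
    <param-value>ISO-8859-1</param-value>
  </init-param>
  <load-on-startup>3</load-on-startup>
</servlet>
```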
In <tomcat_root>/webapps/ROOT/WEB-INF/web.xml
(in the <servlet-name>Cached</servlet-name> section),
I added:
<init-param>
  <param-name>javaEncoding</param-name>
  <param-value>ISO-8859-1</param-value>
</init-param>
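A sketch of how the Cached servlet entry would look with this change, to make the placement explicit. The servlet-class value is an assumption about the Nutch 0.8.1 web application and should be checked against the actual file. An init-param only has an effect if the servlet reads it, and whether Cached reads javaEncoding is not confirmed here.

```xml
<!-- Sketch only: servlet-class is assumed, verify against the deployed web.xml -->
<servlet>
  <servlet-name>Cached</servlet-name>
  <servlet-class>org.apache.nutch.servlet.Cached</servlet-class>
  <!-- the parameter added above -->
  <init-param>
    <param-name>javaEncoding</param-name>
    <param-value>ISO-8859-1</param-value>
  </init-param>
</servlet>
```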
I then stopped and restarted Tomcat (from the crawldir
folder of Nutch).
The browser keeps showing UTF-8 encoded pages, and
French special characters are being replaced with
wrong characters.
Any ideas?
Thanks