Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change
notification.
The following page has been changed by RenaudRichardet:
http://wiki.apache.org/nutch/GettingNutchRunningWithUtf8
------------------------------------------------------------------------------
= How to Configure App Servers to Pass non-ASCII Characters? =
The Nutch GUI uses the GET method to pass query strings to the server.
Tomcat 4 and 5 must be configured before they will pass non-ASCII characters.
- This note describes how to make Tomcat pass non-ASCII characters.
Nutch, in its "factory set" configuration, handles only a limited set of
characters. In particular, it does not handle Chinese/Japanese/Korean text
properly. (Each CJK character is treated as if it were a word by itself.)
+ This note describes how to make Tomcat pass non-ASCII characters.
Nutch, in its "factory set" configuration, handles only a limited set of
characters. In particular, it does not handle Chinese/Japanese/Korean text
properly. (Each CJK character is treated as if it were a word by itself.)
German special characters (ö, ä, ü) are also displayed incorrectly.
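
To illustrate why a GET query string can arrive garbled, here is a minimal
sketch. It is not taken from the wiki page or from Nutch. It assumes the
browser percent-encodes the query as UTF-8 and that the server decodes it as
ISO-8859-1; the class and method names are hypothetical, and the
Charset-based overloads used here require Java 10 or later.

```java
import java.net.URLDecoder;
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class: shows how a charset mismatch corrupts a query term.
public class QueryEncodingDemo {

    // Percent-encode a query term as UTF-8 (assumed browser behaviour).
    static String encodeUtf8(String term) {
        return URLEncoder.encode(term, StandardCharsets.UTF_8);
    }

    // Decode with ISO-8859-1 (assumed behaviour of an unconfigured server).
    static String decodeLatin1(String encoded) {
        return URLDecoder.decode(encoded, StandardCharsets.ISO_8859_1);
    }

    // Decode with UTF-8 (what a correctly configured server would do).
    static String decodeUtf8(String encoded) {
        return URLDecoder.decode(encoded, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String term = "\u00f6";                   // the letter o with umlaut
        String onTheWire = encodeUtf8(term);      // two percent-encoded bytes
        System.out.println(onTheWire);
        // Mismatched decoding yields two characters instead of one.
        System.out.println(decodeLatin1(onTheWire).length());
        // Matching decoding recovers the original term.
        System.out.println(decodeUtf8(onTheWire).equals(term));
    }
}
```

With mismatched charsets the single character comes back as two unrelated
characters, which matches the kind of wrong display described above for
German umlauts. Decoding with the same charset that was used for encoding
restores the original term.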
== Tomcat 4 and Tomcat 5 ==
_______________________________________________
Nutch-cvs mailing list
Nutch-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-cvs