Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The following page has been changed by HossMan:
http://wiki.apache.org/solr/FAQ

------------------------------------------------------------------------------
  == Why don't International Characters Work? ==

- Solr can index any characters expressed in the UTF-8 charset (see [http://issues.apache.org/jira/browse/SOLR-96 SOLR-96]). There are no known bugs with Solr's character handling, but there have been some reported issues with the way different application servers (and different versions of the same application server) treat incoming and outgoing multibyte characters. In particular, people have reported better success with Tomcat then with Jetty.
+ Solr can index any characters expressed in the UTF-8 charset (see [http://issues.apache.org/jira/browse/SOLR-96 SOLR-96]). There are no known bugs with Solr's character handling, but there have been some reported issues with the way different application servers (and different versions of the same application server) treat incoming and outgoing multibyte characters. In particular, people have reported better success with Tomcat then with Jetty...
+
+  * "[http://www.nabble.com/International-Charsets-in-embedded-XML-tf1780147.html#a4897795 International Charsets in embedded XML]" (Jetty 5.1)
+  * "[http://www.nabble.com/Problem-with-surrogate-characters-in-utf-8-tf3920744.html Problem with surrogate characters in utf-8]" (Jetty 6)

  If you notice a problem with multibyte characters, the first step to ensure that it is not a true Solr bug would be to write a unit test that bypasses the application server directly using the [http://lucene.apache.org/solr/api/org/apache/solr/util/AbstractSolrTestCase.html AbstractSolrTestCase].
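
As a rough illustration of the kind of application-server problem described above, the following self-contained Java sketch shows what can happen when UTF-8 bytes are decoded with the wrong charset. It does not use Solr or AbstractSolrTestCase. The class name `CharsetRoundTrip` and its methods are hypothetical, and the choice of ISO-8859-1 as the wrong charset is an assumption about one possible container misconfiguration, not a statement about any specific Jetty or Tomcat version.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not part of Solr.
final class CharsetRoundTrip {

    /** Encodes the text as UTF-8 and decodes it as UTF-8 (the correct round trip). */
    static String decodeCorrectly(String text) {
        byte[] wire = text.getBytes(StandardCharsets.UTF_8);
        return new String(wire, StandardCharsets.UTF_8);
    }

    /**
     * Encodes the text as UTF-8 but decodes it as ISO-8859-1.
     * Assumption: this mimics a server that ignores the declared charset.
     */
    static String decodeAsLatin1(String text) {
        byte[] wire = text.getBytes(StandardCharsets.UTF_8);
        return new String(wire, StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        String accented = "caf\u00e9";      // U+00E9 takes 2 bytes in UTF-8
        String surrogate = "\uD834\uDD1E";  // U+1D11E: a surrogate pair in Java, 4 bytes in UTF-8

        for (String original : new String[] {accented, surrogate}) {
            // The correct round trip preserves the text; the mismatched one does not.
            System.out.println("UTF-8 round trip intact:     "
                    + original.equals(decodeCorrectly(original)));
            System.out.println("ISO-8859-1 decoding intact:  "
                    + original.equals(decodeAsLatin1(original)));
        }
    }
}
```

If text survives a direct test against Solr but arrives corrupted through the servlet container, a mismatch of this kind in the container's request or response handling is one plausible cause to check.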
