Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change 
notification.

The following page has been changed by HossMan:
http://wiki.apache.org/solr/FAQ

------------------------------------------------------------------------------
  
  == Why don't International Characters Work? ==
  
- Solr can index any characters expressed in the UTF-8 charset (see 
[http://issues.apache.org/jira/browse/SOLR-96 SOLR-96]). There are no known 
bugs with Solr's character handling, but there have been some reported issues 
with the way different application servers (and different versions of the same 
application server) treat incoming and outgoing multibyte characters.  In 
particular, people have reported better success with Tomcat then with Jetty.
+ Solr can index any characters expressed in the UTF-8 charset (see 
[http://issues.apache.org/jira/browse/SOLR-96 SOLR-96]). There are no known 
bugs with Solr's character handling, but there have been some reported issues 
with the way different application servers (and different versions of the same 
application server) treat incoming and outgoing multibyte characters.  In 
particular, people have reported better success with Tomcat then with Jetty...
+ 
+  * 
"[http://www.nabble.com/International-Charsets-in-embedded-XML-tf1780147.html#a4897795
 International Charsets in embedded XML]" (Jetty 5.1)
+  * 
"[http://www.nabble.com/Problem-with-surrogate-characters-in-utf-8-tf3920744.html
 Problem with surrogate characters in utf-8]" (Jetty 6)
  
  If you notice a problem with multibyte characters, the first step to ensure 
that it is not a true Solr bug would be to write a unit test that bypasses the 
application server directly using the 
[http://lucene.apache.org/solr/api/org/apache/solr/util/AbstractSolrTestCase.html
 AbstractSolrTestCase].
  

Reply via email to