Tomcat is notorious for not defaulting to UTF-8 when decoding URLs. Since the query is passed in the URL, UTF-8 decoding is needed to preserve all these accented characters.

Tomcat's server.xml should have something like:

<Connector port="8080"
          protocol="HTTP/1.1"
          connectionTimeout="20000"
          URIEncoding="UTF-8"
          redirectPort="8443"/>

The URIEncoding="UTF-8" attribute is essential.
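
To see why this matters, here is a minimal sketch in Python (not Tomcat code, just an illustration of the same byte-level effect). It assumes the client percent-encodes the query as UTF-8, which is what browsers typically do, and that the server decodes it as ISO-8859-1, which as far as I know is what older Tomcat versions do when URIEncoding is not set. The variable names are mine, purely for illustration.

```python
from urllib.parse import quote, unquote

word = "enseñé"

# The client percent-encodes the UTF-8 bytes of the query term.
encoded = quote(word, encoding="utf-8")

# Assumption: the server decodes those bytes as ISO-8859-1.
# Each two-byte UTF-8 sequence turns into two wrong characters.
mangled = unquote(encoded, encoding="iso-8859-1")

# With UTF-8 decoding the original word comes back intact.
correct = unquote(encoded, encoding="utf-8")

print(encoded)  # ense%C3%B1%C3%A9
print(mangled)  # enseÃ±Ã©
print(correct)  # enseñé
```

The mangled form is what Solr would end up searching for, and it will not match anything in the index.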

-- Jack Krupansky

-----Original Message----- From: jignesh
Sent: Saturday, May 18, 2013 1:53 PM
To: solr-user@lucene.apache.org
Subject: Not able to search Spanish word with ascent in solr

I have installed Solr 3.5.
I would like to search for Spanish words like

-> enseñé
-> étnico
-> castaño
-> después

with accents (ñ, é, etc.).

But Solr does not find such words in the index.
I have used
-------------

   <fieldType name="text_es" class="solr.TextField" positionIncrementGap="100">
     <analyzer>
       <tokenizer class="solr.StandardTokenizerFactory"/>
       <filter class="solr.LowerCaseFilterFactory"/>
       <filter class="solr.StopFilterFactory" ignoreCase="true"
               words="lang/stopwords_es.txt" format="snowball"
               enablePositionIncrements="true"/>
       <filter class="solr.SpanishLightStemFilterFactory"/>

     </analyzer>
   </fieldType>
------------- in fields like:
<field name="name" type="text_es" indexed="true" stored="true"/>
<field name="features" type="text_es" indexed="true" stored="true" multiValued="true"/>

But I am still not able to search for Spanish words with accents.

Please let me know if I am missing anything?

Thanks



--
View this message in context: http://lucene.472066.n3.nabble.com/Not-able-to-search-Spanish-word-with-ascent-in-solr-tp4064404.html
Sent from the Solr - User mailing list archive at Nabble.com.
