Tomcat is notorious for not defaulting to UTF-8 encoding for URLs which is
how the query is passed, which is needed to preserve all these accented
characters.
In Tomcat's server.xml, it should have something like:
<Connector port="8080"
protocol="HTTP/1.1"
connectionTimeout="20000"
URIEncoding="UTF-8"
redirectPort="8443"/>
The "URIEncoding="UTF-8"" is essential.
-- Jack Krupansky
-----Original Message-----
From: jignesh
Sent: Saturday, May 18, 2013 1:53 PM
To: solr-user@lucene.apache.org
Subject: Not able to search Spanish word with ascent in solr
I have install solr 3.5
I would like to search words(Spanish words) like
-> enseñé
-> étnico
-> castaño
-> después
with ascent ñ,é etc.
But solr is not search such words from index.
I have used
-------------
<fieldType name="text_es" class="solr.TextField"
positionIncrementGap="100">
<analyzer>
<tokenizer class="solr.StandardTokenizerFactory"/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.StopFilterFactory" ignoreCase="true"
words="lang/stopwords_es.txt" format="snowball"
enablePositionIncrements="true"/>
<filter class="solr.SpanishLightStemFilterFactory"/>
</analyzer>
</fieldType>
-------------
like :
<field name="name" type="text_es" indexed="true" stored="true"/>
<field name="features" type="text_es" indexed="true" stored="true"
multiValued="true"/>
But still not able to search Spanish word with ascent..
Please let me know if I am missing anything?
Thanks
--
View this message in context:
http://lucene.472066.n3.nabble.com/Not-able-to-search-Spanish-word-with-ascent-in-solr-tp4064404.html
Sent from the Solr - User mailing list archive at Nabble.com.