Hi Jose, Seems there is a corresponding filter in Lucene proper:
http://lucene.apache.org/java/2_4_0/api/org/apache/lucene/analysis/ISOLatin1AccentFilter.html We use xmlui, rather than jspui though, so I have no idea how you'd configure it in the jspui setup. Sorry, Kevin On Fri, Sep 3, 2010 at 2:16 PM, Blanco, Jose <[email protected]> wrote: > Kevin: > > This is for the solr/statistics webapp, but what about the jspui webapp that > one uses lucene. > > Thanks! > Jose > > -----Original Message----- > From: Kevin S. Clarke [mailto:[email protected]] > Sent: Monday, August 23, 2010 4:35 PM > To: Blanco, Jose > Cc: [email protected] > Subject: Re: [Dspace-tech] searching with diacritics > > There is a ISOLatin1AccentFilterFactory that does this. To use it you'd add > > <filter class="solr.ISOLatin1AccentFilterFactory"/> > > to your > > <fieldType name="text" class="solr.TextField" positionIncrementGap="100"> > <analyzer type="index"> > <filter class="solr.ISOLatin1AccentFilterFactory"/> > and > > <analyzer type="query"> > <filter class="solr.ISOLatin1AccentFilterFactory"/> > > Though looking at our solr config it doesn't look like we have it in > there either. Not sure if there is a reason for not including it. > > Perhaps (I'm just now looking up the docs for it) because it's been deprecated > Cf. > http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ISOLatin1AccentFilterFactory) > > Still, I don't see the newer version solr.ASCIIFoldingFilterFactory in > our schema.xml either. > > Think I'll put it in, reindex, and test. > > Kevin > > > > On Mon, Aug 23, 2010 at 11:41 AM, Blanco, Jose <[email protected]> wrote: >> I noticed that in my instance if I search for: >> >> Jose ( without an accent ) >> >> I get all the results with Jose ( without an accent ). >> >> And if I search for >> >> Jose ( with an accent on the e ) >> >> I get all the instances that have Jose with accent. >> >> My instance is setup with English as the default language, so I thought I >> experiment in my development environment and change it to French >> (search.analyzer = org.apache.lucene.analysis.fr.FrenchAnalyzer). I did an >> index-init after changing it to French. But when I tried the search again, >> I got the same results. >> >> I found that this instace http://riuma.uma.es/xmlui , and it works as I >> expected. Searching for Jose ( without the accent ), finds all Jose's >> regardless of whether they have accents or not. Is there a way to map >> diacritics? It would be great for to be able to search Latin 1 chars with >> or without accents. >> >> Thank you! >> Jose >> >> >> >> ------------------------------------------------------------------------------ >> This SF.net email is sponsored by >> >> Make an app they can't live without >> Enter the BlackBerry Developer Challenge >> http://p.sf.net/sfu/RIM-dev2dev >> _______________________________________________ >> DSpace-tech mailing list >> [email protected] >> https://lists.sourceforge.net/lists/listinfo/dspace-tech >> > > ------------------------------------------------------------------------------ > Sell apps to millions through the Intel(R) Atom(Tm) Developer Program > Be part of this innovative community and reach millions of netbook users > worldwide. Take advantage of special opportunities to increase revenue and > speed time-to-market. Join now, and jumpstart your future. > http://p.sf.net/sfu/intel-atom-d2d > _______________________________________________ > DSpace-tech mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/dspace-tech > ------------------------------------------------------------------------------ This SF.net Dev2Dev email is sponsored by: Show off your parallel programming skills. Enter the Intel(R) Threading Challenge 2010. http://p.sf.net/sfu/intel-thread-sfd _______________________________________________ DSpace-tech mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dspace-tech

