Hi Jose,

Seems there is a corresponding filter in Lucene proper:

http://lucene.apache.org/java/2_4_0/api/org/apache/lucene/analysis/ISOLatin1AccentFilter.html

We use xmlui, rather than jspui though, so I have no idea how you'd
configure it in the jspui setup.

Sorry,
Kevin


On Fri, Sep 3, 2010 at 2:16 PM, Blanco, Jose <[email protected]> wrote:
> Kevin:
>
> This is for the solr/statistics webapp, but what about the jspui webapp that 
> one uses lucene.
>
> Thanks!
> Jose
>
> -----Original Message-----
> From: Kevin S. Clarke [mailto:[email protected]]
> Sent: Monday, August 23, 2010 4:35 PM
> To: Blanco, Jose
> Cc: [email protected]
> Subject: Re: [Dspace-tech] searching with diacritics
>
> There is a ISOLatin1AccentFilterFactory that does this.  To use it you'd add
>
> <filter class="solr.ISOLatin1AccentFilterFactory"/>
>
> to your
>
> <fieldType name="text" class="solr.TextField" positionIncrementGap="100">
>    <analyzer type="index">
>          <filter class="solr.ISOLatin1AccentFilterFactory"/>
> and
>
>     <analyzer type="query">
>         <filter class="solr.ISOLatin1AccentFilterFactory"/>
>
> Though looking at our solr config it doesn't look like we have it in
> there either.  Not sure if there is a reason for not including it.
>
> Perhaps (I'm just now looking up the docs for it) because it's been deprecated
>  Cf. 
> http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ISOLatin1AccentFilterFactory)
>
> Still, I don't see the newer version solr.ASCIIFoldingFilterFactory in
> our schema.xml either.
>
> Think I'll put it in, reindex, and test.
>
> Kevin
>
>
>
> On Mon, Aug 23, 2010 at 11:41 AM, Blanco, Jose <[email protected]> wrote:
>> I noticed that in my instance if I search for:
>>
>> Jose ( without an accent )
>>
>> I get all the results with Jose ( without an accent ).
>>
>> And if I search for
>>
>> Jose ( with an accent on the e )
>>
>> I get all the instances that have Jose with accent.
>>
>> My instance is setup with English as the default language, so I thought I 
>> experiment in my development environment and change it to French 
>> (search.analyzer = org.apache.lucene.analysis.fr.FrenchAnalyzer).  I did an 
>> index-init after changing it to French.  But when I tried the search again, 
>> I got the same results.
>>
>> I found that this instace http://riuma.uma.es/xmlui , and it works as I 
>> expected.  Searching for Jose ( without the accent ), finds all Jose's 
>> regardless of whether they have accents or not.  Is there a way to map 
>> diacritics?  It would be great for to be able to search Latin 1 chars with 
>> or without accents.
>>
>> Thank you!
>> Jose
>>
>>
>>
>> ------------------------------------------------------------------------------
>> This SF.net email is sponsored by
>>
>> Make an app they can't live without
>> Enter the BlackBerry Developer Challenge
>> http://p.sf.net/sfu/RIM-dev2dev
>> _______________________________________________
>> DSpace-tech mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>>
>
> ------------------------------------------------------------------------------
> Sell apps to millions through the Intel(R) Atom(Tm) Developer Program
> Be part of this innovative community and reach millions of netbook users
> worldwide. Take advantage of special opportunities to increase revenue and
> speed time-to-market. Join now, and jumpstart your future.
> http://p.sf.net/sfu/intel-atom-d2d
> _______________________________________________
> DSpace-tech mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>

------------------------------------------------------------------------------
This SF.net Dev2Dev email is sponsored by:

Show off your parallel programming skills.
Enter the Intel(R) Threading Challenge 2010.
http://p.sf.net/sfu/intel-thread-sfd
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to