On May 10, 2004, at 4:31 AM, Cocula Remi wrote:
You have to write a special analyzer that include an accent filter. Then use this analyzer for both indexing and querying.
The issue, however, is that wildcard queries are not analyzed by QueryParser.
I think if your analyzer is working fine for regular term and phrase queries, that subclassing QueryParser and overriding getWildcardQuery to handle special processing (perhaps even analyzing if you are careful not to remove the wildcard characters themselves!) is the way to go.
Although pre-processing is a fine solution too if the accented characters should be removed in all query types.
Erik
-----Message d'origine----- De : Stephane James Vaucher [mailto:[EMAIL PROTECTED] Envoy� : lundi 10 mai 2004 10:05 � : Lucene Users List Objet : Analysis of wildcard queries
I've seen this: http://www.jguru.com/faq/view.jsp?EID=538312
I've seen in the code that there is a method to set lowercasing, but I
need to remove accentuated chars as well. Any suggestions as to which is
preferable, preprocessing the input or subclassing a QueryParser and
redefining getWildcardQuery?
cheers, sv
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
