It does employ stemming, here is the AliasAnaylzer class. I will also create a sample program as Daniel had suggested.
import java.io.Reader; import org.apache.lucene.analysis.Analyzer; import org.apache.lucene.analysis.LowerCaseFilter; import org.apache.lucene.analysis.StopAnalyzer; import org.apache.lucene.analysis.StopFilter; import org.apache.lucene.analysis.TokenStream; import org.apache.lucene.analysis.PorterStemFilter; public class AliasAnalyzer extends Analyzer { public AliasAnalyzer() { }//end constructor public TokenStream tokenStream(String fieldName, Reader reader) { TokenStream result = new AliasFilter( new StopFilter( new LowerCaseFilter( new AlphanumericTokenizer(reader)), StopAnalyzer.ENGLISH_STOP_WORDS)); result = new PorterStemFilter(result); return result; }//end tokenStream }//end AliasAnalyzer -----Original Message----- From: "René Hackl" [mailto:[EMAIL PROTECTED] Sent: Monday, October 18, 2004 3:21 AM To: Lucene Developers List Subject: Re: Failure of wildcard search in the middle of a term Does your analyzer employ stemming? Maybe the ending gets ripped off and the word is indexed as 'calgarian' or even 'calgar'. Did you write the Analyzer yourself? I can't find it in any package to see whether it does stem tokens... Best Regards, René > I am using Lucene 1.4.1, I have indexed a PDF document. > > > <http://www.calgary.ca/docgallery/bu/finance/Striking_A_Balance_Web.pdf> http://www.calgary.ca/docgallery/bu/finance/Striking_A_Balance_Web.pdf > > In the document there is the word Calgarians. > > A search of Calgarians finds the document. > > A search of Calgar* finds the document. > > A search of Cal*s finds nothing. > > Using a standard query parser and an AliasAnalyzer , I know I have used > the wildcard in the middle of a search string before and it returned > documents. Is anyone else having this issue? or does anyone have an idea of the > cause? No errors are occurring just no results are being returned. > > -- GMX ProMail mit bestem Virenschutz http://www.gmx.net/de/go/mail +++ Empfehlung der Redaktion +++ Internet Professionell 10/04 +++ --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]