It does employ stemming, here is the AliasAnaylzer class.  I will also create a sample 
program as Daniel had suggested.  



import java.io.Reader;
import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.analysis.LowerCaseFilter;
import org.apache.lucene.analysis.StopAnalyzer;
import org.apache.lucene.analysis.StopFilter;
import org.apache.lucene.analysis.TokenStream;
import org.apache.lucene.analysis.PorterStemFilter;
 
public class AliasAnalyzer extends Analyzer {
  public AliasAnalyzer() {
  }//end constructor

  public TokenStream tokenStream(String fieldName, Reader reader) {
    TokenStream result = new AliasFilter(
                           new StopFilter(
                             new LowerCaseFilter(
                               new AlphanumericTokenizer(reader)),
                                    StopAnalyzer.ENGLISH_STOP_WORDS));
    result = new PorterStemFilter(result);
    return result; 
  }//end tokenStream
   
}//end AliasAnalyzer



-----Original Message-----
From: "René Hackl" [mailto:[EMAIL PROTECTED]
Sent: Monday, October 18, 2004 3:21 AM
To: Lucene Developers List
Subject: Re: Failure of wildcard search in the middle of a term


Does your analyzer employ stemming? Maybe the ending gets ripped off and the
word is indexed as 'calgarian' or even 'calgar'.

Did you write the Analyzer yourself? I can't find it in any package to see
whether it does stem tokens...

Best Regards,
René



> I am using Lucene 1.4.1, I have indexed a PDF document.
> 
>   
> <http://www.calgary.ca/docgallery/bu/finance/Striking_A_Balance_Web.pdf>
http://www.calgary.ca/docgallery/bu/finance/Striking_A_Balance_Web.pdf 
> 
> In the document there is the word Calgarians. 
> 
> A search of Calgarians finds the document.
> 
> A search of Calgar* finds the document.
> 
> A search of Cal*s finds nothing. 
> 
> Using a standard query parser and an AliasAnalyzer , I know I have used
> the wildcard in the middle of a search string before and it returned
> documents. Is anyone else having this issue? or does anyone have an idea
of the
> cause?  No errors are occurring just no results are being returned.  
> 
> 

-- 
GMX ProMail mit bestem Virenschutz http://www.gmx.net/de/go/mail
+++ Empfehlung der Redaktion +++ Internet Professionell 10/04 +++


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to