Thx Ren� that's very Helpfull !!!

But I got an error in the code :

String s = stemmer.stem(token.termText());

The stem method uses a boolean argument, and not a string...

any Idea ?

-----Original Message-----
From: Ren� Ferr�ro [mailto:[EMAIL PROTECTED]
Sent: mercredi 19 mars 2003 22:20
To: Lucene Users List; [EMAIL PROTECTED]
Subject: RE: Full French Analyser ?


Hi Pierre,

I did the same thing some time ago. Here are the
highlights :

1- Create a FrenchStemFilter class that extends
TokenFilter

import net.sf.snowball.ext.frenchStemmer;

/**
 * Constructor for SnowballFrenchStemFilter.
 */

public FrenchStemFilter(TokenStream in)
{
        stemmer = new frenchStemmer();
        input = in;
}

public final Token next() throws IOException
{
        Token token = input.next();
        if (token == null)
                return null;
        else
        {
                String s = stemmer.stem(token.termText());
                // If not stemmed, dont waste the time creating a
new token
                if (!s.equals(token.termText()))
                        return new Token(s, token.startOffset(),
token.endOffset(), token.type());
        }
        return token;
}

2- Finally create a FrenchAnalyzer that returns a
TokenStream whose tokens are filtered by the previous
stemmer.

Hope that it can help you.

 --- Pierre Lacchini <[EMAIL PROTECTED]> a �crit : > Ok
thx !!! That is exactly what i was looking for...
>
> But how can i use it ?
> (sorry i'm kinda noob in Java)...
>
> The snowball.JAR has been added to my project, but
> now i dunno how to use
> it...
>
> -----Original Message-----
> From: Alex Murzaku [mailto:[EMAIL PROTECTED]
> Sent: mercredi 19 mars 2003 15:49
> To: 'Lucene Users List'; [EMAIL PROTECTED]
> Subject: RE: Full French Analyser ?
>
>
> You can find Danish, Dutch, English, Finnish,
> French, German, Italian,
> Norwegian, Portuguese, Russian, Spanish and Swedish
> Snowball
> stemmers/analyzers at:
>
http://jakarta.apache.org/lucene/docs/lucene-sandbox/snowball/
>
> Doug or Otis, why don't you move these out of the
> sandbox and make them
> integral part of Lucene?
>
> --
> Alex Murzaku
> ___________________________________________
>  alex(at)lissus.com  http://www.lissus.com
>
> -----Original Message-----
> From: Pierre Lacchini [mailto:[EMAIL PROTECTED]
> Sent: Wednesday, March 19, 2003 10:02 AM
> To: Lucene (E-mail)
> Subject: Full French Analyser ?
>
>
> Heya all,
>
> I'm looking for a full French Analyser, containing a
> FrenchPorterStemmer... Does anyone know where i can
> find one ?
>
> And if I wanna create my own FrenchAnalyser  - I
> have the STOP_WORDS
> list - can I remove the standard PorterStemFilter ?
>
> In fact, can I crete a new Analyser without
> PorterStemmer at all ?
>
> Thx ;)
>
>
>
>
>
---------------------------------------------------------------------
> To unsubscribe, e-mail:
> [EMAIL PROTECTED]
> For additional commands, e-mail:
> [EMAIL PROTECTED]
>
>
>
---------------------------------------------------------------------
> To unsubscribe, e-mail:
> [EMAIL PROTECTED]
> For additional commands, e-mail:
> [EMAIL PROTECTED]
>

___________________________________________________________
Do You Yahoo!? -- Une adresse @yahoo.fr gratuite et en fran�ais !
Yahoo! Mail : http://fr.mail.yahoo.com


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to