Hi Pierre,
I did the same thing some time ago. Here are the
highlights :
1- Create a FrenchStemFilter class that extends
TokenFilter
import net.sf.snowball.ext.frenchStemmer;
/**
* Constructor for SnowballFrenchStemFilter.
*/
public FrenchStemFilter(TokenStream in)
{
stemmer = new frenchStemmer();
input = in;
}
public final Token next() throws IOException
{
Token token = input.next();
if (token == null)
return null;
else
{
String s = stemmer.stem(token.termText());
// If not stemmed, dont waste the time creating a
new token
if (!s.equals(token.termText()))
return new Token(s, token.startOffset(),
token.endOffset(), token.type());
}
return token;
}
2- Finally create a FrenchAnalyzer that returns a
TokenStream whose tokens are filtered by the previous
stemmer.
Hope that it can help you.
--- Pierre Lacchini <[EMAIL PROTECTED]> a �crit : > Ok
thx !!! That is exactly what i was looking for...
>
> But how can i use it ?
> (sorry i'm kinda noob in Java)...
>
> The snowball.JAR has been added to my project, but
> now i dunno how to use
> it...
>
> -----Original Message-----
> From: Alex Murzaku [mailto:[EMAIL PROTECTED]
> Sent: mercredi 19 mars 2003 15:49
> To: 'Lucene Users List'; [EMAIL PROTECTED]
> Subject: RE: Full French Analyser ?
>
>
> You can find Danish, Dutch, English, Finnish,
> French, German, Italian,
> Norwegian, Portuguese, Russian, Spanish and Swedish
> Snowball
> stemmers/analyzers at:
>
http://jakarta.apache.org/lucene/docs/lucene-sandbox/snowball/
>
> Doug or Otis, why don't you move these out of the
> sandbox and make them
> integral part of Lucene?
>
> --
> Alex Murzaku
> ___________________________________________
> alex(at)lissus.com http://www.lissus.com
>
> -----Original Message-----
> From: Pierre Lacchini [mailto:[EMAIL PROTECTED]
> Sent: Wednesday, March 19, 2003 10:02 AM
> To: Lucene (E-mail)
> Subject: Full French Analyser ?
>
>
> Heya all,
>
> I'm looking for a full French Analyser, containing a
> FrenchPorterStemmer... Does anyone know where i can
> find one ?
>
> And if I wanna create my own FrenchAnalyser - I
> have the STOP_WORDS
> list - can I remove the standard PorterStemFilter ?
>
> In fact, can I crete a new Analyser without
> PorterStemmer at all ?
>
> Thx ;)
>
>
>
>
>
---------------------------------------------------------------------
> To unsubscribe, e-mail:
> [EMAIL PROTECTED]
> For additional commands, e-mail:
> [EMAIL PROTECTED]
>
>
>
---------------------------------------------------------------------
> To unsubscribe, e-mail:
> [EMAIL PROTECTED]
> For additional commands, e-mail:
> [EMAIL PROTECTED]
>
___________________________________________________________
Do You Yahoo!? -- Une adresse @yahoo.fr gratuite et en fran�ais !
Yahoo! Mail : http://fr.mail.yahoo.com
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]