John,

It actually should be pretty easy to use just the parts of Lucene you
want (the analyzers, etc) without using the rest.  See the example of
the PorterStemmer from this article:

http://www.onjava.com/pub/a/onjava/2003/01/15/lucene.html?page=2

You could feed a Reader to the tokenStream() method of
PorterStemAnalyzer, and get back a TokenStream, from which you pull
the tokens using the next() method.



On Wed, 17 Nov 2004 18:54:07 -0500, [EMAIL PROTECTED]
<[EMAIL PROTECTED]> wrote:
> 
> Is there a way to use Lucene stemming and stop word removal without using the 
> rest of the tool?   I am downloading the code now, but I imagine the answer 
> might be deeply burried.  I would like to be able to send in a phrase and get 
> back a collection of keywords if possible.
> 
> I am thinking of using an intermediary solution before moving fully to 
> Lucene.  I don't have time to spend a month making a carefully tested, 
> administratable Lucene solution for my site yet, but I intend to do so over 
> time.  Funny thing is the Lucene code likely would only take up a couple 
> hundred of lines, but integration and administration would take me much more 
> time.
> 
> In the meantime, I am thinking I could use perhaps Lucene steming and parsing 
> of words, then stick each search word along with the associated primary key 
> in an indexed MySql table.   Each record I would need to do this to is small 
> with maybe only average 15 userful words.   I would be able to have an 
> in-database solution though ranking, etc would not exist.   This is better 
> then the exact word searching i have currently which is really bad.
> 
> By the way, MySql 4.1.1 has some Lucene type handling, but it too does not 
> have stemming and I am sure it is very slow compaired to Lucene.   Cpanel is 
> still stuck on MySql 4.0.* so many people would not have access to even this 
> basic ability in production systems for some time yet.
> 
> JohnE
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 
>

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to