On 12 Apr 2007 at 0:28, karl wettin wrote:
> 
> 11 apr 2007 kl. 22.32 skrev Steffen Heinrich:
> 
> > According to occasional references on this list some people have
> > already tried to implement such a search with lucene but did they
> > succeed?
> >
> > My first idea was to run every completed token of the request
> > (current user input) through a spellchecker and expand an incomplete
> > token to a PrefixQuery.
> 
> I've posted a solution in the Jira that use sucessful user queries as  
> a cropus rather than bashing a Lucene index, and is thus best suited  
> for systems with a bit of user activity. The data is stored in a trie- 
> pattern with meta data at each node, allowing for extra data such as  
> number of hits, available facet classifications or what not. I've had  
> it running for a couple of days in a system with one user query every  
> ten milliseconds, so it ought to be stable(tm).
> 
> https://issues.apache.org/jira/browse/LUCENE-625
> 
> There is a memoryleak in the trie at optimize() that has been fixed  
> locally. Might be available in LUCENE-626 too. Not sure right now,  
> but let me know and I'll make sure to post it.
> 
> 
> -- 
> karl
> 
Hello Karl,

thank you for this information. It was the first time I ever heard of 
tries and had to read up on them. Very interesting!

The intended system however can not be trained by user input. The 
suggestions have to come from a given corpus (e.g. an ocasionally 
updated product database).
Do you think adopting your package to set up the tries from a corpus 
would be fairly easy?

Cheers, Steffen



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to