From what I can understand, you want to insert the word "history" and then get proposed "related" terms in combination with your input query. In essense this would be to do a "look-up" on top-terms in the subset of documents matching the initial query "history". Exactly how you could do this is a bit uncertain from my knowledge, but I suggest you read up on term-frequency and the tf-idf scheme.

Also: take a look at the org.apache.lucene.search.similar package:
http://hudson.zones.apache.org/hudson/job/Lucene-trunk/javadoc//org/apache/lucene/search/similar/package-summary.html
and read the motivation email listed in the first segment of
http://hudson.zones.apache.org/hudson/job/Lucene-trunk/javadoc//org/apache/lucene/search/similar/MoreLikeThis.html

I couldn't really see how you would autocomplete after the word history without listing a bunch of un-interesting terms as suggestions... But i might be wrong... Of course, if it was autocompletion you were looking for¸ Asbjørn answered that one just fine:)

Best regards,
 Aleksander M. Stensby


On Thu, 09 Oct 2008 18:49:26 +0200, Asbjørn A. Fellinghaug <[EMAIL PROTECTED]> wrote:

Albert Juhe:

Hi,

I want to make a wizard that can help to find n-grams terms.
For example:
If i want to search History, after write it the system propose you the
following searches:
history europe
history spain
history .....
Consulting the terms indexed.

Does it exits in Lucene?

Hi.

I interpret your question in such a way that you want autocompletion in
your search system? In that case, I believe there are some Analyzer's
which does this in the 'contrib' package. Also, I've created an Analyzer
which creates "bigrams" (n-gram of size 2) in my master thesis.
Feel free to download it from this page:
http://asbjorn.fellinghaug.com/blog/2008/08/the-code-for-my-master-thesis/

Also, have a look at the package org.apache.lucene.analysis.ngram:
http://lucene.apache.org/java/2_3_2/api/org/apache/lucene/analysis/ngram/package-summary.html




--
Aleksander M. Stensby
Senior Software Developer
Integrasco A/S
+47 41 22 82 72
[EMAIL PROTECTED]

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to