Re: Similarity Search

Mork0075 Fri, 14 Mar 2008 07:57:28 -0700

So you mean i merge the 10 paragraphs to one huge query and then selectthe top x?


Is this, from a semantic standpoint, the optimal solution?


For explanation:

Paragraph_A
Paragraph_B
Paragraph_C
....

merge -> Paragraph_A Paragraph_B Paragraph_C ... = query

[EMAIL PROTECTED] schrieb:

Hi,

You can enter the whole paragraph as query, and then select the top 10.

Cheers.
jz

On Fri, Mar 14, 2008 at 9:14 AM, Mork0075 <[EMAIL PROTECTED]> wrote:

Hello,

 we are using lucene in one of our applications for fulltext search,
 which works very vell.

 I'am now interested in some similarity search for whole paragraphs.

 For example there are 1000 textual items in the database, which contain
 on average more then perhaps 100 words per item. Now i have a set of 10
 textual items, and would like to know, which of the 1000 texual items
 are similar to the 10 (in a certain tolerance)?

 Is this possible with lucene?

 Thanks in advance
 Mark

Re: Similarity Search

Reply via email to