Hi,
I'm trying to implement Explicit semantic analysis(ESA) via Lucene.
How do I take a term TFIDF in a query into consideration when matching
documents?
For example:
Query:"a b c a d a"
Doc1:"a b a"
Doc2:"a b c"
The query should match Doc1 better than 2.
I'd like this to work without impacting
@lucene.apache.org
Subject: tf/idf similarity with modified document similarity
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Hello,
what is the best method to score documents similar to default similarity,
but the document
frequency should be calculated per query against the matching result
document set
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Hello,
what is the best method to score documents similar to default similarity, but
the document
frequency should be calculated per query against the matching result document
set, not statically
against the whole corpus.
Didn't found a good and pe
>> regardsshaimaa
>>
>> --
>> If you reply to this email, your message will be added to the
>> discussion below:
>> http://lucene.472066.n3.nabble.com/Document-Similarity-tp3998082.html
>> To unsubscribe from Lucene, click
>&g
ntain any code
any help will be greatly apreciated
regardsshaimaa
> Date: Mon, 30 Jul 2012 07:32:49 -0700
> From: in.ab...@gmail.com
> To: java-user@lucene.apache.org
> Subject: Re: Document Similarity
>
> Hi ELshaimaa,
> I couldnt able understood what is your need . Can you
can use to map the document to one of the documents in
> the index
> regardsshaimaa
>
> --
> If you reply to this email, your message will be added to the discussion
> below:
> http://lucene.472066.n3.nabble.com/Document-Similarity-tp3998082.html
(12/03/13 2:38), Hassane Cabir wrote:
Hi guys,
I'm using Lucene for my project and I need to calcule how similar two (or
more) documents are, using TFIDF. How to get TFIDF with lucene?
Any insights on this?
Solr has TermVectorComponent which can return tf, df and tf-idf of each term
in a docu
Hi guys,
I'm using Lucene for my project and I need to calcule how similar two (or
more) documents are, using TFIDF. How to get TFIDF with lucene?
Any insights on this?
Thank you for your support
.
--
Hassane
e the fields you just indexed... no need to
> retrieve it again).
>
> -Yonik
>
> On 1/20/06, Klaus <[EMAIL PROTECTED]> wrote:
> >
> > >In my case, i need to filter similar documents in search results and
> > >therefore determine document similarity durin
n my case, i need to filter similar documents in search results and
> >therefore determine document similarity during indexing process using
> >term vectors. Obviously, i can't compare currently indexing document
> >with all documents in my collection.
>
> Yes you can.
>In my case, i need to filter similar documents in search results and
>therefore determine document similarity during indexing process using
>term vectors. Obviously, i can't compare currently indexing document
>with all documents in my collection.
Yes you can. Right after
Hello lucene people!
First of all, i would like to thank all of community participants (
developers, users, Erik and Otis for "Lucene in Action" book ) for
their great work.
As far as i understand it, there are two most popular approches
concerning document similarity:
1. "cosine
12 matches
Mail list logo