Re: Extracting article keywords using tf-idf algorithm

2015-07-20 Thread Ali Nazemian
Hi again, It seems that my problem with the strange behavior of Solr caused by the fact that I tried to update documents and add keyword field inside the Lucene index (not from using Solrj API) for the sake of better performance, But it seems that some processes ignored by this way of modifying ind

Re: Extracting article keywords using tf-idf algorithm

2015-07-18 Thread Ali Nazemian
Dear Diego, Hi, Yeah, exactly what I want. As Shawn said it is acronym for More Like This. Actually since Lucene already did the hardworking for the purpose of calculating interesting terms, I just want to use that for adding a multi-value field to all indexed documents. Best regards. On Sat, Jul

Re: Extracting article keywords using tf-idf algorithm

2015-07-18 Thread Shawn Heisey
On 7/18/2015 9:16 AM, Diego Ceccarelli wrote: > Could you please post your code somewhere? I don't understand what is > "mlt" :) This is an acronym that means "More Like This". https://wiki.apache.org/solr/MoreLikeThis Thanks, Shawn

Re: Extracting article keywords using tf-idf algorithm

2015-07-18 Thread Diego Ceccarelli
Dear Ali, I'm not sure I understand what you are trying to do, please correct me if I misunderstood: given a document indexed into lucene you want to retrieve the top-k terms with highest tf-idf right? Could you please post your code somewhere? I don't understand what is "mlt" :) Cheers, Diego

Extracting article keywords using tf-idf algorithm

2015-07-17 Thread Ali Nazemian
Dear Lucene/Solr developers, Hi, I decided to develop a plugin for Solr in order to extract main keywords from article. Since Solr already did the hard-working for calculating tf-idf scores I decided to use that for the sake of better performance. I know that UpdateRequestProcessor is the best suit