To get the term-freq

2017-11-16 Thread Dwaipayan Roy
​Hi, I want to get the term frequency of a given term t in a given document with lucene docid say d. Formally, I need a function say f() that takes two arguments: 1. lucene-docid d, 2. term t, and returns the number of time t occurs in d. I know of one solution, that is, traversing the whole

Optimize FTS memory footprint

2017-11-16 Thread Eli Revach
Hi I am using Elasticserach 1.7.5 , our segment memory allocation per node is very big , its seems like related to FST . 1) Amy way to reduce /optimze its size ( i understed its the index for the terms) ? 2) Did index optimize can help ? 2) The fact that we used nested objects can dramticly the

Custom scoring algorithm and Explanation extending.

2017-11-16 Thread Vadim Gindin
Hello 1. I would like to use my custom scoring algorithm. Is it make sense to use Lucene with other scoring algorithm? What is the best way for that - implement Similarity and own Queries? 2. I'm reasearching Elasticsearch/Lucene capabilities. Elastichsearch contains request parameter "explain"

Weighed Synonyms - Lucene 7.1

2017-11-16 Thread benafia salem
Hello, I'm trying to implement a weighed synonyms analyzer using SynonymGraphFilter and a custom payload analyzer. My synonyms file looks like: obama,president^0.9,barry^0.5 speaks,greets^0.8,congratulates^0.9,informs media,press,journal,tv^0.8 I want user searching for "obama greets press" to

Extending Explanation class information

2017-11-16 Thread Vadim Gindin
Hello Lucene developers and users. I'm currently researching Elasticsearch/Lucene capabilites. I'd like to extend an information that Explanation class provides. This class currently provides only score computation for document or query. Particularly I'd like to include the following information