You could append an EOF token to every indexed text, and then iterate over Terms to get the positions of those tokens?
On Tue, Jun 2, 2020 at 11:50 AM Moritz Staudinger <[email protected]> wrote: > > Hello, > > I am not sure if I am at the right place here, but I got a question about > the approximation my Lucene implementation does. > > I am trying to calculate the same scores Lucenes BM25Similiarity calculates, > but I found out that Lucene only approximates the length of documents for > scoring but uses the correct values for the average document length. > Is there a way to turn off these approximations or to get the values, so > that I can save it for my own calculations? > > For my Implementation I use Lucene 8.4.1 in Combination with Spring Boot, if > this is necessary. > > Thank you in advance, > Moritz > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [email protected] > For additional commands, e-mail: [email protected] > --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
