You could append an EOF token to every indexed text, and then iterate over Terms to get the positions of those tokens?
On Tue, Jun 2, 2020 at 11:50 AM Moritz Staudinger <mor...@staudinger.work> wrote: > > Hello, > > I am not sure if I am at the right place here, but I got a question about > the approximation my Lucene implementation does. > > I am trying to calculate the same scores Lucenes BM25Similiarity calculates, > but I found out that Lucene only approximates the length of documents for > scoring but uses the correct values for the average document length. > Is there a way to turn off these approximations or to get the values, so > that I can save it for my own calculations? > > For my Implementation I use Lucene 8.4.1 in Combination with Spring Boot, if > this is necessary. > > Thank you in advance, > Moritz > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org