Re: What is the approximate processing mechanism for field length?

2023-08-10 Thread Mikhail Khludnev
Hello, Koo. I can share my shallow understanding of this algorithm. The approximate field length is called the norm, and it is stored in one byte per document for the sake of compactness (and therefore performance). Norms are encoded via org.apache.lucene.util.SmallFloat#intToByte4. Thus, there might be just 256 different
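
To make the one-byte idea concrete, here is a minimal, self-contained sketch of a lossy int-to-byte length encoding in the spirit of SmallFloat#intToByte4. It is an illustration, not Lucene's source: the class name NormSketch and the helpers toInt4, fromInt4, encode and decode are hypothetical, and the exact values Lucene produces may differ. Small lengths are kept exactly; larger ones keep only their top four significant bits plus a shift.

```java
// Illustrative sketch only: a lossy int -> byte length encoding in the spirit of
// SmallFloat#intToByte4. All names here are hypothetical, not Lucene's API.
final class NormSketch {
    // Number of small lengths stored exactly: the byte codes left over after
    // reserving enough codes to cover values up to Integer.MAX_VALUE.
    static final int EXACT = 255 - toInt4(Integer.MAX_VALUE);

    // Float-like code for a non-negative value: 3 explicit mantissa bits
    // (the leading bit is implicit) plus a shift stored in the upper bits.
    static int toInt4(long v) {
        int numBits = 64 - Long.numberOfLeadingZeros(v);
        if (numBits < 4) {
            return (int) v;                      // tiny values are stored as-is
        }
        int shift = numBits - 4;
        int mantissa = (int) (v >>> shift) & 0x07; // drop the implicit leading bit
        return mantissa | ((shift + 1) << 3);      // shift + 1, since 0 means "tiny"
    }

    // Inverse of toInt4; the low-order bits discarded by toInt4 come back as zeros.
    static long fromInt4(int code) {
        long bits = code & 0x07;
        int shift = (code >>> 3) - 1;
        return shift == -1 ? bits : (bits | 0x08) << shift;
    }

    // Encode a field length into a single byte (lossy for larger lengths).
    static byte encode(int length) {
        if (length < 0) {
            throw new IllegalArgumentException("length must be >= 0");
        }
        return (byte) (length < EXACT ? length : EXACT + toInt4(length - EXACT));
    }

    // Decode the byte back into an approximate field length.
    static int decode(byte b) {
        int code = Byte.toUnsignedInt(b);
        return code < EXACT ? code : (int) (EXACT + fromInt4(code - EXACT));
    }

    public static void main(String[] args) {
        for (int length : new int[] {10, 40, 41, 1000}) {
            System.out.println(length + " -> " + decode(encode(length)));
        }
    }
}
```

In this sketch a length of 10 round-trips exactly, while 40 and 41 collapse to the same byte and both decode to 40. That is the sense in which the stored field length is approximate: at most 256 distinct values can ever come back out.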

Re: What is the approximate processing mechanism for field length?

2023-08-09 Thread Jean-Marie Lallouet
Did the analyzer for this field remove some tokens?

> On 9. Aug 2023, at 16:43, 承諾一輩子 <502565...@qq.com.invalid> wrote:
>
> Dear colleague:
>    I am a drive development engineer working at ZTE Corporation in China.
>    I have recently been studying the Lucene source code. There is a
