Hello, Koo.
I can share my shallow understanding of this algorithm.
The approximate field length is called the norm and is stored in one byte per document
for the sake of compactness (and therefore performance). Norms are encoded
via org.apache.lucene.util.SmallFloat#intToByte4.
Thus, there might be just 256 different
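
To make the one-byte encoding concrete, here is a minimal sketch of a lossy int-to-byte scheme of the kind described above. It is a re-implementation written for illustration, not Lucene's source: the class name NormSketch and its method names are hypothetical, and the bit layout (a 3-bit stored mantissa plus a shift, with the smallest values kept exact) is my assumption about how such an encoder works. In real code you would call SmallFloat.intToByte4 and SmallFloat.byte4ToInt directly.

```java
// Hypothetical sketch for illustration only; not taken from Lucene's source.
final class NormSketch {
    // Largest code the mantissa/shift scheme produces for a non-negative int.
    private static final int MAX_INT4 = longToInt4(Integer.MAX_VALUE);
    // Byte values left over are spent on storing the smallest lengths exactly.
    private static final int NUM_FREE_VALUES = 255 - MAX_INT4;

    // Keep the 4 most significant bits of the value (the top one implicit) plus a shift.
    private static int longToInt4(long i) {
        int numBits = 64 - Long.numberOfLeadingZeros(i);
        if (numBits < 4) {
            return (int) i;                       // small values are stored as-is
        }
        int shift = numBits - 4;
        int encoded = (int) (i >>> shift) & 0x07; // drop the implicit leading bit
        return encoded | ((shift + 1) << 3);      // shift code 0 is reserved for small values
    }

    private static long int4ToLong(int i) {
        long bits = i & 0x07;
        int shift = (i >>> 3) - 1;
        return shift == -1 ? bits : (bits | 0x08) << shift;
    }

    // Encode a field length into one byte; precision is lost for larger lengths.
    static byte encode(int length) {
        if (length < 0) {
            throw new IllegalArgumentException("length must be >= 0");
        }
        if (length < NUM_FREE_VALUES) {
            return (byte) length;
        }
        return (byte) (NUM_FREE_VALUES + longToInt4(length - NUM_FREE_VALUES));
    }

    // Decode a byte back to the (rounded-down) length it represents.
    static int decode(byte b) {
        int i = Byte.toUnsignedInt(b);
        if (i < NUM_FREE_VALUES) {
            return i;
        }
        return (int) (NUM_FREE_VALUES + int4ToLong(i - NUM_FREE_VALUES));
    }

    public static void main(String[] args) {
        // Print each length next to the length that survives the round trip.
        for (int length : new int[] {5, 39, 40, 41, 100, 1000}) {
            System.out.println(length + " -> " + decode(encode(length)));
        }
    }
}
```

Under these assumptions, short lengths survive the round trip unchanged, while neighbouring longer lengths collapse onto the same byte and therefore get the same norm.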
Did the analyzer for this field remove some tokens?
> On 9. Aug 2023, at 16:43, 承諾一輩子 <502565...@qq.com.invalid> wrote:
>
> Dear colleague:
> I am a drive development engineer working at ZTE Corporation in
> China.
> I have recently been studying the Lucene source code. There is a