busykoala opened a new pull request, #12478:
URL: https://github.com/apache/lucene/pull/12478

   ### Description
   
   This pull request adds a new feature to Lucene's DictionaryDecompounder. 
Now, you can set the position increment of subtokens to one. This feature is 
required when you're doing AND searches that involve subtokens.
   
   Right now, the position increment is set to zero. That's how 
DictionaryDecompounder currently operates. But with this update, users can set 
the subtokenPositionIncrement to one. This changes the position increment of 
the subtokens to one. This means, if you're using the AND operator in 
Elasticsearch match clauses to search for 'orangenschokolade', and 'orangen' 
and 'schokolade' are in your dictionary, it will correctly search for 'orangen 
AND schokolade'.
   
   By default, the DictionaryDecompounder emits the original compounded token. 
This behavior remains unchanged when the flag is set to zero. However, when set 
to one, it changes the DictionaryDecompounder's output to individual subtokens, 
and the original compounded token will not be emitted.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to