jasperjiaguo commented on code in PR #9708:
URL: https://github.com/apache/pinot/pull/9708#discussion_r1012145231


##########
pinot-segment-local/src/main/java/org/apache/pinot/segment/local/segment/creator/impl/text/LuceneTextIndexCreator.java:
##########
@@ -82,16 +91,20 @@ public class LuceneTextIndexCreator implements 
TextIndexCreator {
    *               no need to commit the index from the realtime side. So when 
the realtime segment
    *               is destroyed (which is after the realtime segment has been 
committed and converted
    *               to offline), we close this lucene index writer to release 
resources but don't commit.
-   *               This is the reason to have commit flag part of the 
constructor.
+   * @param stopWordsInclude the words to include in addition to the default 
stop word list

Review Comment:
   The current behavior is to prioritize exclude these duplicate words. I'm 
planning to put default `ENGLISH_STOP_WORDS_SET`, config keys, and this 
behavior in the user doc.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to