Unalian commented on PR #41114: URL: https://github.com/apache/doris/pull/41114#issuecomment-2371017217
> @Unalian > > 1. Did you use the official Clucene for your tests? > 2. Doris has implemented some optimizations and modifications to Clucene. You can refer to this codebase: https://github.com/apache/doris-thirdparty/tree/clucene and the related pull request: [[opt](inverted index) optimize the space usage of the inverted index dictionary file and position information doris-thirdparty#238](https://github.com/apache/doris-thirdparty/pull/238). 1. I used this version: git://clucene.git.sourceforge.net/gitroot/clucene/clucene; And I set simple analyzer; Set writer config: SOTRE_NO; INDEX_NONORMS; INDEX_TOKENIZED; Add I set the config to make sure there is only one segment. 2. Thank you! I am reading the code here. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
