dosubot[bot] commented on issue #2873: URL: https://github.com/apache/incubator-hugegraph/issues/2873#issuecomment-3310364877
<!-- Greeting --> Hi @qushencn! I'm [Dosu](https://go.dosu.dev/dosubot) and I’m helping the incubator-hugegraph team. <!-- Answer --> HugeGraph 目前在代码中内置了 7 种中文分词器,分别是:ansj、hanlp、smartcn、jieba、jcseg、mmseg4j 和 ikanalyzer。你可以通过配置 `search.text_analyzer` 选项来指定分词器,例如: ``` search.text_analyzer=jieba ``` 每种分词器还支持不同的分词模式,通过 `search.text_analyzer_mode` 配置,例如: - ansj: BaseAnalysis, IndexAnalysis, ToAnalysis, NlpAnalysis - hanlp: standard, nlp, index, nShort, shortest, speed - jieba: SEARCH, INDEX - jcseg: Simple, Complex - mmseg4j: Simple, Complex, MaxWord - ikanalyzer: smart, max_word 配置示例: ``` search.text_analyzer=ikanalyzer search.text_analyzer_mode=smart ``` 如果需要扩展自定义分词器,可以通过注册方式引入。关于第 8 种分词器,目前在代码和配置中只找到 7 种,官网可能存在描述滞后或统计方式不同的情况。详细可参考源码实现 [AnalyzerFactory.java](https://github.com/apache/incubator-hugegraph/blob/a53af864b31f803780a1db31cb20d3d94d5d34fc/hugegraph-server/hugegraph-core/src/main/java/org/apache/hugegraph/analyzer/AnalyzerFactory.java) 和 [CoreOptions.java](https://github.com/apache/incubator-hugegraph/blob/a53af864b31f803780a1db31cb20d3d94d5d34fc/hugegraph-server/hugegraph-core/src/main/java/org/apache/hugegraph/config/CoreOptions.java)。 <!-- Dosu Comment Footer --> *To reply, just mention [@dosu](https://go.dosu.dev/dosubot).* --- <sup>How did I do? [Good](https://app.dosu.dev/response-feedback/b1ef92b3-ced8-4b8f-bac0-46b36977a3dc?feedback_type=great_response) | [Irrelevant](https://app.dosu.dev/response-feedback/b1ef92b3-ced8-4b8f-bac0-46b36977a3dc?feedback_type=irrelevant_answer) | [Incorrect](https://app.dosu.dev/response-feedback/b1ef92b3-ced8-4b8f-bac0-46b36977a3dc?feedback_type=incorrect_sources) | [Verbose](https://app.dosu.dev/response-feedback/b1ef92b3-ced8-4b8f-bac0-46b36977a3dc?feedback_type=too_verbose) | [Hallucination](https://app.dosu.dev/response-feedback/b1ef92b3-ced8-4b8f-bac0-46b36977a3dc?feedback_type=hallucination) | [Report 🐛](https://app.dosu.dev/response-feedback/b1ef92b3-ced8-4b8f-bac0-46b36977a3dc?feedback_type=bug_report) | [Other](https://app.dosu.dev/response-feedback/b1ef92b3-ced8-4b8f-bac0-46b36977a3dc?feedback_type=other)</sup> [](https://app.dosu.dev/e5e246fa-a522-4d8a-ab3c-e2588b5b501b/ask?utm_source=githu b) [](https://go.dosu.dev/discord-bot) [](https://twitter.com/intent/tweet?text=%40dosu_ai%20helped%20me%20solve%20this%20issue!&url=https%3A//github.com/apache/incubator-hugegraph/issues/2873) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@hugegraph.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@hugegraph.apache.org For additional commands, e-mail: issues-h...@hugegraph.apache.org