[ https://issues.apache.org/jira/browse/SOLR-2939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Stanislaw Osinski resolved SOLR-2939. ------------------------------------- Resolution: Fixed Fix Version/s: 4.0 In trunk and branch_3x. Wiki page updated. The language code variable expansion in field names has not yet been implemented, I'll move it to a dedicated issue. > Clustering of multilingual search results > ----------------------------------------- > > Key: SOLR-2939 > URL: https://issues.apache.org/jira/browse/SOLR-2939 > Project: Solr > Issue Type: Improvement > Components: contrib - Clustering > Reporter: Stanislaw Osinski > Assignee: Stanislaw Osinski > Fix For: 3.6, 4.0 > > > Carrot2 internally supports clustering of multilingual search results. The > clustering component should allow passing a language field to Carrot2. This > feature would need at least two new parameters: {{carrot.lang}} for the name > of Solr field that contains the language code (ISO 639) and a > {{carrot.lcmap}} field similar to the one in language recognizer to map > arbitrary strings to ISO 639 codes. > Another feature of language recognizer we should mirror is the expansion of > the {{{lang}}} token in field names into the language code of the document > (in case of multiple languages per document -- the first Carrot2-supported > language code). The feature seems easy to implement in the non-distributed > setting of Solr, but the simple implementation isn't going to work in the > distributed setting because the name of the specific field to be fetched > depends on the content (language) of each matching document. Looking at the > {{SearchClusteringEngine.getFieldsToLoad(SolrQueryRequest)}} method, a quick > but costly solution would be to load the contents of all stored fields. I'm > not too strong in distributed-mode Solr, but maybe this could be optimized so > that only the required fields get fetched? -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org