[ https://issues.apache.org/jira/browse/SOLR-4412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13698279#comment-13698279 ]
Jack Krupansky commented on SOLR-4412: -------------------------------------- Thanks for the clarification - I mistook the very long base class name for one of the implementations. So, it looks fine. > LanguageIdentifier lcmap for language field > ------------------------------------------- > > Key: SOLR-4412 > URL: https://issues.apache.org/jira/browse/SOLR-4412 > Project: Solr > Issue Type: Bug > Components: contrib - LangId > Affects Versions: 4.1 > Reporter: Jan Høydahl > Assignee: Jan Høydahl > Fix For: 5.0, 4.4 > > Attachments: SOLR-4412.patch > > > For some languages, the detector will detect sub-languages, such as > LangDetect detecting zh-tw or zh-cn for Chinese. Tika detector only detects > zh. Today you can use {{lcmap}} to map these two into one code, e.g. > {{langid.map.lcmap=zh-cn:zh zh-tw:zh}}. But the {{langField}} output is not > changed. > We need an option for {{langField}} as well. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org