[ https://issues.apache.org/jira/browse/SOLR-3524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13412685#comment-13412685 ]
Christian Moen commented on SOLR-3524:
--------------------------------------

{{CHANGES.txt}} for some reason didn't make it into {{branch_4x}}. Fixed this in revision 1360622.

> Make discard-punctuation feature in Kuromoji configurable from JapaneseTokenizerFactory
> ---------------------------------------------------------------------------------------
>
>                 Key: SOLR-3524
>                 URL: https://issues.apache.org/jira/browse/SOLR-3524
>             Project: Solr
>          Issue Type: Improvement
>          Components: Schema and Analysis
>    Affects Versions: 3.6
>            Reporter: Kazuaki Hiraga
>            Assignee: Christian Moen
>            Priority: Minor
>             Fix For: 4.0, 5.0
>
>         Attachments: SOLR-3524.patch, SOLR-3524.patch, kuromoji_discard_punctuation.patch.txt
>
>
> JapaneseTokenizer (Kuromoji) doesn't provide a configuration option to preserve
> punctuation in Japanese text, although it has a parameter to change this
> behavior. JapaneseTokenizerFactory always sets the third parameter, which
> controls this behavior, to true to remove punctuation.
> I would like to have an option so that I can configure this behavior in the
> fieldtype definition in schema.xml.
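
For illustration, a field type that preserves punctuation might look like the sketch below. It assumes the attribute exposed on the factory is named discardPunctuation, which this message does not state; the field type name text_ja_punct is hypothetical.

```xml
<!-- Hypothetical field type; the attribute name discardPunctuation is an assumption -->
<fieldType name="text_ja_punct" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- false asks the tokenizer to keep punctuation tokens;
         before this change the factory always passed true -->
    <tokenizer class="solr.JapaneseTokenizerFactory" mode="search" discardPunctuation="false"/>
  </analyzer>
</fieldType>
```

Leaving the attribute out would presumably keep the earlier behavior of discarding punctuation.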