[ 
https://issues.apache.org/jira/browse/OPENNLP-1268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16865369#comment-16865369
 ] 

ASF GitHub Bot commented on OPENNLP-1268:
-----------------------------------------

kottmann commented on issue #356: OPENNLP-1268 -- fix StringUtil.toLowerCase() 
to work on codepoints, not chars
URL: https://github.com/apache/opennlp/pull/356#issuecomment-502553756
 
 
   Did you see a difference in detection performance with that change? Which 
languages would be affected?
   
   
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


> StringUtil.toLowerCase() should lowercase codepoints, not chars
> ---------------------------------------------------------------
>
>                 Key: OPENNLP-1268
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-1268
>             Project: OpenNLP
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Trivial
>
> {{StringUtils#toLowerCase()}} should run Character.tolowerCase() on code 
> points.  It is currently failing to lowercase characters beyond the BMP.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to