[ 
https://issues.apache.org/jira/browse/HADOOP-11602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14324094#comment-14324094
 ] 

Steve Loughran commented on HADOOP-11602:
-----------------------------------------

Akira: this patch fixes a defect that's been lurking in Hadoop for a while. In 
some locales the code will not work, simply because its case conversion is 
locale-sensitive. It doesn't surface often, but it can, and it has: I met 
someone at ApacheCon EU who had encountered it.

This patch spans a lot of files, but that's because the lurking bug is 
widespread. A single across-the-board, across-the-branches patch will fix it in 
one go and set an example for the rest of the code. Combined with some rigor 
in reviews, we can keep the code clean. Setting up Jenkins to test in the 
Turkish locale would be even more rigorous.
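
For anyone who hasn't seen the failure, here is a minimal sketch of it. The 
class and helper names (LocaleCaseDemo, lower) are made up for illustration 
and are not taken from the patch; only String#toLowerCase(Locale) and 
Locale.forLanguageTag are real JDK calls.

```java
import java.util.Locale;

public class LocaleCaseDemo {

    // Hypothetical helper: lower-case a string using an explicit locale
    // instead of whatever the JVM default happens to be.
    static String lower(String s, Locale locale) {
        return s.toLowerCase(locale);
    }

    public static void main(String[] args) {
        Locale turkish = Locale.forLanguageTag("tr-TR");

        // Under Turkish rules 'I' maps to dotless i (U+0131), so the
        // result is "t\u0131tle" and no longer equals "title".
        String viaTurkish = lower("TITLE", turkish);

        // With a fixed locale the result is the same on every machine.
        String viaEnglish = lower("TITLE", Locale.ENGLISH);

        System.out.println("title".equals(viaTurkish)); // prints false
        System.out.println("title".equals(viaEnglish)); // prints true
    }
}
```

The no-argument toLowerCase() behaves like the first call whenever the JVM's 
default locale is Turkish, which is why any code comparing the result against 
an ASCII constant breaks there.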

> Fix toUpperCase/toLowerCase to use Locale.ENGLISH
> -------------------------------------------------
>
>                 Key: HADOOP-11602
>                 URL: https://issues.apache.org/jira/browse/HADOOP-11602
>             Project: Hadoop Common
>          Issue Type: Bug
>    Affects Versions: 2.6.0
>            Reporter: Tsuyoshi OZAWA
>            Assignee: Tsuyoshi OZAWA
>         Attachments: HADOOP-11602-001.patch, HADOOP-11602-002.patch
>
>
> String#toLowerCase()/toUpperCase() without a locale argument can cause 
> unexpected behavior depending on the default locale. This is documented in the 
> [Javadoc|http://docs.oracle.com/javase/7/docs/api/java/lang/String.html#toLowerCase()]:
> {quote}
> For instance, "TITLE".toLowerCase() in a Turkish locale returns "t\u0131tle", 
> where '\u0131' is the LATIN SMALL LETTER DOTLESS I character
> {quote}
> This issue is derived from HADOOP-10101.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
