GitHub user srowen opened a pull request:

    https://github.com/apache/spark/pull/17527

    [SPARK-20156][CORE][SQL][STREAMING][MLLIB] Java String toLowerCase "Turkish locale bug" causes Spark problems

    ## What changes were proposed in this pull request?
    
    Add `Locale.ROOT` to all calls to String `toLowerCase` and `toUpperCase`, to avoid inadvertent locale-sensitive variation in behavior (aka the "Turkish locale problem").
    
    The change looks large, but it only adds `Locale.ROOT` (the locale with no country or language specified) to every call to these methods.
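
For readers unfamiliar with the problem, here is a minimal sketch of the behavior described above. It is not taken from the patch: the class `TurkishLocaleDemo` and its helpers `lowerRoot` and `lowerIn` are hypothetical names used only for illustration, and the Turkish locale is passed explicitly to stand in for a JVM whose default locale is Turkish (the no-argument `toLowerCase()` uses `Locale.getDefault()`).

```java
import java.util.Locale;

// Hypothetical demo class; not part of the Spark patch.
class TurkishLocaleDemo {

    // Locale-independent lowercasing, the form this pull request adopts.
    static String lowerRoot(String s) {
        return s.toLowerCase(Locale.ROOT);
    }

    // Lowercasing under an explicit locale, standing in for whatever
    // the JVM default locale happens to be.
    static String lowerIn(String s, Locale locale) {
        return s.toLowerCase(locale);
    }

    public static void main(String[] args) {
        Locale turkish = Locale.forLanguageTag("tr-TR");
        String keyword = "TITLE";

        // Under Turkish casing rules, 'I' lowercases to dotless 'ı' (U+0131),
        // so the result no longer matches the ASCII string "title".
        String turkishLower = lowerIn(keyword, turkish);
        String rootLower = lowerRoot(keyword);

        System.out.println(rootLower.equals("title"));      // true
        System.out.println(turkishLower.equals("title"));   // false
        System.out.println(turkishLower.indexOf('\u0131')); // 1
    }
}
```

Any comparison of a lowercased identifier or keyword against an ASCII literal can fail in the same way, which is presumably why the change touches every call site rather than a chosen few.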
    
    ## How was this patch tested?
    
    Existing tests.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/srowen/spark SPARK-20156

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/17527.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #17527
    
----
commit 59194388d8de07d258a1a952b2cc52c57962c77a
Author: Sean Owen <[email protected]>
Date:   2017-04-04T08:47:18Z

    Add Locale.ROOT to all calls to String toLowerCase, toUpperCase, to avoid inadvertent locale-sensitive variation in behavior (aka the "Turkish locale problem")

commit ab5995c18595195df51b43c6358ed761de0181a4
Author: Sean Owen <[email protected]>
Date:   2017-04-04T08:51:30Z

    Revert inadvertent change to R file

----

