GitHub user srowen opened a pull request:
https://github.com/apache/spark/pull/17527
[SPARK-20156][CORE][SQL][STREAMING][MLLIB] Java String toLowerCase "Turkish
locale bug" causes Spark problems
## What changes were proposed in this pull request?
Add Locale.ROOT to all calls to String `toLowerCase`, `toUpperCase`, to
avoid inadvertent locale-sensitive variation in behavior (aka the "Turkish
locale problem").
The change looks large but it is just adding `Locale.ROOT` (the locale with
no country or language specified) to every call to these methods.
## How was this patch tested?
Existing tests.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/srowen/spark SPARK-20156
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/17527.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #17527
----
commit 59194388d8de07d258a1a952b2cc52c57962c77a
Author: Sean Owen <[email protected]>
Date: 2017-04-04T08:47:18Z
Add Locale.ROOT to all calls to String toLowerCase, toUpperCase, to avoid
inadvertent locale-sensitive variation in behavior (aka the "Turkish locale
problem")
commit ab5995c18595195df51b43c6358ed761de0181a4
Author: Sean Owen <[email protected]>
Date: 2017-04-04T08:51:30Z
Revert inadvertent change to R file
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]