[
https://issues.apache.org/jira/browse/SPARK-15064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15267021#comment-15267021
]
Burak KÖSE commented on SPARK-15064:
------------------------------------
Thank you, I have been trying to improve StopWordsRemover. For example; the
real problem is
lowercase = "i", and uppercase should be "İ", but the programming language
gives "I", right?
> Locale support in StopWordsRemover
> ----------------------------------
>
> Key: SPARK-15064
> URL: https://issues.apache.org/jira/browse/SPARK-15064
> Project: Spark
> Issue Type: New Feature
> Components: ML
> Affects Versions: 2.0.0
> Reporter: Xiangrui Meng
>
> We support case insensitive filtering (default) in StopWordsRemover. However,
> case insensitive matching depends on the locale and region, which cannot be
> explicitly set in StopWordsRemover. We should consider adding this support in
> MLlib.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]