[
https://issues.apache.org/jira/browse/ARROW-13853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17419937#comment-17419937
]
Eduardo Ponce edited comment on ARROW-13853 at 9/24/21, 7:18 PM:
-----------------------------------------------------------------
Arrow string compute functions do not support a locale setting and use the
default POSIX locale which is "C".
Arrow provides UTF-8 variants of the string functions for localization
standardization.
Currently, only _strftime_ uses a locale option for formatting the resulting
string.
Localization would also need to be considered for kernels that use/change text
casing, compare strings, and format numbers.
Created [ARROW-14126|https://issues.apache.org/jira/browse/ARROW-14126] to
further investigate localization support for string functions.
was (Author: edponce):
Arrow string compute functions do not support a locale setting and use the
default POSIX locale which is "C".
Arrow provides UTF-8 variants of the string functions for localization
standardization.
Currently, only _strftime_ uses a locale option for formatting the resulting
string.
Localization would also need to be considered for kernels that use/change text
casing, compare strings, and format numbers.
> [R] String title case kernel
> ----------------------------
>
> Key: ARROW-13853
> URL: https://issues.apache.org/jira/browse/ARROW-13853
> Project: Apache Arrow
> Issue Type: New Feature
> Components: R
> Reporter: Neal Richardson
> Assignee: Eduardo Ponce
> Priority: Major
> Labels: kernel
> Fix For: 6.0.0
>
>
> ARROW-12714 added the kernel and a basic mapping, but we should add a test.
> Also the stringr function takes a "locale" argument which is not handled
> here; we should either pass it to Arrow C++ if it supports it (which I doubt)
> or error if a non-default value is provided in R.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)