[ 
https://issues.apache.org/jira/browse/ARROW-13157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368896#comment-17368896
 ] 

David Li commented on ARROW-13157:
----------------------------------

IIRC, this should be doable, but requires some trouble: we would use RE2 to do 
the case-insensitive search, but RE2 doesn't return the match position unless 
you have a capture group. However, then you can't use the 'literal' option 
anymore and have to escape all regex characters in your search string. Though I 
see RE2 has a QuoteMeta function to do this for you, so it shouldn't be that 
bad.

> [C++] Implement ignore_case option for find_substring 
> ------------------------------------------------------
>
>                 Key: ARROW-13157
>                 URL: https://issues.apache.org/jira/browse/ARROW-13157
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++
>            Reporter: Nic Crane
>            Priority: Major
>
> The find_substring compute function uses the MatchSubstringOptions Options 
> class.  However, when I try to set ignore_case to TRUE, I get the following 
> error:
> {code:java}
>  Error: NotImplemented: find_substring with ignore_case
> {code}
> R code to replicate the error is below, though depends on a currently 
> unmerged branch:
> {code:java}
> df <- tibble(x = c("Foo and Bar", "baz and qux and quux"))
> df %>%
>       Table$create() %>%
>       mutate(x = arrow_find_substring(x, options = list(pattern = "b", 
> ignore_case = TRUE))) %>%
>       collect()
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to