[
https://issues.apache.org/jira/browse/ARROW-555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17051264#comment-17051264
]
Maarten Breddels commented on ARROW-555:
----------------------------------------
Related: https://issues.apache.org/jira/browse/ARROW-7083
I will probably start working on this a few weeks from now. My initial
intention would be to separate the algorithms as much as possible so it would
be possible to add them both to gandiva and a 'bare' kernel, or with a minimal
amount of refactoring.
[~wesm]: what's your reason to choose re2? Gandiva and vaex both use pcre, but
I have no strong preference (except being a bit familiar with pcre).
> [C++] String algorithm library for StringArray/BinaryArray
> ----------------------------------------------------------
>
> Key: ARROW-555
> URL: https://issues.apache.org/jira/browse/ARROW-555
> Project: Apache Arrow
> Issue Type: New Feature
> Components: C++
> Reporter: Wes McKinney
> Priority: Major
> Labels: Analytics
>
> This is a parent JIRA for starting a module for processing strings in-memory
> arranged in Arrow format. This will include using the re2 C++ regular
> expression library and other standard string manipulations (such as those
> found on Python's string objects)
--
This message was sent by Atlassian Jira
(v8.3.4#803005)