[ 
https://issues.apache.org/jira/browse/ARROW-8961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17128386#comment-17128386
 ] 

Antoine Pitrou commented on ARROW-8961:
---------------------------------------

Also, {{unilib}} uses similar a lookup scheme, so it's unlikely to be 
significantly faster (it's actually a bit more complicated, because it seems it 
tries to compress the data tables more, at the expense of slightly more 
complicated lookup).

A concern about {{unilib}}, though, would be that it has had a single 
contributor over its 6 years of existence.

> [C++] Vendor utf8proc library
> -----------------------------
>
>                 Key: ARROW-8961
>                 URL: https://issues.apache.org/jira/browse/ARROW-8961
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++
>            Reporter: Wes McKinney
>            Priority: Major
>             Fix For: 1.0.0
>
>
> This is a minimal MIT-licensed library for UTF-8 data processing originally 
> developed for use in Julia
> https://github.com/JuliaStrings/utf8proc



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to