[
https://issues.apache.org/jira/browse/ARROW-8961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17128420#comment-17128420
]
Antoine Pitrou commented on ARROW-8961:
---------------------------------------
I've compiled both libraries:
* {{utf8proc}} weighs around 300 kB (mostly static data)
* the weight of {{unilib}} depends on which functionality is being used, as
it's header only; for example a test executable that uses property lookup and
conversion, but not codepoint combining weighs around 120 kB
> [C++] Vendor utf8proc library
> -----------------------------
>
> Key: ARROW-8961
> URL: https://issues.apache.org/jira/browse/ARROW-8961
> Project: Apache Arrow
> Issue Type: New Feature
> Components: C++
> Reporter: Wes McKinney
> Priority: Major
> Fix For: 1.0.0
>
>
> This is a minimal MIT-licensed library for UTF-8 data processing originally
> developed for use in Julia
> https://github.com/JuliaStrings/utf8proc
--
This message was sent by Atlassian Jira
(v8.3.4#803005)