[ https://issues.apache.org/jira/browse/ARROW-3536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wes McKinney resolved ARROW-3536.
---------------------------------
       Resolution: Fixed
    Fix Version/s:     (was: 0.13.0)
                   0.12.0

Issue resolved by pull request 2916
[https://github.com/apache/arrow/pull/2916]

> [C++] Fast UTF8 validation functions
> ------------------------------------
>
>                 Key: ARROW-3536
>                 URL: https://issues.apache.org/jira/browse/ARROW-3536
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++
>            Reporter: Wes McKinney
>            Assignee: Antoine Pitrou
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.12.0
>
>          Time Spent: 1h 50m
>  Remaining Estimate: 0h
>
> [~lemire] discusses this topic in
> https://lemire.me/blog/2018/05/16/validating-utf-8-strings-using-as-little-as-0-7-cycles-per-byte/
> In Java there is also
> https://lemire.me/blog/2018/10/16/validating-utf-8-bytes-java-edition/

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)