[jira] [Commented] (ARROW-6131) [C++] Optimize the Arrow UTF-8-string-validation

2019-08-06 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16901471#comment-16901471 ] Wes McKinney commented on ARROW-6131: - In principle this seems OK to me. We can discuss further in a

[jira] [Commented] (ARROW-6131) [C++] Optimize the Arrow UTF-8-string-validation

2019-08-05 Thread Yuqi Gu (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16900580#comment-16900580 ] Yuqi Gu commented on ARROW-6131: OK, I see. And how about to introduce a fast non-ASCII validation method

[jira] [Commented] (ARROW-6131) [C++] Optimize the Arrow UTF-8-string-validation

2019-08-05 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16900111#comment-16900111 ] Wes McKinney commented on ARROW-6131: - [~yqGu] in which component of the project is UTF8-validation

[jira] [Commented] (ARROW-6131) [C++] Optimize the Arrow UTF-8-string-validation

2019-08-05 Thread Antoine Pitrou (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16899825#comment-16899825 ] Antoine Pitrou commented on ARROW-6131: --- I expect all-ASCII data to be very frequent in the kind of

[jira] [Commented] (ARROW-6131) [C++] Optimize the Arrow UTF-8-string-validation

2019-08-04 Thread Yuqi Gu (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16899742#comment-16899742 ] Yuqi Gu commented on ARROW-6131: The origin utf8 benchmark : {code:java}