[
https://issues.apache.org/jira/browse/ARROW-6131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16901471#comment-16901471
]
Wes McKinney commented on ARROW-6131:
-
In principle this seems OK to me. We can discuss further in a
[
https://issues.apache.org/jira/browse/ARROW-6131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16900580#comment-16900580
]
Yuqi Gu commented on ARROW-6131:
OK, I see. How about introducing a fast non-ASCII validation method
[
https://issues.apache.org/jira/browse/ARROW-6131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16900111#comment-16900111
]
Wes McKinney commented on ARROW-6131:
-
[~yqGu] in which component of the project is UTF-8 validation
[
https://issues.apache.org/jira/browse/ARROW-6131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16899825#comment-16899825
]
Antoine Pitrou commented on ARROW-6131:
---
I expect all-ASCII data to be very frequent in the kind of
[
https://issues.apache.org/jira/browse/ARROW-6131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16899742#comment-16899742
]
Yuqi Gu commented on ARROW-6131:
The original UTF-8 benchmark:
{code:java}