Wes McKinney created ARROW-3401:
-----------------------------------
Summary: [C++] Pluggable statistics collector API for
unconvertible CSV values
Key: ARROW-3401
URL: https://issues.apache.org/jira/browse/ARROW-3401
Project: Apache Arrow
Issue Type: New Feature
Components: C++
Reporter: Wes McKinney
Fix For: 0.12.0
It would be useful to be able to collect statistics (e.g. distinct value
counts) about values in a column of a CSV file that cannot be converted to a
desired data type.
When conversion fails, the converters can call into an abstract API like
{code}
statistics_->CannotConvert(token, size);
{code}
or something similar
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)