[ https://issues.apache.org/jira/browse/ARROW-3401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rok Mihevc updated ARROW-3401:
------------------------------
External issue URL: https://github.com/apache/arrow/issues/19729
> [C++] Pluggable statistics collector API for unconvertible CSV values
> ---------------------------------------------------------------------
>
> Key: ARROW-3401
> URL: https://issues.apache.org/jira/browse/ARROW-3401
> Project: Apache Arrow
> Issue Type: New Feature
> Components: C++
> Reporter: Wes McKinney
> Priority: Major
>
> It would be useful to collect statistics (e.g. distinct value counts) about
> the values in a CSV column that cannot be converted to the desired data
> type.
> When a conversion fails, the converter could call into an abstract API like
> {code}
> statistics_->CannotConvert(token, size);
> {code}
> or something similar.
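
One way the proposed hook could look is sketched below. This is only an
illustration, not Arrow's actual API: the ConversionStatistics interface, the
DistinctValueCounter collector, and the TryConvertInt64 stand-in converter are
all hypothetical names, and only the CannotConvert(token, size) call shape
comes from the issue text. The collector implements the distinct value counts
mentioned above.

```cpp
#include <cstddef>
#include <cstdint>
#include <exception>
#include <string>
#include <unordered_map>

// Hypothetical abstract interface (an assumption, not an existing Arrow class).
// Converters call it with the raw bytes of a token they could not convert.
class ConversionStatistics {
 public:
  virtual ~ConversionStatistics() = default;
  virtual void CannotConvert(const char* token, std::size_t size) = 0;
};

// Hypothetical collector: counts how often each distinct unconvertible
// value was seen.
class DistinctValueCounter : public ConversionStatistics {
 public:
  void CannotConvert(const char* token, std::size_t size) override {
    ++counts_[std::string(token, size)];
  }

  const std::unordered_map<std::string, std::int64_t>& counts() const {
    return counts_;
  }

 private:
  std::unordered_map<std::string, std::int64_t> counts_;
};

// Simplified stand-in for a CSV converter (hypothetical). It tries to parse
// the whole token as a signed 64-bit integer and reports failures to the
// collector, if one was supplied.
bool TryConvertInt64(const char* token, std::size_t size,
                     ConversionStatistics* statistics, std::int64_t* out) {
  const std::string text(token, size);
  try {
    std::size_t consumed = 0;
    const long long value = std::stoll(text, &consumed);
    if (consumed == text.size()) {
      *out = static_cast<std::int64_t>(value);
      return true;
    }
  } catch (const std::exception&) {
    // Not a number, or out of range: fall through to the failure path.
  }
  if (statistics != nullptr) {
    statistics->CannotConvert(token, size);
  }
  return false;
}
```

Passing the collector as a nullable pointer keeps the hook optional, so a
caller that does not want statistics pays only for a null check on the
failure path. After feeding the tokens "42", "N/A", "N/A" and "12x" through
TryConvertInt64 with a DistinctValueCounter, counts() holds two entries:
"N/A" with count 2 and "12x" with count 1.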
--
This message was sent by Atlassian Jira
(v8.20.10#820010)