[ https://issues.apache.org/jira/browse/ARROW-5977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16901133#comment-16901133 ]
Wes McKinney commented on ARROW-5977:
-------------------------------------
I think just include is okay. It might make sense to co-develop this in
conjunction with the Datasets interface to CSV files, since it needs to be
able to select columns as well as insert missing fields (which become all
null). This can happen as a post-scan operation, though.
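
To make the post-scan idea concrete, here is a minimal sketch in plain Python,
using only the standard library. It is not Arrow's implementation or API: the
function name select_columns and its include parameter are hypothetical,
chosen only for illustration. The file is scanned in full first, then the
requested columns are kept in the requested order, and any requested column
that is missing from the file is filled entirely with None.

```python
import csv
import io


def select_columns(csv_text, include):
    """Scan CSV text, then keep only the columns named in `include`.

    Hypothetical helper for illustration only. Columns listed in `include`
    but absent from the file come back as all-None columns.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    present = set(reader.fieldnames or [])
    # One output list per requested column, in the requested order.
    columns = {name: [] for name in include}
    for row in reader:
        for name in include:
            columns[name].append(row[name] if name in present else None)
    return columns


sample = "a,b,c\n1,2,3\n4,5,6\n"
result = select_columns(sample, ["c", "a", "d"])
print(result)
```

Here column b is dropped, c and a are returned in the order requested, and d,
which does not exist in the input, is returned as a column of None values.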
> [C++] [Python] Method for read_csv to limit which columns are read?
> -------------------------------------------------------------------
>
> Key: ARROW-5977
> URL: https://issues.apache.org/jira/browse/ARROW-5977
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++, Python
> Affects Versions: 0.14.0
> Reporter: Jordan Samuels
> Priority: Major
> Labels: csv
>
> In pandas there is pd.read_csv(usecols=...) but I can't see a way to do this
> in pyarrow.