[ 
https://issues.apache.org/jira/browse/ARROW-5977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16901133#comment-16901133
 ] 

Wes McKinney commented on ARROW-5977:
-------------------------------------

I think just include is okay. It might make sense to co-develop this in 
conjunction with the Datasets interface to CSV files (since this needs to be 
able to select columns as well as insert missing fields -- which become all 
null -- this can happen as a post-scan operation though)

> [C++] [Python] Method for read_csv to limit which columns are read?
> -------------------------------------------------------------------
>
>                 Key: ARROW-5977
>                 URL: https://issues.apache.org/jira/browse/ARROW-5977
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++, Python
>    Affects Versions: 0.14.0
>            Reporter: Jordan Samuels
>            Priority: Major
>              Labels: csv
>
> In pandas there is pd.read_csv(usecols=...) but I can't see a way to do this 
> in pyarrow. 



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

Reply via email to