[ 
https://issues.apache.org/jira/browse/ARROW-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Uwe L. Korn updated ARROW-1731:
-------------------------------
    Description: 
Currently it's all-or-nothing, and to do the subsetting in pandas incurs a data 
copy. This would enable columns (by name or index) to be selected out without 
additional data copying. We should add a {{columns=}} argument to the the 
{{from_pandas}} calls and do the subsetting when we dispatch the individual 
arrays for conversion to Arrow.

cc [~cpcloud] [~jreback]

  was:
Currently it's all-or-nothing, and to do the subsetting in pandas incurs a data 
copy. This would enable columns (by name or index) to be selected out without 
additional data copying

cc [~cpcloud] [~jreback]


> [Python] Provide for selecting a subset of columns to convert in 
> RecordBatch/Table.from_pandas
> ----------------------------------------------------------------------------------------------
>
>                 Key: ARROW-1731
>                 URL: https://issues.apache.org/jira/browse/ARROW-1731
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>            Reporter: Wes McKinney
>            Assignee: Uwe L. Korn
>            Priority: Major
>              Labels: beginner
>             Fix For: 0.10.0
>
>
> Currently it's all-or-nothing, and to do the subsetting in pandas incurs a 
> data copy. This would enable columns (by name or index) to be selected out 
> without additional data copying. We should add a {{columns=}} argument to the 
> the {{from_pandas}} calls and do the subsetting when we dispatch the 
> individual arrays for conversion to Arrow.
> cc [~cpcloud] [~jreback]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to