Joris Van den Bossche created ARROW-8314:
--------------------------------------------

             Summary: [Python] Provide a method to select a subset of columns 
of a Table
                 Key: ARROW-8314
                 URL: https://issues.apache.org/jira/browse/ARROW-8314
             Project: Apache Arrow
          Issue Type: New Feature
          Components: Python
            Reporter: Joris Van den Bossche


I looked through the open issues and our API, but didn't find anything 
about selecting a subset of the columns of a table.

Assume you have a table like:

{code}
table = pa.table({'a': [1, 2], 'b': [.1, .2], 'c': ['a', 'b']})
{code}

You can select a single column with {{table.column('a')}} or {{table['a']}} to 
get a chunked array. You can add, append, remove and replace columns (with 
{{add_column}}, {{append_column}}, {{remove_column}}, {{set_column}}). 
But there doesn't seem to be an easy way to get a subset of the columns 
(without manually removing the ones you don't want one by one). 

I would propose something like:

{code}
table.select(['a', 'c'])
{code}


