[ https://issues.apache.org/jira/browse/ARROW-4589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated ARROW-4589:
----------------------------------
    Labels: pull-request-available  (was: )

> [Rust] [DataFusion] Aggregate query does not push down projection to CSV file
> -----------------------------------------------------------------------------
>
>                 Key: ARROW-4589
>                 URL: https://issues.apache.org/jira/browse/ARROW-4589
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Rust - DataFusion
>    Affects Versions: 0.12.0
>            Reporter: Andy Grove
>            Assignee: Andy Grove
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.13.0
>
>
> If I run a query like the following:
> {code:java}
> SELECT MIN(fare_amount), MAX(fare_amount) FROM tripdata{code}
> I see this logical plan:
> {code:java}
> Logical plan: Aggregate: groupBy=[[]], aggr=[[MIN(#10), MAX(#10)]]
>   TableScan: tripdata projection=None{code}
>
> This means that every column is being loaded into arrays rather than just the
> two columns that I care about, resulting in terrible performance.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
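
For illustration only, the kind of rewrite the issue asks for might be sketched as follows. This is not DataFusion's code: the `Expr` and `Plan` types, and the `collect_columns`, `remap`, and `push_down_projection` functions, are hypothetical simplifications invented for this example. The sketch assumes the optimizer gathers the column indices referenced by the aggregate, stores them as the scan's projection, and renumbers the column references to match the narrower scan output.

```rust
use std::collections::BTreeSet;

/// Hypothetical, simplified expression type (not DataFusion's actual `Expr`).
#[derive(Debug, Clone, PartialEq)]
enum Expr {
    /// Column reference by index into the input schema, printed as `#10` in a plan.
    Column(usize),
    /// Aggregate function call such as MIN or MAX over one argument.
    Aggregate { name: String, arg: Box<Expr> },
}

/// Hypothetical, simplified logical plan (not DataFusion's actual `LogicalPlan`).
#[derive(Debug, Clone, PartialEq)]
enum Plan {
    Aggregate { group_by: Vec<Expr>, aggr: Vec<Expr>, input: Box<Plan> },
    TableScan { table: String, projection: Option<Vec<usize>> },
}

/// Record every column index that an expression reads.
fn collect_columns(expr: &Expr, out: &mut BTreeSet<usize>) {
    match expr {
        Expr::Column(i) => {
            out.insert(*i);
        }
        Expr::Aggregate { arg, .. } => collect_columns(arg, out),
    }
}

/// Rewrite column indices so they refer to positions in the projected schema.
fn remap(expr: &Expr, cols: &[usize]) -> Expr {
    match expr {
        Expr::Column(i) => {
            let pos = cols.iter().position(|c| c == i).expect("column was collected");
            Expr::Column(pos)
        }
        Expr::Aggregate { name, arg } => Expr::Aggregate {
            name: name.clone(),
            arg: Box::new(remap(arg, cols)),
        },
    }
}

/// Push the columns required by an Aggregate into an unprojected TableScan
/// directly below it. Any other plan shape is returned unchanged.
fn push_down_projection(plan: &Plan) -> Plan {
    match plan {
        Plan::Aggregate { group_by, aggr, input } => {
            let mut needed = BTreeSet::new();
            for e in group_by.iter().chain(aggr.iter()) {
                collect_columns(e, &mut needed);
            }
            // Sorted, de-duplicated list of table columns the query reads.
            let cols: Vec<usize> = needed.into_iter().collect();
            match input.as_ref() {
                Plan::TableScan { table, projection: None } => Plan::Aggregate {
                    group_by: group_by.iter().map(|e| remap(e, &cols)).collect(),
                    aggr: aggr.iter().map(|e| remap(e, &cols)).collect(),
                    input: Box::new(Plan::TableScan {
                        table: table.clone(),
                        projection: Some(cols),
                    }),
                },
                _ => plan.clone(),
            }
        }
        other => other.clone(),
    }
}
```

Under these assumptions, the plan quoted above (MIN(#10) and MAX(#10) over an unprojected scan of tripdata) would become an aggregate over column #0 of a scan whose projection is Some([10]), so only the fare_amount column would need to be read from the CSV file.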