Andy Grove created ARROW-4589:
---------------------------------

             Summary: [Rust] [DataFusion] Aggregate query does not push down projection to CSV file
                 Key: ARROW-4589
                 URL: https://issues.apache.org/jira/browse/ARROW-4589
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Rust - DataFusion
    Affects Versions: 0.12.0
            Reporter: Andy Grove
            Assignee: Andy Grove
             Fix For: 0.13.0

If I run a query like the following:

{code:java}
SELECT MIN(fare_amount), MAX(fare_amount) FROM tripdata
{code}

I see this logical plan:

{code:java}
Logical plan:
Aggregate: groupBy=[[]], aggr=[[MIN(#10), MAX(#10)]]
  TableScan: tripdata projection=None
{code}

This means that every column is being loaded into arrays rather than just the one column (fare_amount, #10) that the query references, which results in terrible performance.
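
To illustrate the push-down being asked for, here is a minimal sketch in Rust. It is not DataFusion's actual code: the `Expr` and `LogicalPlan` types, and the `collect_columns`, `push_down_projection` and `example_plan` functions, are simplified, hypothetical stand-ins invented for this example. The idea is to walk the plan from the top, collect every column index the expressions reference, and hand that set to the table scan in place of `projection=None`.

```rust
use std::collections::BTreeSet;

/// Hypothetical, simplified expression type (not DataFusion's real `Expr`).
#[derive(Debug, Clone, PartialEq)]
enum Expr {
    /// Column reference by index into the table schema, shown as `#10` in the plan.
    Column(usize),
    /// Aggregate function over a single argument, e.g. `MIN(#10)`.
    AggregateFunction { name: String, arg: Box<Expr> },
}

/// Hypothetical, simplified logical plan with only the two nodes from this issue.
#[derive(Debug, Clone, PartialEq)]
enum LogicalPlan {
    Aggregate {
        input: Box<LogicalPlan>,
        group_expr: Vec<Expr>,
        aggr_expr: Vec<Expr>,
    },
    TableScan {
        table_name: String,
        /// `None` means "read every column".
        projection: Option<Vec<usize>>,
    },
}

/// Record every column index referenced by `expr` in `acc`.
fn collect_columns(expr: &Expr, acc: &mut BTreeSet<usize>) {
    match expr {
        Expr::Column(i) => {
            acc.insert(*i);
        }
        Expr::AggregateFunction { arg, .. } => collect_columns(arg, acc),
    }
}

/// Rebuild `plan` so that the table scan only reads the columns that the
/// nodes above it reference. `required` accumulates those column indices.
fn push_down_projection(plan: &LogicalPlan, required: &mut BTreeSet<usize>) -> LogicalPlan {
    match plan {
        LogicalPlan::Aggregate { input, group_expr, aggr_expr } => {
            for e in group_expr.iter().chain(aggr_expr.iter()) {
                collect_columns(e, required);
            }
            LogicalPlan::Aggregate {
                input: Box::new(push_down_projection(input, required)),
                group_expr: group_expr.clone(),
                aggr_expr: aggr_expr.clone(),
            }
        }
        LogicalPlan::TableScan { table_name, .. } => LogicalPlan::TableScan {
            table_name: table_name.clone(),
            // Sorted, de-duplicated column indices collected on the way down.
            projection: Some(required.iter().copied().collect()),
        },
    }
}

/// The plan from this issue: MIN(#10), MAX(#10) over an unprojected scan.
fn example_plan() -> LogicalPlan {
    let agg = |name: &str| Expr::AggregateFunction {
        name: name.to_string(),
        arg: Box::new(Expr::Column(10)),
    };
    LogicalPlan::Aggregate {
        input: Box::new(LogicalPlan::TableScan {
            table_name: "tripdata".to_string(),
            projection: None,
        }),
        group_expr: vec![],
        aggr_expr: vec![agg("MIN"), agg("MAX")],
    }
}
```

Calling `push_down_projection(&example_plan(), &mut BTreeSet::new())` leaves the aggregate node unchanged and turns the scan's projection into `Some([10])`. The sketch deliberately stops there: once the scan returns only the projected columns, a real implementation would presumably also have to rewrite the column indices in the aggregate expressions (here `#10` would become `#0`), and it would need a policy for queries that reference no columns at all.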