[jira] [Updated] (ARROW-4748) [Rust] [DataFusion] GROUP BY performance could be optimized
[ https://issues.apache.org/jira/browse/ARROW-4748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ASF GitHub Bot updated ARROW-4748:
----------------------------------
    Labels: pull-request-available  (was: )

> [Rust] [DataFusion] GROUP BY performance could be optimized
> -----------------------------------------------------------
>
>                 Key: ARROW-4748
>                 URL: https://issues.apache.org/jira/browse/ARROW-4748
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Rust, Rust - DataFusion
>    Affects Versions: 0.12.0
>            Reporter: Andy Grove
>            Assignee: Andy Grove
>            Priority: Major
>              Labels: pull-request-available
>
> The logic to build the group by keys is row-based, performing an array
> downcast on every single group by value. This could be done in a columnar way
> instead.
>
> I also wonder if it is possible to avoid converting the result map to an
> array of map entries.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
[jira] [Updated] (ARROW-4748) [Rust] [DataFusion] GROUP BY performance could be optimized
[ https://issues.apache.org/jira/browse/ARROW-4748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andy Grove updated ARROW-4748:
------------------------------
    Fix Version/s:     (was: 0.14.0)

> [Rust] [DataFusion] GROUP BY performance could be optimized
> -----------------------------------------------------------
>
>                 Key: ARROW-4748
>                 URL: https://issues.apache.org/jira/browse/ARROW-4748
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Rust, Rust - DataFusion
>    Affects Versions: 0.12.0
>            Reporter: Andy Grove
>            Priority: Major
>
> The logic to build the group by keys is row-based, performing an array
> downcast on every single group by value. This could be done in a columnar way
> instead.
>
> I also wonder if it is possible to avoid converting the result map to an
> array of map entries.
[jira] [Updated] (ARROW-4748) [Rust] [DataFusion] GROUP BY performance could be optimized
[ https://issues.apache.org/jira/browse/ARROW-4748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andy Grove updated ARROW-4748:
------------------------------
    Fix Version/s:     (was: 0.13.0)
                   0.14.0

> [Rust] [DataFusion] GROUP BY performance could be optimized
> -----------------------------------------------------------
>
>                 Key: ARROW-4748
>                 URL: https://issues.apache.org/jira/browse/ARROW-4748
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Rust, Rust - DataFusion
>    Affects Versions: 0.12.0
>            Reporter: Andy Grove
>            Priority: Major
>             Fix For: 0.14.0
>
> The logic to build the group by keys is row-based, performing an array
> downcast on every single group by value. This could be done in a columnar way
> instead.
>
> I also wonder if it is possible to avoid converting the result map to an
> array of map entries.
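
For illustration only, the difference between row-based and columnar key building described in the issue can be sketched roughly as follows. This is not DataFusion's code and does not use the arrow crate: the `Array` trait, `Int32Array`, `StringArray`, `KeyValue`, and all three functions are hypothetical stand-ins, assumed here only to show where the downcasts happen. The row-based version downcasts once per row and column, while the columnar version downcasts once per column and then loops over the typed values.

```rust
use std::any::Any;
use std::collections::HashMap;

// Hypothetical stand-in for a dynamically typed column (not the arrow crate API).
trait Array {
    fn as_any(&self) -> &dyn Any;
    fn len(&self) -> usize;
}

struct Int32Array(Vec<i32>);
struct StringArray(Vec<String>);

impl Array for Int32Array {
    fn as_any(&self) -> &dyn Any { self }
    fn len(&self) -> usize { self.0.len() }
}

impl Array for StringArray {
    fn as_any(&self) -> &dyn Any { self }
    fn len(&self) -> usize { self.0.len() }
}

// One component of a GROUP BY key (hypothetical).
#[derive(Debug, Clone, PartialEq, Eq, Hash)]
enum KeyValue {
    Int32(i32),
    Utf8(String),
}

// Row-based: the downcast is repeated for every (row, column) pair.
fn keys_row_based(cols: &[Box<dyn Array>]) -> Vec<Vec<KeyValue>> {
    let rows = cols.first().map_or(0, |c| c.len());
    (0..rows)
        .map(|row| {
            cols.iter()
                .map(|col| {
                    let any = col.as_any();
                    if let Some(a) = any.downcast_ref::<Int32Array>() {
                        KeyValue::Int32(a.0[row])
                    } else if let Some(a) = any.downcast_ref::<StringArray>() {
                        KeyValue::Utf8(a.0[row].clone())
                    } else {
                        panic!("unsupported group by key type")
                    }
                })
                .collect()
        })
        .collect()
}

// Columnar: one downcast per column, then a loop over the typed values.
fn keys_columnar(cols: &[Box<dyn Array>]) -> Vec<Vec<KeyValue>> {
    let rows = cols.first().map_or(0, |c| c.len());
    let mut keys: Vec<Vec<KeyValue>> =
        (0..rows).map(|_| Vec::with_capacity(cols.len())).collect();
    for col in cols {
        let any = col.as_any();
        if let Some(a) = any.downcast_ref::<Int32Array>() {
            for (key, v) in keys.iter_mut().zip(&a.0) {
                key.push(KeyValue::Int32(*v));
            }
        } else if let Some(a) = any.downcast_ref::<StringArray>() {
            for (key, v) in keys.iter_mut().zip(&a.0) {
                key.push(KeyValue::Utf8(v.clone()));
            }
        } else {
            panic!("unsupported group by key type");
        }
    }
    keys
}

// Counts rows per key; the map is returned directly rather than
// being converted into a separate array of entries.
fn count_groups(keys: Vec<Vec<KeyValue>>) -> HashMap<Vec<KeyValue>, usize> {
    let mut counts = HashMap::new();
    for key in keys {
        *counts.entry(key).or_insert(0) += 1;
    }
    counts
}
```

Both functions produce the same keys for the same input; they differ only in how many downcasts are performed. Whether the columnar form is faster in practice would have to be measured against the actual DataFusion code.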