Andy Grove created ARROW-11056: ---------------------------------- Summary: [Rust] [DataFusion] Allow ParquetExec to parallelize work based on row groups Key: ARROW-11056 URL: https://issues.apache.org/jira/browse/ARROW-11056 Project: Apache Arrow Issue Type: Improvement Components: Rust - DataFusion Reporter: Andy Grove
ParquetExec currently parallelizes work by passinging individual files to threads. It would be nice to be able to do this in a finer-grained way by assigning row groups and/or column chunks instead. This will be especially important in distributed systems built on DataFusion. -- This message was sent by Atlassian Jira (v8.3.4#803005)