alamb commented on a change in pull request #1166: URL: https://github.com/apache/arrow-datafusion/pull/1166#discussion_r734961145
########## File path: docs/source/specification/roadmap.md ########## @@ -92,8 +92,28 @@ Note: There are some additional thoughts on a datafusion-cli vision on [#1096](h - publishing to apt, brew, and possible NuGet registry so that people can use it more easily - adopt a shorter name, like dfcli? -## Ballista +# Ballista -# Vision +Ballista is a distributed compute platform based on Apache Arrow and DataFusion. It provides a query scheduler that +breaks a physical plan into stages and tasks and then schedules tasks for execution across the available executors +in the cluster. -TBD +Having Ballista as part of the DataFusion codebase helps ensure that DataFusion remains suitable for distributed +compute. For example, it helps ensure that physical query plans can be serialized to protobuf format and that they Review comment: thank you for this context -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org