avantgardnerio commented on code in PR #6639: URL: https://github.com/apache/arrow-datafusion/pull/6639#discussion_r1226962856
##########
docs/source/user-guide/introduction.md:
##########
@@ -34,37 +46,47 @@ DataFusion offers SQL and Dataframe APIs, excellent [performance](https://benchm
 - Many extension points: user defined scalar/aggregate/window functions, DataSources, SQL, other query languages, custom plan and execution nodes, optimizer passes, and more.
 - Streaming, asynchronous IO directly from popular object stores, including AWS S3,
-  Azure Blob Storage, and Google Cloud Storage. Other storage systems are supported via the
-  `ObjectStore` trait.
+  Azure Blob Storage, and Google Cloud Storage (Other storage systems are supported via the
+  `ObjectStore` trait).
 - [Excellent Documentation](https://docs.rs/datafusion/latest) and a [welcoming community](https://arrow.apache.org/datafusion/contributor-guide/communication.html).
-- A state of the art query optimizer with projection and filter pushdown, sort aware optimizations,
-  automatic join reordering, expression coercion, and more.
-- Permissive Apache 2.0 License, Apache Software Foundation governance
-- Written in [Rust](https://www.rust-lang.org/), a modern system language with development
-  productivity similar to Java or Golang, the performance of C++, and
-  [loved by programmers everywhere](https://insights.stackoverflow.com/survey/2021#technology-most-loved-dreaded-and-wanted).
-- Support for [Substrait](https://substrait.io/) for query plan serialization, making it easier to integrate DataFusion
-  with other projects, and to pass plans across language boundaries.
+- A state of the art query optimizer with expression coercion and
+  simplification, projection and filter pushdown, sort and distribution
+  aware optimizations, automatic join reordering, and more.
+- Permissive Apache 2.0 License, predictable and well understood
+  [Apache Software Foundation](https://www.apache.org/) governance.
+- Implementation in [Rust](https://www.rust-lang.org/), a modern
+  system language with development productivity similar to Java or
+  Golang, the performance of C++, and [loved by programmers
+  everywhere](https://insights.stackoverflow.com/survey/2021#technology-most-loved-dreaded-and-wanted).
+- Support for [Substrait](https://substrait.io/) query plans, to
+  easily pass plans across language and system boundaries.
 
 ## Use Cases
 
 DataFusion can be used without modification as an embedded SQL engine or can be customized and used as a foundation for
-building new systems. Here are some examples of systems built using DataFusion:
+building new systems.
+
+While most current usecases are "analytic" or (throughput) some

Review Comment:
I'm not sure I could say it any better.
