jackwener commented on code in PR #4903: URL: https://github.com/apache/arrow-datafusion/pull/4903#discussion_r1070299362
########## README.md: ########## @@ -21,34 +21,52 @@ <img src="docs/source/_static/images/DataFusion-Logo-Background-White.svg" width="256" alt="logo"/> -DataFusion is an extensible query planning, optimization, and execution framework, written in -Rust, that uses [Apache Arrow](https://arrow.apache.org) as its +DataFusion is very fast, extensible query engine, for building high quality data centric systems in +[Rust](http://rustlang.org), using the [Apache Arrow](https://arrow.apache.org) in-memory format. +DataFusion offers SQL and Dataframe APIs, excellent [performance](https://benchmark.clickhouse.com/), built in support for CSV, Parquet Json, and Avro, extensive customization, and a great community. + [](https://codecov.io/gh/apache/arrow-datafusion?branch=master) ## Features -- SQL query planner with support for multiple SQL dialects -- DataFrame API -- Parquet, CSV, JSON, and Avro file formats are supported natively. Custom - file formats can be supported by implementing a `TableProvider` trait. -- Supports popular object stores, including AWS S3, Azure Blob - Storage, and Google Cloud Storage. There are extension points for implementing - custom object stores. +- Feature rich [SQL support](https://arrow.apache.org/datafusion/user-guide/sql/index.html) and [DataFrame API](https://arrow.apache.org/datafusion/user-guide/dataframe.html) +- Blazingly fast, vectorized, multi-threaded, streaming execution engine. +- Native support for Parquet, CSV, JSON, and Avro file formats. Support + for custom file formats and non file datasources via the `TableProvider` trait. +- Many extension points: user defined scalar/aggregate/window functions, DataSources, SQL, + other query languages, custom plan and execution nodes, optimizer passes, and more. +- Streaming, asynchronous IO directly from popular object stores, including AWS S3, + Azure Blob Storage, and Google Cloud Storage. Other storage systems are supported via the + `ObjectStore` trait. +- [Excellent Documentation](https://docs.rs/datafusion/latest) and a + [welcoming community](https://arrow.apache.org/datafusion/community/communication.html). +- A state of the art query optimizer with projection and filter pushdown, sort aware optimizations, + automatic join reordering, expression coercion, and more. +- Permissive Apache 2.0 License, Apache Software Foundation governance +- Written in [Rust](https://www.rust-lang.org/), a modern system language with development + producticity similar to Java or golang, the performance of C++, and Review Comment: ```suggestion productivity similar to Java or Golang, the performance of C++, and ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
