alamb opened a new issue, #440:
URL: https://github.com/apache/datafusion-python/issues/440

   ## What this project could be
   
   I think this project needs someone who wants to make a world class python 
dataframe library and user experience take the helm.  I will argue why I think 
this is a compelling opportunity to make a great piece of technology and have a 
wide impact across the data analytic space:
   
   ## What this project could be
   
   I think this project could be one of the most widely used data analysis 
libraries out there. Imagine a system that allows **BOTH** a fast dataframe API 
(ala pol.rs) but also first class  SQL support (ala duckdb) that are both 
screaming fast (due to all the effort that goes into 
https://github.com/apache/arrow-datafusion) as well as easy to plug into the 
eco system (arrow / parquet) and extensible (UDFS, UDAs, etc)
   
   [DataFusion](https://arrow.apache.org/datafusion/) already posts great 
benchmark numbers, and I will post datafusion 28.0.0 benchmark when we have 
them.
   
   ## How is this different than the mission of DataFusion?
   [DataFusion](https://arrow.apache.org/datafusion/) is a great project but is 
currently focused on building the core analytic engine:
   
   > DataFusion is a very fast, extensible query engine for building 
high-quality data-centric systems in [Rust](http://rustlang.org/), using the 
[Apache Arrow](https://arrow.apache.org/) in-memory format.
   
   
![image](https://github.com/apache/arrow-datafusion-python/assets/490673/16941b00-830a-4488-adf5-13a7e86be977)
   
   This repository contains basic python bindings, but the user experience (UX) 
could be improved in so many ways. 
   
   
   ## The opportunity
   
   This would be a great opportunity for someone to:
   
   1. Build some really cool technology
   1. Learn how to help grow an open source project and community with help and 
guidance from the rest of the DataFusion community
   2. Learn about analytic database technology, Arrow, etc
   3. Influence the direction of Development in DataFusion
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to