alamb opened a new issue, #2427:
URL: https://github.com/apache/arrow-datafusion/issues/2427

   **Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
   I plan to make sorting / merging faster. My reasons:
   
   1. I find it personally interesting
   2. It is a key piece of technology for bringing DataFusion's performance on par with systems like DuckDB
   3. It is important for my project, IOx, in the medium term
   
   **Describe the solution you'd like**
   Basically, the plan is to follow the advice given by Goetz Graefe in [Implementing sorting in database systems](https://dl.acm.org/doi/10.1145/1132960.1132964), which has been successfully implemented in systems like DuckDB (see [blog post](https://duckdb.org/2021/08/27/external-sorting.html)).
   
   It will likely involve some combination of a specialization of the row 
format and JIT comparisons
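
To illustrate the row-format half of that idea, here is a minimal sketch, not DataFusion's actual row format. It assumes a hypothetical two-column sort key `(i32, string)`, both ascending, and encodes each row into bytes whose lexicographic order matches the desired sort order, so rows can be compared with a plain byte comparison instead of a column-by-column dispatch. The names `encode_row` and `sort_by_row_format` are made up for this example.

```rust
/// Encode a hypothetical `(i32, &str)` sort key as order-preserving bytes.
/// Assumption: both columns sort ascending, and the string is the last
/// column, so it needs no terminator or escaping.
pub fn encode_row(a: i32, b: &str) -> Vec<u8> {
    let mut row = Vec::with_capacity(4 + b.len());
    // Flip the sign bit and store big-endian so that unsigned byte-wise
    // comparison orders negative values before positive ones.
    row.extend_from_slice(&((a as u32) ^ 0x8000_0000).to_be_bytes());
    // UTF-8 bytes already compare in code point order.
    row.extend_from_slice(b.as_bytes());
    row
}

/// Return the indices of `input` in sorted order, comparing only the
/// encoded rows (a lexicographic byte comparison).
pub fn sort_by_row_format(input: &[(i32, &str)]) -> Vec<usize> {
    let rows: Vec<Vec<u8>> = input.iter().map(|(a, b)| encode_row(*a, b)).collect();
    let mut indices: Vec<usize> = (0..input.len()).collect();
    indices.sort_by(|&l, &r| rows[l].cmp(&rows[r]));
    indices
}
```

A real implementation would also need to handle nulls, descending order, and variable-length columns that are not last in the key (which require a terminator or escaping); those are left out of this sketch.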
   
   Here is my rough plan and a sketch of the kinds of things I want to work on
   - [ ] Benchmarks
   - [ ] POC of comparing using row format
   - [ ] Add full type support for row format comparisons
   - [ ] Turn the POC into a real implementation
   - [ ] #2150
   - [ ] #2151
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org
