[ 
https://issues.apache.org/jira/browse/ARROW-9649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andy Grove closed ARROW-9649.
-----------------------------
      Assignee: Andy Grove
    Resolution: Not A Problem

They updated the blog. It now shows that DataFusion is "only" 3x slower than 
Pandas.

> [Rust] [DataFusion] Investigate poor performance cited in towardsdatascience 
> blog post
> --------------------------------------------------------------------------------------
>
>                 Key: ARROW-9649
>                 URL: https://issues.apache.org/jira/browse/ARROW-9649
>             Project: Apache Arrow
>          Issue Type: Task
>          Components: Rust, Rust - DataFusion
>    Affects Versions: 1.0.0
>            Reporter: Andy Grove
>            Assignee: Andy Grove
>            Priority: Major
>
> According to a recently published blog post [1] DataFuson is ~20x slower than 
> Pandas for some simple queries against a tiny data set. I think it would be 
> good to try and reproduce these results to understand why performance is so 
> bad.
>  [1] 
> https://towardsdatascience.com/data-processing-in-rust-with-datafusion-arrow-56df5432de68



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to