[
https://issues.apache.org/jira/browse/ARROW-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated ARROW-10240:
-----------------------------------
Labels: pull-request-available (was: )
> [Rust] [Datafusion] Optionally load tpch data into memory before running
> benchmark query
> ----------------------------------------------------------------------------------------
>
> Key: ARROW-10240
> URL: https://issues.apache.org/jira/browse/ARROW-10240
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Rust, Rust - DataFusion
> Reporter: Jörn Horstmann
> Assignee: Jörn Horstmann
> Priority: Minor
> Labels: pull-request-available
> Time Spent: 10m
> Remaining Estimate: 0h
>
> The tpch benchmark runtime seems to be dominated by csv parsing code and it
> is really difficult to see any performance hotspots related to actual query
> execution in a flamegraph.
> With the date in memory and more iterations it should be easier to profile
> and find bottlenecks.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)