[
https://issues.apache.org/jira/browse/ARROW-16014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Antoine Pitrou resolved ARROW-16014.
------------------------------------
Fix Version/s: 8.0.0
Resolution: Fixed
Issue resolved by pull request 12755
[https://github.com/apache/arrow/pull/12755]
> [C++] Create more benchmarks for measuring expression evaluation overhead
> -------------------------------------------------------------------------
>
> Key: ARROW-16014
> URL: https://issues.apache.org/jira/browse/ARROW-16014
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: Weston Pace
> Assignee: Sanjiban Sengupta
> Priority: Major
> Labels: pull-request-available
> Fix For: 8.0.0
>
> Time Spent: 6.5h
> Remaining Estimate: 0h
>
> We have a lot of great benchmarks on individual kernels but I don't know if
> we have enough benchmarks on the function & expression evaluation itself.
> There are some benchmarks (function_benchmark.cc) which measure this but I
> would like to get a better sense of a "bytes per second" number from the
> function system for a trivial function (e.g. an identity function that simply
> returns the values as is).
> In addition, we should measure overhead for common tasks like preallocation,
> etc.
> I would also like these benchmarks to be parameterized by batch size. Being
> able to run on small batches enables (in theory) better cache utilization. I
> suspect that the overhead here may start to become a bottleneck.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)