[
https://issues.apache.org/jira/browse/ARROW-15785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17498144#comment-17498144
]
Jonathan Keane commented on ARROW-15785:
----------------------------------------
Do the Python [1] and R [2] benchmarks for single file reads do this?
Oddly(?) The python benchmarks do show a jump around January:
https://conbench.ursa.dev/benchmarks/8c5cc1a939d8485eb6c42af83f82c8c0/
https://conbench.ursa.dev/benchmarks/1b8d2dae6f664fd19579071a7cf7766b/
But the corresponding R ones do not:
https://conbench.ursa.dev/benchmarks/ca493bf17af84ae5babd97f385b69afc/
[1]
https://github.com/ursacomputing/benchmarks/blob/main/benchmarks/file_benchmark.py
[2] https://github.com/ursacomputing/arrowbench/blob/main/R/bm-read-file.R
> [Benchmarks] Add conbench benchmark for single-file parquet reads
> -----------------------------------------------------------------
>
> Key: ARROW-15785
> URL: https://issues.apache.org/jira/browse/ARROW-15785
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Benchmarking
> Reporter: Weston Pace
> Assignee: Weston Pace
> Priority: Major
>
> Release 7.0.0 introduced a regression in parquet single file reads. We
> should add a macro-level benchmark that does single-file reads to help us
> detect this in the future.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)