[ https://issues.apache.org/jira/browse/ARROW-15785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17498144#comment-17498144 ]

Jonathan Keane commented on ARROW-15785:
----------------------------------------

Do the Python [1] and R [2] benchmarks for single-file reads do this?

Oddly(?), the Python benchmarks do show a jump around January:
https://conbench.ursa.dev/benchmarks/8c5cc1a939d8485eb6c42af83f82c8c0/
https://conbench.ursa.dev/benchmarks/1b8d2dae6f664fd19579071a7cf7766b/

But the corresponding R ones do not: 
https://conbench.ursa.dev/benchmarks/ca493bf17af84ae5babd97f385b69afc/

[1] https://github.com/ursacomputing/benchmarks/blob/main/benchmarks/file_benchmark.py
[2] https://github.com/ursacomputing/arrowbench/blob/main/R/bm-read-file.R

> [Benchmarks] Add conbench benchmark for single-file parquet reads
> -----------------------------------------------------------------
>
>                 Key: ARROW-15785
>                 URL: https://issues.apache.org/jira/browse/ARROW-15785
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Benchmarking
>            Reporter: Weston Pace
>            Assignee: Weston Pace
>            Priority: Major
>
> Release 7.0.0 introduced a regression in Parquet single-file reads. We 
> should add a macro-level benchmark that does single-file reads to help us 
> detect such regressions in the future.



