[ 
https://issues.apache.org/jira/browse/ARROW-15785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17498146#comment-17498146
 ] 

Jonathan Keane edited comment on ARROW-15785 at 2/25/22, 2:16 PM:
------------------------------------------------------------------

I think this is the PR that introduced the regression (though I might be 
totally off or it's a different regression...) 

https://github.com/apache/arrow/pull/11991#issuecomment-1009216946

And the conbench run: 
https://conbench.ursa.dev/compare/runs/c4d5e65d088243259e5198f4c0e219c9...5a1c693586c74471b7c8ba775005db54/

We should probably have the conbench bot alert more loudly that there are 
regressions of this magnitude. That 5% there is supposed to indicate that 
there's an issue, but we might have that set too low such that there's alarm 
fatigue or|and we should alert louder when there are this many high-change 
benchmarks (e.g. the file-read benchmark z-scores range from a -76 to -759, and 
we alert at -5) 


was (Author: jonkeane):
I think this is the PR that introduced the regression (though I might be 
totally off or it's a different one...) 

https://github.com/apache/arrow/pull/11991#issuecomment-1009216946

And the conbench run: 
https://conbench.ursa.dev/compare/runs/c4d5e65d088243259e5198f4c0e219c9...5a1c693586c74471b7c8ba775005db54/

We should probably have the conbench bot alert more loudly that there are 
regressions of this magnitude. That 5% there is supposed to indicate that 
there's an issue, but we might have that set too low such that there's alarm 
fatigue or|and we should alert louder when there are this many high-change 
benchmarks (e.g. the file-read benchmark z-scores range from a -76 to -759, and 
we alert at -5) 

> [Benchmarks] Add conbench benchmark for single-file parquet reads
> -----------------------------------------------------------------
>
>                 Key: ARROW-15785
>                 URL: https://issues.apache.org/jira/browse/ARROW-15785
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Benchmarking
>            Reporter: Weston Pace
>            Assignee: Weston Pace
>            Priority: Major
>
> Release 7.0.0 introduced a regression in parquet single file reads.  We 
> should add a macro-level benchmark that does single-file reads to help us 
> detect this in the future.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to