[
https://issues.apache.org/jira/browse/BEAM-14163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17511931#comment-17511931
]
Brian Hulette edited comment on BEAM-14163 at 3/24/22, 3:59 PM:
----------------------------------------------------------------
I think
https://ci-beam.apache.org/job/beam_LoadTests_Python_ParDo_Dataflow_Streaming
is the job that generates the ParDo numbers. It would be nice to identify the
specific first run with the regression.
https://ci-beam.apache.org/job/beam_LoadTests_Python_ParDo_Dataflow_Streaming/546/
was run on 03/17 at 1PM UTC, the regression is reported at 03/17 at 5PM (no
timezone). I'm not sure if there's a timezone that makes that make sense.
If the regression is at 546 the only python relevant changes are
https://github.com/apache/beam/commit/02d9657f68fc60bae9704eef0cc98810ea2b143f
and
https://github.com/apache/beam/commit/9c17960d748b56af4915a3fd1c618b470b0521c3
was (Author: bhulette):
I think
https://ci-beam.apache.org/job/beam_LoadTests_Python_ParDo_Dataflow_Streaming
is the job that generates the ParDo numbersIt would be nice to identify the
specific first run with the regression,
https://ci-beam.apache.org/job/beam_LoadTests_Python_ParDo_Dataflow_Streaming/546/
was run on 03/17 at 1PM UTC, the regression is reported at 03/17 at 5PM (no
timezone). I'm not sure if there's a timezone that makes that make sense.
If the regression is at 546 the only python relevant changes are
https://github.com/apache/beam/commit/02d9657f68fc60bae9704eef0cc98810ea2b143f
and
https://github.com/apache/beam/commit/9c17960d748b56af4915a3fd1c618b470b0521c3
> Performance Regressions in streaming python ParDo and GBK Load Tests
> --------------------------------------------------------------------
>
> Key: BEAM-14163
> URL: https://issues.apache.org/jira/browse/BEAM-14163
> Project: Beam
> Issue Type: Bug
> Components: community-metrics, sdk-py-core
> Affects Versions: 2.38.0
> Reporter: Daniel Oliveira
> Priority: P0
> Fix For: 2.38.0
>
>
> As specified in the [Beam Release
> Guide|https://beam.apache.org/contribute/release-guide/#4-investigate-performance-regressions],
> I'm investigating performance regressions. The following load test metrics
> show a clear and persistant performance regression starting approximately
> around March 17 and affecting version 2.38.0.
> ParDo Load Tests:
> http://metrics.beam.apache.org/d/MOi-kf3Zk/pardo-load-tests?orgId=1&var-processingType=streaming&var-sdk=python
> GBK Load Tests:
> http://metrics.beam.apache.org/d/UYZ-oJ3Zk/gbk-load-tests?orgId=1&var-processingType=streaming&var-sdk=python&from=now-30d&to=now
--
This message was sent by Atlassian Jira
(v8.20.1#820001)