[
https://issues.apache.org/jira/browse/HDDS-8711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17788004#comment-17788004
]
Attila Doroszlai commented on HDDS-8711:
----------------------------------------
One OM intermittently exits when restarted with old version in upgrade test:
{code}
2023-09-28 10:05:24,411 [pool-25-thread-1] ERROR om.OzoneManager: Terminating
with exit status 1: Failed to reload OM state and instantiate services.
org.apache.hadoop.metrics2.MetricsException: Metrics source
OMPerformanceMetrics already exists!
at
org.apache.hadoop.metrics2.lib.DefaultMetricsSystem.newSourceName(DefaultMetricsSystem.java:152)
at
org.apache.hadoop.metrics2.lib.DefaultMetricsSystem.sourceName(DefaultMetricsSystem.java:125)
at
org.apache.hadoop.metrics2.impl.MetricsSystemImpl.register(MetricsSystemImpl.java:229)
at
org.apache.hadoop.ozone.om.OMPerformanceMetrics.register(OMPerformanceMetrics.java:33)
at
org.apache.hadoop.ozone.om.OzoneManager.instantiateServices(OzoneManager.java:726)
at
org.apache.hadoop.ozone.om.OzoneManager.reloadOMState(OzoneManager.java:3981)
at
org.apache.hadoop.ozone.om.OzoneManager.installCheckpoint(OzoneManager.java:3835)
at
org.apache.hadoop.ozone.om.OzoneManager.installCheckpoint(OzoneManager.java:3747)
at
org.apache.hadoop.ozone.om.OzoneManager.installSnapshotFromLeader(OzoneManager.java:3724)
at
org.apache.hadoop.ozone.om.ratis.OzoneManagerStateMachine.lambda$6(OzoneManagerStateMachine.java:478)
{code}
The bug has been fixed in HDDS-8647, but it still affects the "downgrade"
scenario being tested, since old version doesn't have the fix.
> Intermittent timeout of Prepare Ozone Manager in upgrade acceptance test
> ------------------------------------------------------------------------
>
> Key: HDDS-8711
> URL: https://issues.apache.org/jira/browse/HDDS-8711
> Project: Apache Ozone
> Issue Type: Sub-task
> Components: OM HA
> Affects Versions: 1.3.0
> Reporter: Attila Doroszlai
> Assignee: Attila Doroszlai
> Priority: Major
> Attachments: acceptance-compat.zip
>
>
> {code}
> Prepare Ozone Manager | FAIL |
> Test timeout 5 minutes exceeded.
> {code}
> *
> https://github.com/adoroszlai/ozone-build-results/blob/master/2023/03/29/21166/acceptance-compat/output.log
> *
> https://github.com/adoroszlai/ozone-build-results/blob/master/2023/04/16/21575/acceptance-compat/output.log
> *
> https://github.com/adoroszlai/ozone-build-results/blob/master/2023/04/18/21622/acceptance-compat/output.log
> *
> https://github.com/adoroszlai/ozone-build-results/blob/master/2023/04/18/21627/acceptance-compat/output.log
> *
> https://github.com/adoroszlai/ozone-build-results/blob/master/2023/05/09/22222/acceptance-compat/output.log
> *
> https://github.com/adoroszlai/ozone-build-results/blob/master/2023/05/18/22481/acceptance-compat/output.log
> *
> https://github.com/adoroszlai/ozone-build-results/blob/master/2023/05/18/22512/acceptance-compat/output.log
> *
> https://github.com/adoroszlai/ozone-build-results/blob/master/2023/05/19/22533/acceptance-compat/output.log
> *
> https://github.com/adoroszlai/ozone-build-results/blob/master/2023/05/27/22762/acceptance-compat/output.log
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]