[jira] [Resolved] (HDDS-4186) Adjust RetryPolicy of SCMConnectionManager for SCM/Recon

2020-09-04 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng resolved HDDS-4186. - Resolution: Fixed > Adjust RetryPolicy of SCMConnectionManager for SCM/Recon >

[jira] [Updated] (HDDS-4192) enable SCM Raft Group based on config ozone.scm.names

2020-09-02 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4192: Description: (was:   Fix a bug in https://issues.apache.org/jira/browse/HDDS-3895  In

[jira] [Created] (HDDS-4192) enable SCM Raft Group based on config ozone.scm.names

2020-09-02 Thread Glen Geng (Jira)
Glen Geng created HDDS-4192: --- Summary: enable SCM Raft Group based on config ozone.scm.names Key: HDDS-4192 URL: https://issues.apache.org/jira/browse/HDDS-4192 Project: Hadoop Distributed Data Store

[jira] [Updated] (HDDS-4192) enable SCM Raft Group based on config ozone.scm.names

2020-09-02 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4192: Labels: (was: pull-request-available) > enable SCM Raft Group based on config ozone.scm.names >

[jira] [Updated] (HDDS-4192) enable SCM Raft Group based on config ozone.scm.names

2020-09-02 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4192: Description:   Say ozone.scm.names is "ip1,ip2,ip3", scm with ip1 identifies its RaftPeerId as scm1,  scm

[jira] [Updated] (HDDS-4192) enable SCM Raft Group based on config ozone.scm.names

2020-09-02 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4192: Description:   Say ozone.scm.names is "ip1,ip2,ip3", scm with ip1 identifies its RaftPeerId as scm1,  scm

[jira] [Updated] (HDDS-4186) Adjust RetryPolicy of SCMConnectionManager for SCM/Recon

2020-09-01 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4186: Description: *The problem is:* If setup one Recon and one SCM, then shutdown the Recon server, all

[jira] [Updated] (HDDS-4186) Adjust RetryPolicy of SCMConnectionManager

2020-09-01 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4186: Target Version/s: (was: 0.7.0) > Adjust RetryPolicy of SCMConnectionManager >

[jira] [Updated] (HDDS-4186) Adjust RetryPolicy of SCMConnectionManager

2020-09-01 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4186: Summary: Adjust RetryPolicy of SCMConnectionManager (was: CLONE - Improve performance of the BufferPool

[jira] [Updated] (HDDS-4186) Adjust RetryPolicy of SCMConnectionManager

2020-09-01 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4186: Labels: (was: pull-request-available) > Adjust RetryPolicy of SCMConnectionManager >

[jira] [Updated] (HDDS-4186) Adjust RetryPolicy of SCMConnectionManager for SCM/Recon

2020-09-01 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4186: Summary: Adjust RetryPolicy of SCMConnectionManager for SCM/Recon (was: Adjust RetryPolicy of

[jira] [Updated] (HDDS-4186) Adjust RetryPolicy of SCMConnectionManager for SCM/Recon

2020-09-01 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4186: Component/s: Ozone Datanode > Adjust RetryPolicy of SCMConnectionManager for SCM/Recon >

[jira] [Assigned] (HDDS-4186) CLONE - Improve performance of the BufferPool management of Ozone client

2020-09-01 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng reassigned HDDS-4186: --- Assignee: Glen Geng (was: Marton Elek) > CLONE - Improve performance of the BufferPool management

[jira] [Created] (HDDS-4186) CLONE - Improve performance of the BufferPool management of Ozone client

2020-09-01 Thread Glen Geng (Jira)
Glen Geng created HDDS-4186: --- Summary: CLONE - Improve performance of the BufferPool management of Ozone client Key: HDDS-4186 URL: https://issues.apache.org/jira/browse/HDDS-4186 Project: Hadoop

[jira] [Updated] (HDDS-4186) Adjust RetryPolicy of SCMConnectionManager

2020-09-01 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4186: Description: Current RetryPolicy of Datanode for SCM is retryForeverWithFixedSleep: {code:java} RetryPolicy

[jira] [Updated] (HDDS-4186) Adjust RetryPolicy of SCMConnectionManager

2020-09-01 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4186: Description: Current RetryPolicy of Datanode for SCM is retryForeverWithFixedSleep: {code:java} RetryPolicy

[jira] [Updated] (HDDS-4186) Adjust RetryPolicy of SCMConnectionManager

2020-09-01 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4186: Description: Current RetryPolicy of Datanode for SCM is retryForeverWithFixedSleep: {code:java} RetryPolicy

[jira] [Updated] (HDDS-4186) Adjust RetryPolicy of SCMConnectionManager for SCM/Recon

2020-09-01 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4186: Description: *The problem is:* If setup one Recon and one SCM, then shutdown the Recon server, all

[jira] [Created] (HDDS-4230) CLONE - Add failover proxy to SCM block protocol

2020-09-10 Thread Glen Geng (Jira)
Glen Geng created HDDS-4230: --- Summary: CLONE - Add failover proxy to SCM block protocol Key: HDDS-4230 URL: https://issues.apache.org/jira/browse/HDDS-4230 Project: Hadoop Distributed Data Store

[jira] [Updated] (HDDS-4230) SCMBlockLocationFailoverProxyProvider should handle LeaderNotReadyException

2020-09-10 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4230: Description: like OMFailoverProxyProvider,  SCMBlockLocationFailoverProxyProvider should also handle

[jira] [Updated] (HDDS-4230) SCMBlockLocationFailoverProxyProvider should handle LeaderNotReadyException

2020-09-10 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4230: Description: like OMFailoverProxyProvider,  (was: Need to supports 2N + 1 SCMs. Add configs and logic to

[jira] [Updated] (HDDS-4230) SCMBlockLocationFailoverProxyProvider should handle LeaderNotReadyException

2020-09-10 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4230: Summary: SCMBlockLocationFailoverProxyProvider should handle LeaderNotReadyException (was: CLONE - Add

[jira] [Updated] (HDDS-4230) SCMBlockLocationFailoverProxyProvider should handle LeaderNotReadyException

2020-09-10 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4230: Description: It is an enhancement for HDDS-3188. Like OMFailoverProxyProvider,

[jira] [Assigned] (HDDS-4230) SCMBlockLocationFailoverProxyProvider should handle LeaderNotReadyException

2020-09-10 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng reassigned HDDS-4230: --- Assignee: (was: Li Cheng) > SCMBlockLocationFailoverProxyProvider should handle

[jira] [Updated] (HDDS-4228) add field 'num' to ALLOCATE_BLOCK of scm audit log.

2020-09-09 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4228: Labels: pull-requests-available (was: pull-request-available) > add field 'num' to ALLOCATE_BLOCK of scm

[jira] [Created] (HDDS-4228) ALLOCATE_BLOCK of scm audit log miss num

2020-09-09 Thread Glen Geng (Jira)
Glen Geng created HDDS-4228: --- Summary: ALLOCATE_BLOCK of scm audit log miss num Key: HDDS-4228 URL: https://issues.apache.org/jira/browse/HDDS-4228 Project: Hadoop Distributed Data Store Issue

[jira] [Assigned] (HDDS-4228) ALLOCATE_BLOCK of scm audit log miss num

2020-09-09 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng reassigned HDDS-4228: --- Assignee: Glen Geng (was: Marton Elek) > ALLOCATE_BLOCK of scm audit log miss num >

[jira] [Updated] (HDDS-4228) ALLOCATE_BLOCK of scm audit log miss num

2020-09-09 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4228: Labels: (was: pull-request-available) > ALLOCATE_BLOCK of scm audit log miss num >

[jira] [Updated] (HDDS-4228) ALLOCATE_BLOCK of scm audit log miss num

2020-09-09 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4228: Priority: Minor (was: Blocker) > ALLOCATE_BLOCK of scm audit log miss num >

[jira] [Updated] (HDDS-4228) add field 'num' to ALLOCATE_BLOCK of scm audit log.

2020-09-09 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4228: Description:   The scm audit log for ALLOCATE_BLOCK is as follows: {code:java} 2020-09-10 03:42:08,196 |

[jira] [Updated] (HDDS-4228) add field 'num' to ALLOCATE_BLOCK of scm audit log.

2020-09-09 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4228: Description:   The scm audit log for ALLOCATE_BLOCK is as follows: {code:java} 2020-09-10 03:42:08,196 |

[jira] [Updated] (HDDS-4228) add field 'num' to ALLOCATE_BLOCK of scm audit log.

2020-09-09 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4228: Summary: add field 'num' to ALLOCATE_BLOCK of scm audit log. (was: ALLOCATE_BLOCK of scm audit log miss

[jira] [Updated] (HDDS-4228) ALLOCATE_BLOCK of scm audit log miss num

2020-09-09 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4228: Description:   The sac {code:java} 2020-09-10 03:42:08,196 | INFO | SCMAudit | user=root |

[jira] [Commented] (HDDS-4107) replace scmID with clusterID for container and volume at Datanode side

2020-09-04 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17190638#comment-17190638 ] Glen Geng commented on HDDS-4107: - I am working on the upgrade issues for this PR, found that renaming the

[jira] [Comment Edited] (HDDS-4107) replace scmID with clusterID for container and volume at Datanode side

2020-09-04 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17190638#comment-17190638 ] Glen Geng edited comment on HDDS-4107 at 9/4/20, 9:06 AM: -- I am working on the

[jira] [Created] (HDDS-4351) DN crash while RatisApplyTransactionExecutor tries to putBlock to rocksDB

2020-10-14 Thread Glen Geng (Jira)
Glen Geng created HDDS-4351: --- Summary: DN crash while RatisApplyTransactionExecutor tries to putBlock to rocksDB Key: HDDS-4351 URL: https://issues.apache.org/jira/browse/HDDS-4351 Project: Hadoop

[jira] [Updated] (HDDS-4343) CLONE - OM client request fails with "failed to commit as key is not found in OpenKey table"

2020-10-14 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4343: Description: {code:java} // If there are unhealthy replicas, then we should remove them even if it

[jira] [Updated] (HDDS-4343) ReplicationManager.handleOverReplicatedContainer does not handle

2020-10-14 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4343: Summary: ReplicationManager.handleOverReplicatedContainer does not handle (was: CLONE - OM client request

[jira] [Updated] (HDDS-4343) CLONE - OM client request fails with "failed to commit as key is not found in OpenKey table"

2020-10-14 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4343: Component/s: (was: OM HA) SCM > CLONE - OM client request fails with "failed to commit

[jira] [Updated] (HDDS-4343) CLONE - OM client request fails with "failed to commit as key is not found in OpenKey table"

2020-10-14 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4343: Description: {code:java} 20/08/28 03:21:53 WARN retry.RetryInvocationHandler: A failover has occurred

[jira] [Assigned] (HDDS-4343) CLONE - OM client request fails with "failed to commit as key is not found in OpenKey table"

2020-10-14 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng reassigned HDDS-4343: --- Assignee: Glen Geng (was: Bharat Viswanadham) > CLONE - OM client request fails with "failed to

[jira] [Created] (HDDS-4343) CLONE - OM client request fails with "failed to commit as key is not found in OpenKey table"

2020-10-14 Thread Glen Geng (Jira)
Glen Geng created HDDS-4343: --- Summary: CLONE - OM client request fails with "failed to commit as key is not found in OpenKey table" Key: HDDS-4343 URL: https://issues.apache.org/jira/browse/HDDS-4343

[jira] [Updated] (HDDS-4343) ReplicationManager.handleOverReplicatedContainer() does not handle unhealthyReplicas properly.

2020-10-14 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4343: Summary: ReplicationManager.handleOverReplicatedContainer() does not handle unhealthyReplicas properly.

[jira] [Updated] (HDDS-4343) CLONE - OM client request fails with "failed to commit as key is not found in OpenKey table"

2020-10-14 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4343: Description: {code:java} // If there are unhealthy replicas, then we should remove them even if it

[jira] [Updated] (HDDS-4343) ReplicationManager.handleOverReplicatedContainer() does not handle unhealthyReplicas properly.

2020-10-14 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4343: Priority: Minor (was: Blocker) > ReplicationManager.handleOverReplicatedContainer() does not handle >

[jira] [Commented] (HDDS-4351) DN crash while RatisApplyTransactionExecutor tries to putBlock to rocksDB

2020-10-16 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17215219#comment-17215219 ] Glen Geng commented on HDDS-4351: - Hello [~erose] [~arp] [~bharat] As requested by Ethan, I scheduled a

[jira] [Resolved] (HDDS-4128) RATIS ONE Pipeline is closed but not removed when a datanode goes stale

2020-08-18 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng resolved HDDS-4128. - Resolution: Duplicate > RATIS ONE Pipeline is closed but not removed when a datanode goes stale >

[jira] [Commented] (HDDS-4128) RATIS ONE Pipeline is closed but not removed when a datanode goes stale

2020-08-18 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17179446#comment-17179446 ] Glen Geng commented on HDDS-4128: - close as duplicate with HDDS-4125. > RATIS ONE Pipeline is closed but

[jira] [Commented] (HDDS-4125) Pipeline is not removed when a datanode goes stale

2020-08-17 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17179352#comment-17179352 ] Glen Geng commented on HDDS-4125: - RATIS THREE pipeline, if stay in closed state, should be removed by

[jira] [Updated] (HDDS-4128) RATIS ONE Pipeline is closed but not removed when a datanode goes stale

2020-08-17 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4128: Description: Since the Scheduler in SCMPipelineManager that used to destroyPipeline is removed,

[jira] [Updated] (HDDS-4128) RATIS ONE Pipeline is closed but not removed when a datanode goes stale

2020-08-17 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4128: Description: RATIS ONE Pipeline is closed but not removed when a datanode goes stale.   Since the

[jira] [Updated] (HDDS-4128) RATIS ONE Pipeline is closed but not removed when a datanode goes stale

2020-08-18 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4128: Description: Since the Scheduler in SCMPipelineManager that used to destroyPipeline is removed,

[jira] [Commented] (HDDS-4125) Pipeline is not removed when a datanode goes stale

2020-08-18 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17179375#comment-17179375 ] Glen Geng commented on HDDS-4125: - Create a new Jira https://issues.apache.org/jira/browse/HDDS-4128 to

[jira] [Updated] (HDDS-4125) Pipeline is not removed when a datanode goes stale

2020-08-18 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4125: Description: When a node goes stale the pipelines in that node have to be closed and removed from

[jira] [Updated] (HDDS-4125) Pipeline is not removed when a datanode goes stale

2020-08-18 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4125: Description: When a node goes stale the pipelines in that node have to be closed and removed from

[jira] [Updated] (HDDS-4128) RATIS ONE Pipeline is closed but not removed when a datanode goes stale

2020-08-17 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4128: Description: (was: When a node goes stale the pipelines in that node have to be closed and removed from

[jira] [Updated] (HDDS-4128) RATIS ONE Pipeline is closed but not removed when a datanode goes stale

2020-08-17 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4128: Summary: RATIS ONE Pipeline is closed but not removed when a datanode goes stale (was: CLONE - Pipeline is

[jira] [Created] (HDDS-4128) CLONE - Pipeline is not removed when a datanode goes stale

2020-08-17 Thread Glen Geng (Jira)
Glen Geng created HDDS-4128: --- Summary: CLONE - Pipeline is not removed when a datanode goes stale Key: HDDS-4128 URL: https://issues.apache.org/jira/browse/HDDS-4128 Project: Hadoop Distributed Data Store

[jira] [Commented] (HDDS-4128) RATIS ONE Pipeline is closed but not removed when a datanode goes stale

2020-08-17 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17179374#comment-17179374 ] Glen Geng commented on HDDS-4128: - FYI [~nanda]  [~timmylicheng] related to  

[jira] [Resolved] (HDDS-4093) update RATIS version from 1.0.0 to 1.1.0-85281b2-SNAPSHOT

2020-08-18 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng resolved HDDS-4093. - Release Note: PR merged. Resolution: Fixed > update RATIS version from 1.0.0 to

[jira] [Created] (HDDS-4130) remove the 1st edition RatisServer of SCM HA, which is copied from OM HA

2020-08-19 Thread Glen Geng (Jira)
Glen Geng created HDDS-4130: --- Summary: remove the 1st edition RatisServer of SCM HA, which is copied from OM HA Key: HDDS-4130 URL: https://issues.apache.org/jira/browse/HDDS-4130 Project: Hadoop

[jira] [Updated] (HDDS-4130) remove the 1st edition of RatisServer of SCM HA which is copied from OM HA

2020-08-19 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4130: Summary: remove the 1st edition of RatisServer of SCM HA which is copied from OM HA (was: remove the 1st

[jira] [Updated] (HDDS-4130) remove the 1st edition RatisServer of SCM HA which is copied from OM HA

2020-08-19 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4130: Summary: remove the 1st edition RatisServer of SCM HA which is copied from OM HA (was: remove the 1st

[jira] [Updated] (HDDS-4130) remove the 1st edition RatisServer of SCM HA, which is copied from OM HA

2020-08-19 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4130: Description: The 1st edition RatisServer of SCM HA is copied  (was: The disk layout per volume is as

[jira] [Updated] (HDDS-4130) remove the 1st edition of RatisServer of SCM HA which is copied from OM HA

2020-08-19 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4130: Description: The 1st edition of RatisServer of SCM HA is copied from OM HA. This version abandoned, since

[jira] [Updated] (HDDS-4130) remove the 1st edition of RatisServer of SCM HA which is copied from OM HA

2020-08-19 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4130: Labels: (was: backward-incompatible pull-request-available upgrade) > remove the 1st edition of

[jira] [Updated] (HDDS-4130) remove the 1st edition of RatisServer of SCM HA which is copied from OM HA

2020-08-19 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4130: Description: The 1st edition of RatisServer of SCM HA is copied from OM HA. This version abandoned, since

[jira] [Updated] (HDDS-4130) remove the 1st edition of RatisServer of SCM HA which is copied from OM HA

2020-08-19 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4130: Description: The 1st edition of RatisServer of SCM HA is copied from OM HA. This version is abandoned,

[jira] [Resolved] (HDDS-4125) Pipeline is not removed when a datanode goes stale

2020-08-21 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng resolved HDDS-4125. - Resolution: Fixed > Pipeline is not removed when a datanode goes stale >

[jira] [Created] (HDDS-4136) CLONE - In ContainerStateManagerV2, modification of RocksDB should be consistent with that of memory state.

2020-08-24 Thread Glen Geng (Jira)
Glen Geng created HDDS-4136: --- Summary: CLONE - In ContainerStateManagerV2, modification of RocksDB should be consistent with that of memory state. Key: HDDS-4136 URL: https://issues.apache.org/jira/browse/HDDS-4136

[jira] [Updated] (HDDS-4136) Design for Error/Exception handling in state update for container/pipeline V2

2020-08-24 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4136: Summary: Design for Error/Exception handling in state update for container/pipeline V2 (was: Design for

[jira] [Updated] (HDDS-4136) Design for Error/Exception handling in state updates for container/pipeline V2

2020-08-24 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4136: Summary: Design for Error/Exception handling in state updates for container/pipeline V2 (was: CLONE - In

[jira] [Created] (HDDS-4135) In ContainerStateManagerV2, modification of RocksDB should be in consistency with that of memory state.

2020-08-24 Thread Glen Geng (Jira)
Glen Geng created HDDS-4135: --- Summary: In ContainerStateManagerV2, modification of RocksDB should be in consistency with that of memory state. Key: HDDS-4135 URL: https://issues.apache.org/jira/browse/HDDS-4135

[jira] [Updated] (HDDS-4136) Design for Error/Exception handling in state update for container/pipeline V2

2020-08-24 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4136: Description: I have a concern about how to handle exceptions occurred in writing RocksDB for container V2,

[jira] [Updated] (HDDS-4135) In ContainerStateManagerV2, modification of RocksDB should be consistent with that of memory state.

2020-08-24 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4135: Description:   Fix a bug in https://issues.apache.org/jira/browse/HDDS-3895  In ContainerStateManagerV2,

[jira] [Updated] (HDDS-4135) In ContainerStateManagerV2, modification of RocksDB should be consistent with that of memory state.

2020-08-24 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4135: Labels: (was: pull-request-available) > In ContainerStateManagerV2, modification of RocksDB should be

[jira] [Updated] (HDDS-4135) In ContainerStateManagerV2, modification of RocksDB should be in consistency with that of memory state.

2020-08-24 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4135: Description:   Fix a bug in https://issues.apache.org/jira/browse/HDDS-3895   In

[jira] [Updated] (HDDS-4135) In ContainerStateManagerV2, modification of RocksDB should be consistent with that of memory state.

2020-08-24 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4135: Summary: In ContainerStateManagerV2, modification of RocksDB should be consistent with that of memory

[jira] [Updated] (HDDS-4136) Design for Error/Exception handling in state update for container/pipeline V2

2020-08-24 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4136: Description: I have a concern about how to handling exceptions occurred in writing RocksDB for container

[jira] [Updated] (HDDS-4136) Design for Error/Exception handling in state update for container/pipeline V2

2020-08-24 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4136: Description: (was:   Fix a bug in https://issues.apache.org/jira/browse/HDDS-3895  In

[jira] [Updated] (HDDS-4136) Design for Error/Exception handling in state update for container/pipeline V2

2020-08-24 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4136: Description: I have a concern about how to handle exceptions occurred in writing RocksDB for container V2,

[jira] [Updated] (HDDS-4135) In ContainerStateManagerV2, modification of RocksDB should be in consistency with that of memory state.

2020-08-24 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4135: Description:   Fix a bug in https://issues.apache.org/jira/browse/HDDS-3895     was: The 1st edition

[jira] [Updated] (HDDS-4136) Design for Error/Exception handling in state update for container/pipeline V2

2020-08-24 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4136: Description: I have a concern about how to handle exceptions occurred in writing RocksDB for container V2,

[jira] [Updated] (HDDS-4136) Design for Error/Exception handling in state update for container/pipeline V2

2020-08-24 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4136: Description: I have a concern about how to handle exceptions occurred in writing RocksDB for container V2,

[jira] [Updated] (HDDS-4136) Design for Error/Exception handling in state update for container/pipeline V2

2020-08-24 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4136: Description: I have a concern about how to handle exceptions occurred in writing RocksDB for container V2,

[jira] [Updated] (HDDS-4136) Design for Error/Exception handling in state update for container/pipeline V2

2020-08-24 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4136: Description: I have a concern about how to handle exceptions occurred in writing RocksDB for container V2,

[jira] [Updated] (HDDS-4136) Design for Error/Exception handling in state update for container/pipeline V2

2020-08-24 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4136: Description: I have a concern about how to handle exceptions occurred in writing RocksDB for container V2,

[jira] [Commented] (HDDS-4125) Pipeline is not removed when a datanode goes stale

2020-08-17 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178919#comment-17178919 ] Glen Geng commented on HDDS-4125: - Since the thread pool that remove RATIS ONE pipeline is removed, we

[jira] [Comment Edited] (HDDS-4125) Pipeline is not removed when a datanode goes stale

2020-08-17 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178919#comment-17178919 ] Glen Geng edited comment on HDDS-4125 at 8/17/20, 11:32 AM: Since the thread

[jira] [Commented] (HDDS-4125) Pipeline is not removed when a datanode goes stale

2020-08-17 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178917#comment-17178917 ] Glen Geng commented on HDDS-4125: - Hi Nanda, I encountered this issue as well, and provide a fix. Please

[jira] [Resolved] (HDDS-4130) remove the 1st edition of RatisServer of SCM HA which is copied from OM HA

2020-08-20 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng resolved HDDS-4130. - Resolution: Fixed > remove the 1st edition of RatisServer of SCM HA which is copied from OM HA >

[jira] [Updated] (HDDS-4365) SCMBlockLocationFailoverProxyProvider should use ScmBlockLocationProtocolPB.class in RPC.setProtocolEngine

2020-10-21 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4365: Description: in SCMBlockLocationFailoverProxyProvider, currently it is {code:java} private

[jira] [Created] (HDDS-4365) SCMBlockLocationFailoverProxyProvider should use ScmBlockLocationProtocolPB.class in RPC.setProtocolEngine

2020-10-21 Thread Glen Geng (Jira)
Glen Geng created HDDS-4365: --- Summary: SCMBlockLocationFailoverProxyProvider should use ScmBlockLocationProtocolPB.class in RPC.setProtocolEngine Key: HDDS-4365 URL: https://issues.apache.org/jira/browse/HDDS-4365

[jira] [Updated] (HDDS-4365) SCMBlockLocationFailoverProxyProvider should use ScmBlockLocationProtocolPB.class in RPC.setProtocolEngine

2020-10-21 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4365: Description: in SCMBlockLocationFailoverProxyProvider, it should be  {code:java} private

[jira] [Assigned] (HDDS-4365) SCMBlockLocationFailoverProxyProvider should use ScmBlockLocationProtocolPB.class in RPC.setProtocolEngine

2020-10-21 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng reassigned HDDS-4365: --- Assignee: Glen Geng > SCMBlockLocationFailoverProxyProvider should use >

[jira] [Updated] (HDDS-4365) SCMBlockLocationFailoverProxyProvider should use ScmBlockLocationProtocolPB.class in RPC.setProtocolEngine

2020-10-21 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4365: Description: in SCMBlockLocationFailoverProxyProvider, currently it is {code:java} private

[jira] [Updated] (HDDS-4365) SCMBlockLocationFailoverProxyProvider should use ScmBlockLocationProtocolPB.class in RPC.setProtocolEngine

2020-10-21 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4365: Priority: Minor (was: Major) > SCMBlockLocationFailoverProxyProvider should use >

[jira] [Commented] (HDDS-4355) Deleted container is marked as missing on recon UI

2020-10-26 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17221081#comment-17221081 ] Glen Geng commented on HDDS-4355: - cc [~avijayan] > Deleted container is marked as missing on recon UI >

[jira] [Commented] (HDDS-4385) It would be nice if there is a search function using container ID on Recon Missing Container page

2020-10-26 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17221079#comment-17221079 ] Glen Geng commented on HDDS-4385: - cc @[~avijayan] > It would be nice if there is a search function using

[jira] [Updated] (HDDS-4386) Each EndpointStateMachine uses its own thread pool to talk with SCM/Recon

2020-10-22 Thread Glen Geng (Jira)
[ https://issues.apache.org/jira/browse/HDDS-4386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Glen Geng updated HDDS-4386: Description: In Tencent production environment, after start Recon for a while, we got warnings that all

  1   2   >