Elek, Marton created HDDS-343:
---------------------------------
Summary: Containers ars stuck in closing state in scm
Key: HDDS-343
URL: https://issues.apache.org/jira/browse/HDDS-343
Project: Hadoop Distributed Data Store
Issue Type: Bug
Components: SCM
Reporter: Elek, Marton
Fix For: 0.2.1
Containers could not been closed currently.
The datanode is closing the containers and sending the CLOSED state in the
container report but SCM doesn't register that the state is closed and sending
the close command again and again.
I think the ContainerMapping.processContainerReport should be improved.
{code}
scm_1 | --> RPC message request: SCMHeartbeatRequestProto from
172.25.0.2:33912
scm_1 | datanodeDetails {
scm_1 | uuid: "9c8f80bd-9424-4d74-99ef-a2bd58e66d7f"
scm_1 | ipAddress: "172.25.0.2"
scm_1 | hostName: "365fd1f44f0b"
scm_1 | ports {
scm_1 | name: "STANDALONE"
scm_1 | value: 9859
scm_1 | }
scm_1 | ports {
scm_1 | name: "RATIS"
scm_1 | value: 9858
scm_1 | }
scm_1 | ports {
scm_1 | name: "REST"
scm_1 | value: 9880
scm_1 | }
scm_1 | }
scm_1 | nodeReport {
scm_1 | storageReport {
scm_1 | storageUuid: "DS-61e76107-85c5-437a-95a7-aeb8b3e7827f"
scm_1 | storageLocation: "/tmp/hadoop-hadoop/dfs/data"
scm_1 | capacity: 491630870528
scm_1 | scmUsed: 2708828160
scm_1 | remaining: 24263614464
scm_1 | storageType: DISK
scm_1 | failed: false
scm_1 | }
scm_1 | }
scm_1 | containerReport {
scm_1 | reports {
scm_1 | containerID: 1
scm_1 | used: 1061158912
scm_1 | readCount: 0
scm_1 | writeCount: 64
scm_1 | readBytes: 0
scm_1 | writeBytes: 1061158912
scm_1 | state: CLOSED
scm_1 | }
scm_1 | reports {
scm_1 | containerID: 2
scm_1 | used: 1048576000
scm_1 | readCount: 0
scm_1 | writeCount: 64
scm_1 | readBytes: 0
scm_1 | writeBytes: 1048576000
scm_1 | state: CLOSED
scm_1 | }
scm_1 | reports {
scm_1 | containerID: 3
scm_1 | used: 511705088
scm_1 | readCount: 0
scm_1 | writeCount: 32
scm_1 | readBytes: 0
scm_1 | writeBytes: 511705088
scm_1 | state: OPEN
scm_1 | }
scm_1 | }
scm_1 | commandStatusReport {
scm_1 | }
scm_1 | containerActions {
scm_1 | containerActions {
scm_1 | containerID: 1
scm_1 | action: CLOSE
scm_1 | reason: CONTAINER_FULL
scm_1 | }
scm_1 | containerActions {
scm_1 | containerID: 2
scm_1 | action: CLOSE
scm_1 | reason: CONTAINER_FULL
scm_1 | }
scm_1 | }
scm_1 |
scm_1 | --> RPC message response: SCMHeartbeatRequestProto to
172.25.0.2:33912
scm_1 | datanodeUUID: "9c8f80bd-9424-4d74-99ef-a2bd58e66d7f"
scm_1 |
scm_1 | 2018-08-08 16:22:51 INFO CloseContainerEventHandler:56 -
Close container Event triggered for container : 1
scm_1 | 2018-08-08 16:22:51 INFO CloseContainerEventHandler:105 -
container with id : 1 is in CLOSING state and need not be closed.
scm_1 | 2018-08-08 16:22:51 INFO CloseContainerEventHandler:56 -
Close container Event triggered for container : 2
scm_1 | 2018-08-08 16:22:51 INFO CloseContainerEventHandler:105 -
container with id : 2 is in CLOSING state and need not be closed.
{code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]