[ https://issues.apache.org/jira/browse/HDFS-15672?focusedWorklogId=514249&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-514249 ]
ASF GitHub Bot logged work on HDFS-15672:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 19/Nov/20 17:59
            Start Date: 19/Nov/20 17:59
    Worklog Time Spent: 10m
      Work Description: iwasakims opened a new pull request #2475:
URL: https://github.com/apache/hadoop/pull/2475

https://issues.apache.org/jira/browse/HDFS-15672

The setting of testBalancingBlockpoolsWithBlockPoolPolicy is:

* 2 NameNodes (2 name spaces)
* 4 DataNodes (with 500 bytes capacity per node)
* blocksize = 100, replication factor = 2
* creating a 300-byte file on both name spaces (6 blocks total)
* adding 2 DataNodes (with 500 bytes capacity per node)
* running the balancer

If one of the DataNodes is chosen for all 6 blocks, it runs out of free space (6 blocks of 100 bytes exceed its 500-byte capacity). The resulting error causes the block creation to be retried.

```
2020-11-18 06:01:36,648 [DataXceiver for client DFSClient_NONMAPREDUCE_1983766438_12 at /127.0.0.1:46158 [Receiving block BP-631108559-172.31.197.233-1605679293748:blk_1073741827_1003]] ERROR datanode.DataNode (DataXceiver.java:run(324)) - host1.foo.com:43495:DataXceiver error processing WRITE_BLOCK operation src: /127.0.0.1:46158 dst: /127.0.0.1:43495
java.io.IOException: Creating block, no free space available
```

The garbage left behind breaks the assertion on total used space.

```
2020-11-18 06:01:37,361 [Listener at localhost/43281] INFO balancer.Balancer (TestBalancerWithMultipleNameNodes.java:runBalancer(172)) - BALANCER 0: totalUsed=1200, totalCapacity=3000, avg=40.0
2020-11-18 06:01:37,361 [Listener at localhost/43281] INFO balancer.Balancer (TestBalancerWithMultipleNameNodes.java:wait(151)) - WAIT expectedUsedSpace=1200, expectedTotalSpace=3000
...(snip)
2020-11-18 06:01:47,372 [Listener at localhost/43281] WARN balancer.Balancer (TestBalancerWithMultipleNameNodes.java:wait(161)) - WAIT i=100, s=[3000, 1300]
```

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Issue Time Tracking
-------------------

            Worklog Id:     (was: 514249)
    Remaining Estimate: 0h
            Time Spent: 10m

> TestBalancerWithMultipleNameNodes#testBalancingBlockpoolsWithBlockPoolPolicy fails on trunk
> -------------------------------------------------------------------------------------------
>
>                 Key: HDFS-15672
>                 URL: https://issues.apache.org/jira/browse/HDFS-15672
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>            Reporter: Ahmed Hussein
>            Assignee: Masatake Iwasaki
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> qbt report shows the following error:
> {code:bash}
> org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.testBalancingBlockpoolsWithBlockPoolPolicy
> Failing for the past 1 build (Since Failed#317)
> Took 10 min.
> Error Message
> test timed out after 600000 milliseconds
> Stacktrace
> org.junit.runners.model.TestTimedOutException: test timed out after 600000 milliseconds
>       at java.lang.Thread.sleep(Native Method)
>       at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.sleep(TestBalancerWithMultipleNameNodes.java:353)
>       at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.wait(TestBalancerWithMultipleNameNodes.java:159)
>       at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.runBalancer(TestBalancerWithMultipleNameNodes.java:175)
>       at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.runTest(TestBalancerWithMultipleNameNodes.java:550)
>       at org.apache.hadoop.hdfs.server.balancer.TestBalancerWithMultipleNameNodes.testBalancingBlockpoolsWithBlockPoolPolicy(TestBalancerWithMultipleNameNodes.java:609)
>       at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>       at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>       at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>       at java.lang.reflect.Method.invoke(Method.java:498)
>       at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
>       at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>       at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
>       at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>       at org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:298)
>       at org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:292)
>       at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>       at java.lang.Thread.run(Thread.java:748)
> {code}

--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
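
The space accounting behind the test setting in the pull request description can be checked with a short calculation. The sketch below is illustrative only: the class `BalancerSpaceSketch` and its methods are hypothetical names, not part of Hadoop or of the test. The input numbers (block size, file size, replication factor, node capacity) come from the description, and the 1300 figure is the second value of `s=[3000, 1300]` in the final WAIT log line, assumed here to be the used space.

```java
/** Hypothetical helper, not part of Hadoop: redoes the space arithmetic of the test setting. */
public class BalancerSpaceSketch {

    /** Bytes used when every name space holds one file of fileSize bytes with the given replication. */
    static long expectedUsed(int nameSpaces, long fileSize, int replication) {
        return nameSpaces * fileSize * replication;
    }

    /** Total capacity of a cluster of equally sized DataNodes. */
    static long totalCapacity(int dataNodes, long capacityPerNode) {
        return dataNodes * capacityPerNode;
    }

    /** Average utilization in percent. */
    static double avgUtilizationPercent(long used, long capacity) {
        return 100.0 * used / capacity;
    }

    /** Number of blocks needed for one file (rounded up). */
    static long blocksPerFile(long fileSize, long blockSize) {
        return (fileSize + blockSize - 1) / blockSize;
    }

    public static void main(String[] args) {
        long blockSize = 100, fileSize = 300, capacityPerNode = 500;
        int nameSpaces = 2, replication = 2;

        long used = expectedUsed(nameSpaces, fileSize, replication);
        long capacity = totalCapacity(4 + 2, capacityPerNode); // 4 initial + 2 added DataNodes
        long totalBlocks = nameSpaces * blocksPerFile(fileSize, blockSize);

        System.out.println("expected used     = " + used);
        System.out.println("total capacity    = " + capacity);
        System.out.println("avg utilization % = " + avgUtilizationPercent(used, capacity));

        // Worst case from the description: one DataNode is chosen for every block.
        long worstCaseOnOneNode = totalBlocks * blockSize;
        System.out.println("worst case on one node = " + worstCaseOnOneNode
                + " bytes vs capacity " + capacityPerNode);

        // Difference between the observed and expected used space, taken from the logs.
        long observedUsed = 1300;
        System.out.println("excess used space = " + (observedUsed - used) + " bytes");
    }
}
```

The expected values agree with the `BALANCER 0` log line (totalUsed=1200, totalCapacity=3000, avg=40.0). The excess of 100 bytes equals one block size, which would be consistent with a single leftover replica from the failed write, though the logs shown here do not confirm that directly.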