[ https://issues.apache.org/jira/browse/ACCUMULO-2227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13879034#comment-13879034 ]
ASF subversion and git services commented on ACCUMULO-2227:
-----------------------------------------------------------
Commit 06f80305e4587f519cb3dfae0686b52b32e7a0b8 in branch refs/heads/master
from [~bhavanki]
[ https://git-wip-us.apache.org/repos/asf?p=accumulo.git;h=06f8030 ]
ACCUMULO-2227 / ACCUMULO-2228 Update randomwalk README with HA warning
Hadoop 2.1.0 includes better retry / failover handling than prior versions. This
commit adds a warning to the randomwalk README advising testers to expect more
failures when exercising HA under Hadoop versions before 2.1.0.
> Concurrent randomwalk fails when namenode dies after bulk import step
> ---------------------------------------------------------------------
>
> Key: ACCUMULO-2227
> URL: https://issues.apache.org/jira/browse/ACCUMULO-2227
> Project: Accumulo
> Issue Type: Bug
> Components: test
> Affects Versions: 1.4.4
> Reporter: Bill Havanki
> Assignee: Bill Havanki
> Labels: ha, randomwalk, test
>
> Running Concurrent randomwalk under HDFS HA, if the active namenode is killed:
> {noformat}
> 20 12:27:51,119 [retry.RetryInvocationHandler] WARN : Exception while
> invoking class
> org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolTranslatorPB.delete.
> Not retrying because the invoked method is not idempotent, and unable to
> determine whether it was invoked
> java.io.IOException: Failed on local exception: java.io.IOException: Response
> is null.; Host Details : local host is: "slave.domain.com/10.20.200.113";
> destination host is: "namenode.domain.com":8020;
> ...
> at org.apache.hadoop.hdfs.DFSClient.delete(DFSClient.java:1487)
> at org.apache.hadoop.hdfs.DistributedFileSystem.delete(DistributedFileSystem.java:355)
> at org.apache.accumulo.server.test.randomwalk.concurrent.BulkImport.visit(BulkImport.java:140)
> ...
> Caused by: java.io.IOException: Response is null.
> at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:952)
> at org.apache.hadoop.ipc.Client$Connection.run(Client.java:847)
> {noformat}
> This arises from an HDFS path delete call that cleans up after the bulk
> import. The test should be resilient here (and where the paths are created
> earlier in the test) so that it can continue once failover has
> completed.
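
One way to get the resilience described above is to wrap the cleanup call in a small retry loop. The following is only a minimal sketch and not the fix that was committed: the class RetryingDelete, the interface IoCall, and the parameters maxAttempts and pauseMillis are hypothetical names introduced for illustration and are not part of Accumulo or Hadoop.

```java
import java.io.IOException;

/** Hypothetical helper for illustration; not part of Accumulo or Hadoop. */
final class RetryingDelete {

  /** An operation that may fail with an IOException, such as an HDFS delete. */
  interface IoCall<T> {
    T call() throws IOException;
  }

  /**
   * Runs op up to maxAttempts times, sleeping pauseMillis between failed
   * attempts. Rethrows the last IOException if every attempt fails.
   */
  static <T> T retry(IoCall<T> op, int maxAttempts, long pauseMillis)
      throws IOException, InterruptedException {
    if (maxAttempts < 1) {
      throw new IllegalArgumentException("maxAttempts must be at least 1");
    }
    IOException last = null;
    for (int attempt = 1; attempt <= maxAttempts; attempt++) {
      try {
        return op.call();
      } catch (IOException e) {
        // Assumption: the failure may be transient (e.g. namenode failover).
        last = e;
        if (attempt < maxAttempts) {
          Thread.sleep(pauseMillis);
        }
      }
    }
    throw last;
  }
}
```

In the test, the call passed to retry would be the existing delete of the bulk import directories. Because delete is not idempotent, the first attempt may have taken effect on the namenode before the response was lost, so the caller should treat a path that no longer exists on a later attempt as a successful cleanup rather than as an error. The pause should be long enough to cover a typical failover in the cluster under test.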
--
This message was sent by Atlassian JIRA (v6.1.5#6160)