[ 
https://issues.apache.org/jira/browse/HDFS-7916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15049200#comment-15049200
 ] 

Yongjun Zhang commented on HDFS-7916:
-------------------------------------

Hi [~rushabh.shah],

Thanks for your reply, and sorry I missed this update until now, while looking 
at a related issue.  I created HDFS-9532 while looking at the related code here.

One thing I'd like to check with you and [~vinayrpet]: the HDFS-7916 fix tries 
to handle StandbyException correctly, as reported, but it catches 
RemoteException without checking whether the exception wrapped by the 
RemoteException is actually a StandbyException. Is it intended to handle every 
exception wrapped by RemoteException the same way as StandbyException? Is 
there any case where we would not want to do that?  It seems worth 
understanding here. Could you both please comment?
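
To make the question concrete, here is a minimal, self-contained sketch of the 
distinction I have in mind. It is not the HDFS-7916 patch and it does not use 
the Hadoop classes: the nested RemoteException is a stand-in modeled on 
org.apache.hadoop.ipc.RemoteException (which carries the class name of the 
server-side exception), and the Kind enum and classify() helper are 
hypothetical names used only for illustration. How each case should actually 
be handled is exactly the open question.

```java
import java.io.IOException;

// Illustrative sketch only; all names here are assumptions, not Hadoop code.
class RemoteExceptionCheckSketch {

    // Stand-in for a remote exception that records which exception class
    // the server side threw.
    public static class RemoteException extends IOException {
        private final String className;

        public RemoteException(String className, String message) {
            super(message);
            this.className = className;
        }

        public String getClassName() {
            return className;
        }
    }

    // Fully qualified name of the wrapped exception we want to single out.
    public static final String STANDBY_EXCEPTION =
        "org.apache.hadoop.ipc.StandbyException";

    // Hypothetical categories a caller could branch on.
    public enum Kind { STANDBY, OTHER_REMOTE, LOCAL_IO }

    // Classify a failure by inspecting the wrapped class name instead of
    // treating every RemoteException alike.
    public static Kind classify(IOException e) {
        if (e instanceof RemoteException) {
            String wrapped = ((RemoteException) e).getClassName();
            return STANDBY_EXCEPTION.equals(wrapped)
                ? Kind.STANDBY
                : Kind.OTHER_REMOTE;
        }
        return Kind.LOCAL_IO;
    }

    public static void main(String[] args) {
        IOException[] samples = {
            new RemoteException(STANDBY_EXCEPTION, "operation not allowed"),
            new RemoteException("java.io.FileNotFoundException", "no file"),
            new IOException("connection reset")
        };
        for (IOException e : samples) {
            System.out.println(classify(e) + ": " + e.getMessage());
        }
    }
}
```

With a check like this, the StandbyException case could be handled on its own, 
and the OTHER_REMOTE case could be decided separately rather than by default.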

Thanks.


> 'reportBadBlocks' from datanodes to standby Node BPServiceActor goes for 
> infinite loop
> --------------------------------------------------------------------------------------
>
>                 Key: HDFS-7916
>                 URL: https://issues.apache.org/jira/browse/HDFS-7916
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: datanode
>    Affects Versions: 2.7.0
>            Reporter: Vinayakumar B
>            Assignee: Rushabh S Shah
>            Priority: Critical
>             Fix For: 2.7.1
>
>         Attachments: HDFS-7916-01.patch, HDFS-7916-1.patch
>
>
> If any bad block is found, the BPSA for the StandbyNode will retry reporting 
> it indefinitely.
> {noformat}2015-03-11 19:43:41,528 WARN 
> org.apache.hadoop.hdfs.server.datanode.DataNode: Failed to report bad block 
> BP-1384821822-10.224.54.68-1422634566395:blk_1079544278_5812006 to namenode: 
> stobdtserver3/10.224.54.70:18010
> org.apache.hadoop.hdfs.server.datanode.BPServiceActorActionException: Failed 
> to report bad block 
> BP-1384821822-10.224.54.68-1422634566395:blk_1079544278_5812006 to namenode:
>         at 
> org.apache.hadoop.hdfs.server.datanode.ReportBadBlockAction.reportTo(ReportBadBlockAction.java:63)
>         at 
> org.apache.hadoop.hdfs.server.datanode.BPServiceActor.processQueueMessages(BPServiceActor.java:1020)
>         at 
> org.apache.hadoop.hdfs.server.datanode.BPServiceActor.offerService(BPServiceActor.java:762)
>         at 
> org.apache.hadoop.hdfs.server.datanode.BPServiceActor.run(BPServiceActor.java:856)
>         at java.lang.Thread.run(Thread.java:745)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
