[ https://issues.apache.org/jira/browse/HADOOP-1816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
stack resolved HADOOP-1816. --------------------------- Resolution: Fixed Fix Version/s: 0.15.0 I buy your rationale above Jim. There may be other states that a regionserver can get into like the one described herein where it wouldn't go down and it kept making achk, achk, achk noises like some wounded duck but we can open a new issue to address it when we see it. Resolving. Fixed by HADOOP-1801 > [hbase] Scan of .META. does socket timeout over and over again (rather than > ---------------------------------------------------------------------------- > > Key: HADOOP-1816 > URL: https://issues.apache.org/jira/browse/HADOOP-1816 > Project: Hadoop > Issue Type: Bug > Components: contrib/hbase > Reporter: stack > Assignee: stack > Priority: Trivial > Fix For: 0.15.0 > > Attachments: excerpt.txt > > > A mismatch in the code on the cluster revealed an infinite loop. The .META. > scanner is doing a socket timeout trying to contact a borked region server > (The borked server was having trouble contacting hdfs because of of code > version mismatch -- it was sort-of-working). We retry the timeout up to the > retry limit but then rather than try and redeploy the unreachable .META. we > just drop back into scanning at the old location.... I'll attach a log that > illustrates the goings-on. > I think this likely a trivial issue since it shouldn't really ever happen.... -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.