hong created HBASE-25274:
Summary: Regionserver crash by YouAreDeadException
Key: HBASE-25274
URL: https://issues.apache.org/jira/browse/HBASE-25274
Project: HBase
Issue Type: Bug
] for contributing.
> YouAreDeadException should be moved to hbase-server module
> --
>
> Key: HBASE-25201
> URL: https://issues.apache.org/jira/browse/HBASE-25201
> Project: HBase
>
niuyulin created HBASE-25201:
Summary: YouAreDeadException should be moved to hbase-server module
Key: HBASE-25201
URL: https://issues.apache.org/jira/browse/HBASE-25201
Project: HBase
Issue
processed its session expired
event. It heartbeats the Master as though nothing wrong.
4. Master is mid-recovery or beyond server recovery and on receipt of
the heartbeat in essence tells the regionserver to 'go away' by
sending him the YouAreDeadException.
5. By now the regionserver will have gotten its
. It heartbeats the Master as though nothing wrong.
4. Master is mid-recovery or beyond server recovery and on receipt of
the heartbeat in essence tells the regionserver to 'go away' by
sending him the YouAreDeadException.
5. By now the regionserver will have gotten its session expired
notification
the regionserver to 'go away' by
sending him the YouAreDeadException.
5. By now the regionserver will have gotten its session expired
notification and will have started an abort so the YADE is not news
when it receives the exception.
St.Ack
On Fri, Jan 14, 2011 at 7:49 PM, Ted Yu yuzhih
tells the regionserver to 'go away' by
sending him the YouAreDeadException.
5. By now the regionserver will have gotten its session expired
notification and will have started an abort so the YADE is not news
when it receives the exception.
St.Ack
On Fri, Jan 14, 2011 at 7:49 PM, Ted Yu yuzhih
the regionserver to 'go away' by
sending him the YouAreDeadException.
5. By now the regionserver will have gotten its session expired
notification and will have started an abort so the YADE is not news
when it receives the exception.
St.Ack
On Fri, Jan 14, 2011 at 7:49 PM, Ted Yu yuzhih
session expired
event. It heartbeats the Master as though nothing wrong.
4. Master is mid-recovery or beyond server recovery and on receipt of
the heartbeat in essence tells the regionserver to 'go away' by
sending him the YouAreDeadException.
5. By now the regionserver will have gotten
This is the cause:
org.apache.hadoop.hbase.regionserver.HRegionServer: ABORTING region server
serverName=sjc1-hadoop1.sjc1.carrieriq.com,60020,1294856823378,
load=(requests=0, regions=6, usedHeap=514, maxHeap=3983):
regionserver:60020-0x12d7b7b1c760004 regionserver:60020-0x12d7b7b1c760004
I'm seeing not much in the way of errors, timeouts, all to one machine
ending with .80, so that is probably your failed node.
Other than that, the log doesnt seem to say too much. Searching for
strings like FATAL and Exception is the way to go here.
Also things like this:
2011-01-14
Thanks for your analysis, Ryan.
The dev cluster has half as many nodes as our staging cluster. Each node has
half the number of cores as the node in staging.
I agree with your conclusion.
I will report back after I collect more data - the flow uses hbase heavily
toward the end.
On Fri, Jan 14,
12 matches
Mail list logo