@James

Thanks for the insight. I think that's also our case. I see the dead region 
list but it seems like our cluster is operating properly.
However, from a maintenance standpoint I'd like the cluster to always report as 
health. And having a list of "dead" servers is not a healthy thing to have.
So i was hoping that from the comments I'd be collecting here, I could write a 
shell file that would do this clean up in an automated fashion. I just needed 
insight as to what I should be cleaning up and when it's safe to do so.

jeff

________________________________
From: James Moore <jcmo...@hubspot.com>
Sent: Friday, May 26, 2017 11:35:22 AM
To: u...@hbase.apache.org
Cc: dev@hbase.apache.org
Subject: Re: What is Dead Region Servers and how to clear them up?

In HBase all data is stored in HDFS rather than inside of the region
server.  The HBase cluster itself considers any individual region
server process a region server and when that process dies it is considered
a dead region server, this tracking is particularly important during the
crash recovery process and dealing with network partitions, there isn't any
need to clean up dead region servers as an out of band maintenance task and
will be cleaned up by the HMasters eventually.

On Fri, May 26, 2017 at 2:03 PM, jeff saremi <jeffsar...@hotmail.com> wrote:

> Thank you for the GFY answer
>
> And i guess to figure out how to fix these I can always go through the
> HBase source code.
>
>
> ________________________________
> From: Dima Spivak <dimaspi...@apache.org>
> Sent: Friday, May 26, 2017 9:58:00 AM
> To: hbase-user
> Subject: Re: What is Dead Region Servers and how to clear them up?
>
> Sending this back to the user mailing list.
>
> RegionServers can die for many reasons. Looking at your RegionServer log
> files should give hints as to why it's happening.
>
>
> -Dima
>
> On Fri, May 26, 2017 at 9:48 AM, jeff saremi <jeffsar...@hotmail.com>
> wrote:
>
> > I had posted this to the user mailing list and I have not got any direct
> > answer to my question.
> >
> > Where do dead RS's come from and how can they be cleaned up? Someone in
> > the midst of developers should know this.
> >
> > thanks
> >
> > Jeff
> >
> > ________________________________
> > From: jeff saremi <jeffsar...@hotmail.com>
> > Sent: Thursday, May 25, 2017 10:23:17 AM
> > To: u...@hbase.apache.org
> > Subject: Re: What is Dead Region Servers and how to clear them up?
> >
> > I'm still looking to get hints on how to remove the dead regions. thanks
> >
> > ________________________________
> > From: jeff saremi <jeffsar...@hotmail.com>
> > Sent: Wednesday, May 24, 2017 12:27:06 PM
> > To: u...@hbase.apache.org
> > Subject: Re: What is Dead Region Servers and how to clear them up?
> >
> > i'm trying to eliminate the dead region servers.
> >
> > ________________________________
> > From: Ted Yu <yuzhih...@gmail.com>
> > Sent: Wednesday, May 24, 2017 12:17:40 PM
> > To: u...@hbase.apache.org
> > Subject: Re: What is Dead Region Servers and how to clear them up?
> >
> > bq. running hbck (many times
> >
> > Can you describe the specific inconsistencies you were trying to resolve
> ?
> > Depending on the inconsistencies, advice can be given on the best known
> > hbck command arguments to use.
> >
> > Feel free to pastebin master log if needed.
> >
> > On Wed, May 24, 2017 at 12:10 PM, jeff saremi <jeffsar...@hotmail.com>
> > wrote:
> >
> > > these are the things I have done so far:
> > >
> > >
> > > - restarting master (few times)
> > >
> > > - running hbck (many times; this tool does not seem to be doing
> anything
> > > at all)
> > >
> > > - checking the list of region servers in ZK (none of the dead ones are
> > > listed here)
> > >
> > > - checking the WALs under <hbase_hdfs>/WALs. Out of 11 dead ones only 3
> > > are listed here with "-splitting" at the end of their names and they
> > > contain one single file like: 1493846660401..meta.1493922323600.meta
> > >
> > >
> > >
> > >
> > > ________________________________
> > > From: jeff saremi <jeffsar...@hotmail.com>
> > > Sent: Wednesday, May 24, 2017 9:04:11 AM
> > > To: u...@hbase.apache.org
> > > Subject: What is Dead Region Servers and how to clear them up?
> > >
> > > Apparently having dead region servers is so common that a section of
> the
> > > master console is dedicated to that?
> > > How can we clean this up (preferably in an automated fashion)? Why
> isn't
> > > this being done by HBase automatically?
> > >
> > >
> > > thanks
> > >
> >
>

Reply via email to