This is bad. Yes, copyQueuesFromRSUsingMulti returns a list which it might not be able to move later on, resulting in bogus znodes. I'll fix this asap. Weird it didn't happen in my testing earlier. Sorry about this.
On Wed, Mar 13, 2013 at 6:27 PM, lars hofhansl <la...@apache.org> wrote: > Sorry 0.94.6RC1 > (I complain about folks not reporting the version all the time, and then I do > it too) > > > > ________________________________ > From: Ted Yu <yuzhih...@gmail.com> > To: dev@hbase.apache.org; lars hofhansl <la...@apache.org> > Sent: Wednesday, March 13, 2013 6:17 PM > Subject: Re: Replication hosed after simple cluster restart > > > Did this happen on 0.94.5 ? > > Thanks > > > On Wed, Mar 13, 2013 at 6:12 PM, lars hofhansl <la...@apache.org> wrote: > > We just ran into an interesting scenario. We restarted a cluster that was > setup as a replication source. >>The stop went cleanly. >> >>Upon restart *all* regionservers aborted within a few seconds with variations >>of these errors: >>http://pastebin.com/3iQVuBqS >> >>This is scary! >> >>-- Lars