I take that back, a 1.4.2 node handing off data to an older node during rebalancing is in fact reporting an unknown_msg error. I'll try your suggestion.
2013-09-24 21:02:35.337 [error] <0.15962.4>@riak_core_handoff_sender:start_fold:269 ownership_transfer transfer of riak_kv_vnode from '[email protected]' 456719261665907161938651510223838443642478919680 to '[email protected]' 456719261665907161938651510223838443642478919680 failed because of error:{case_clause,{ok,[255|<<"unknown_msg">>]}} [{riak_core_handoff_sender,start_fold,5,[{file,"src/riak_core_handoff_sender.erl"},{line,222}]}] Dave On Sep 24, 2013, at 3:42 PM, Dmitry Demeshchuk <[email protected]> wrote: > Seems like a potential problem with handoff. We had similar problems > upgrading from 1.2.1 to 1.4.0. Check the logs for handoff errors (something > like <<"unknown_msg">> or similar). > > If that's the case, leave that node be, and do in-place upgrade for the rest > of the nodes, without making them leave the cluster. The third node will > probably leave after that, so you'll be able to re-join it. > > > On Tue, Sep 24, 2013 at 12:33 PM, David Greenstein <[email protected]> > wrote: > > I'm performing a rolling upgrade to 1.4.2 (from 1.3.1). The first two nodes > that I replaced left the cluster without an issue and the new nodes joined > without an issue. Now, the next node seams to be in a state where it won't > leave the cluster. The status is leaving but it has been pending for several > hours. Perhaps it is due to the pending ownership handoff from ring_status > that also doesn't seem to be completing. Any insight or help on how to > "kickstart" the leave would be greatly appreciated! > > Dave > > [user@ip-10-0-1-12 user]# /db/riak/bin/riak-admin ring_status > ================================== Claimant > =================================== > Claimant: '[email protected]' > Status: up > Ring Ready: true > > ============================== Ownership Handoff > ============================== > Owner: [email protected] > Next Owner: [email protected] > > Index: 456719261665907161938651510223838443642478919680 > Waiting on: [riak_kv_vnode] > Complete: [riak_pipe_vnode] > > ------------------------------------------------------------------------------- > > ============================== Unreachable Nodes > ============================== > All nodes are up and reachable > > > > > > [user@ip-10-0-1-12 user]# /db/riak/bin/riak-admin member_status > ================================= Membership > ================================== > Status Ring Pending Node > ------------------------------------------------------------------------------- > leaving 14.1% 14.1% '[email protected]' > valid 14.1% 14.1% '[email protected]' > valid 14.1% 14.1% '[email protected]' > valid 17.2% 15.6% '[email protected]' > valid 12.5% 14.1% '[email protected]' > valid 14.1% 14.1% '[email protected]' > valid 14.1% 14.1% '[email protected]' > ------------------------------------------------------------------------------- > Valid:6 / Leaving:1 / Exiting:0 / Joining:0 / Down:0 > > > > > > _______________________________________________ > riak-users mailing list > [email protected] > http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com > > > > > -- > Best regards, > Dmitry Demeshchuk
_______________________________________________ riak-users mailing list [email protected] http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
