I take that back, a 1.4.2 node handing off data to an older node during 
rebalancing is in fact reporting an unknown_msg error. I'll try your suggestion.

2013-09-24 21:02:35.337 [error] 
<0.15962.4>@riak_core_handoff_sender:start_fold:269 ownership_transfer transfer 
of riak_kv_vnode from '[email protected]' 
456719261665907161938651510223838443642478919680 to '[email protected]' 
456719261665907161938651510223838443642478919680 failed because of 
error:{case_clause,{ok,[255|<<"unknown_msg">>]}} 
[{riak_core_handoff_sender,start_fold,5,[{file,"src/riak_core_handoff_sender.erl"},{line,222}]}]

Dave


On Sep 24, 2013, at 3:42 PM, Dmitry Demeshchuk <[email protected]> wrote:

> Seems like a potential problem with handoff. We had similar problems 
> upgrading from 1.2.1 to 1.4.0. Check the logs for handoff errors (something 
> like <<"unknown_msg">> or similar).
> 
> If that's the case, leave that node be, and do in-place upgrade for the rest 
> of the nodes, without making them leave the cluster. The third node will 
> probably leave after that, so you'll be able to re-join it.
> 
> 
> On Tue, Sep 24, 2013 at 12:33 PM, David Greenstein <[email protected]> 
> wrote:
> 
> I'm performing a rolling upgrade to 1.4.2 (from 1.3.1). The first two nodes 
> that I replaced left the cluster without an issue and the new nodes joined 
> without an issue. Now, the next node seams to be in a state where it won't 
> leave the cluster. The status is leaving but it has been pending for several 
> hours. Perhaps it is due to the pending ownership handoff from ring_status 
> that also doesn't seem to be completing. Any insight or help on how to 
> "kickstart" the leave would be greatly appreciated!
> 
> Dave
> 
> [user@ip-10-0-1-12 user]# /db/riak/bin/riak-admin ring_status
> ================================== Claimant 
> ===================================
> Claimant:  '[email protected]'
> Status:     up
> Ring Ready: true
> 
> ============================== Ownership Handoff 
> ==============================
> Owner:      [email protected]
> Next Owner: [email protected]
> 
> Index: 456719261665907161938651510223838443642478919680
>  Waiting on: [riak_kv_vnode]
>  Complete:   [riak_pipe_vnode]
> 
> -------------------------------------------------------------------------------
> 
> ============================== Unreachable Nodes 
> ==============================
> All nodes are up and reachable
> 
> 
> 
> 
> 
> [user@ip-10-0-1-12 user]# /db/riak/bin/riak-admin member_status
> ================================= Membership 
> ==================================
> Status     Ring    Pending    Node
> -------------------------------------------------------------------------------
> leaving    14.1%     14.1%    '[email protected]'
> valid      14.1%     14.1%    '[email protected]'
> valid      14.1%     14.1%    '[email protected]'
> valid      17.2%     15.6%    '[email protected]'
> valid      12.5%     14.1%    '[email protected]'
> valid      14.1%     14.1%    '[email protected]'
> valid      14.1%     14.1%    '[email protected]'
> -------------------------------------------------------------------------------
> Valid:6 / Leaving:1 / Exiting:0 / Joining:0 / Down:0
> 
> 
> 
> 
> 
> _______________________________________________
> riak-users mailing list
> [email protected]
> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
> 
> 
> 
> 
> -- 
> Best regards,
> Dmitry Demeshchuk

_______________________________________________
riak-users mailing list
[email protected]
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to