Hello Riak friends,
I'm rolling out a new Riak ring in EC2 on a set of m1.xlarges. I'm planning on
having six nodes. I wrote some automation that provisions the servers, installs
Riak, arranges the ring, etc., and after standing up three nodes I decided to
see how things are going.
Got Riak up and running on each machine just fine but there's something weird
in my logs that I don't quite get:
$ tail -n 5 /var/log/riak/error.log
2012-11-03 19:22:18.023 [error] <0.8981.0> CRASH REPORT Process <0.8981.0> with
0 neighbours exited with reason: no match of right hand value {error,timeout}
in riak_core_handoff_receiver:handle_call/3 line 59 in gen_server2:terminate/6
line 873
2012-11-03 19:22:18.030 [error] <0.158.0> Supervisor
riak_core_handoff_receiver_sup had child riak_core_handoff_receiver started
with {riak_core_handoff_receiver,start_link,undefined} at <0.8981.0> exit with
reason no match of right hand value {error,timeout} in
riak_core_handoff_receiver:handle_call/3 line 59 in context child_terminated
2012-11-03 19:22:23.028 [error] <0.8993.0> gen_server <0.8993.0> terminated
with reason:
{timeout,{gen_server2,call,[<0.9040.0>,{set_socket,#Port<0.8109>}]}}
2012-11-03 19:22:23.031 [error] <0.8993.0> CRASH REPORT Process
riak_core_handoff_listener with 1 neighbours exited with reason:
{timeout,{gen_server2,call,[<0.9040.0>,{set_socket,#Port<0.8109>}]}} in
gen_server:terminate/6 line 747
2012-11-03 19:22:23.035 [error] <0.160.0> Supervisor
riak_core_handoff_listener_sup had child riak_core_handoff_listener started
with riak_core_handoff_listener:start_link() at <0.8993.0> exit with reason
{timeout,{gen_server2,call,[<0.9040.0>,{set_socket,#Port<0.8109>}]}} in context
child_terminated
Otherwise, ring_status, member_status, and ring_ready return OK:
$ /usr/sbin/riak-admin ring_status
Attempting to restart script through sudo -H -u riak
================================== Claimant ===================================
Claimant: 'reporting_prod001@ my_private_ip1'
Status: up
Ring Ready: true
============================== Ownership Handoff ==============================
No pending changes.
============================== Unreachable Nodes ==============================
All nodes are up and reachable
$ /usr/sbin/riak-admin member_status
Attempting to restart script through sudo -H -u riak
================================= Membership ==================================
Status Ring Pending Node
-------------------------------------------------------------------------------
valid 33.6% -- 'reporting_prod001@my_private_ip1'
valid 33.2% -- 'reporting_prod002@ my_private_ip2'
valid 33.2% -- 'reporting_prod003@ my_private_ip3'
-------------------------------------------------------------------------------
Valid:3 / Leaving:0 / Exiting:0 / Joining:0 / Down:0
$ /usr/sbin/riak-admin ringready
Attempting to restart script through sudo -H -u riak
TRUE All nodes agree on the ring ['reporting_prod001@ my_private_ip1',
'reporting_prod002@ my_private_ip2',
'reporting_prod003@ my_private_ip3']
Additionally, if I try to curl either my ELB from my machine or the private IP
from one of the cluster members, I get back the content I would (mostly) expect:
$ curl --insecure https://my-elb.aws.amazon.com:8080/
<html><body><ul><li><a href="/admin">admin_cluster</a></li><li><a
href="/admin">admin_cluster_down</a></li><li><a
href="/admin">admin_cluster_join</a></li><li><a
href="/admin">admin_fallbacks</a></li><li><a
href="/admin">admin_gui</a></li><li><a href="/admin">admin_node</a></li><li><a
href="/admin">admin_node_leave</a></li><li><a
href="/admin">admin_node_stop</a></li><li><a
href="/admin">admin_overview</a></li><li><a
href="/admin">admin_ring</a></li><li><a
href="/buckets">riak_kv_wm_buckets</a></li><li><a
href="/riak">riak_kv_wm_buckets</a></li><li><a
href="/buckets">riak_kv_wm_index</a></li><li><a
href="/buckets">riak_kv_wm_keylist</a></li><li><a
href="/buckets">riak_kv_wm_link_walker</a></li><li><a
href="/riak">riak_kv_wm_link_walker</a></li><li><a
href="/mapred">riak_kv_wm_mapred</a></li><li><a
href="/buckets">riak_kv_wm_object</a></li><li><a
href="/riak">riak_kv_wm_object</a></li><li><a
href="/ping">riak_kv_wm_ping</a></li><li><a
href="/buckets">riak_kv_wm_props</a></li><li><a
href="/stats">riak_kv_wm_stats</a></li></ul></body></html>
So…it appears as if everything is OK. But what are the log messages about…?
Something to do with our ELB maybe?
Cheers,
Brad Heller | Engineering Lead | Cloudability.com | 541-231-1514 | Skype:
brad.heller | @bradhe | @cloudability
We're hiring! http://cloudability.com/jobs
_______________________________________________
riak-users mailing list
[email protected]
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com