hi there,
We have a couchdb instance hosted on AWS's x.large instance, and it's
restart about every 10 minutes. Here is couch.log looks like when it
crashed.
[Thu, 19 Sep 2013 19:31:02 GMT] [error] [<0.15907.7364>]
{error_report,<0.31.0>,
{<0.15907.7364>,crash_report,
[[{initial_call,
{couch_index_server,init,['Argument__1']}},
{pid,<0.15907.7364>},
{registered_name,[]},
{error_info,
{exit,
{timeout,
{gen_server,call,
[couch_config,
{register,
#Fun<couch_index_server.config_change.2>,
<0.15907.7364>}]}},
[{gen_server,init_it,6},
{proc_lib,init_p_do_apply,3}]}},
{ancestors,
[couch_secondary_services,couch_server_sup,
<0.32.0>]},
{messages,[]},
{links,[<0.29263.7321>]},
{dictionary,[]},
{trap_exit,true},
{status,running},
{heap_size,377},
{stack_size,24},
{reductions,132}],
[]]}}
[Thu, 19 Sep 2013 19:31:02 GMT] [error] [<0.29263.7321>]
{error_report,<0.31.0>,
{<0.29263.7321>,supervisor_report,
[{supervisor,{local,couch_secondary_services}},
{errorContext,start_error},
{reason,
{timeout,
{gen_server,call,
[couch_config,
{register,
#Fun<couch_index_server.config_change.2>,
<0.15907.7364>}]}}},
{offender,
[{pid,<0.11599.7348>},
{name,index_server},
{mfargs,{couch_index_server,start_link,[]}},
{restart_type,permanent},
{shutdown,brutal_kill},
{child_type,worker}]}]}}
[Thu, 19 Sep 2013 19:31:02 GMT] [error] [<0.29263.7321>]
{error_report,<0.31.0>,
{<0.29263.7321>,supervisor_report,
[{supervisor,{local,couch_secondary_services}},
{errorContext,shutdown},
{reason,reached_max_restart_intensity},
{offender,
[{pid,<0.11599.7348>},
{name,index_server},
{mfargs,{couch_index_server,start_link,[]}},
{restart_type,permanent},
{shutdown,brutal_kill},
{child_type,worker}]}]}}
[Thu, 19 Sep 2013 19:31:02 GMT] [error] [<0.83.0>]
{error_report,<0.31.0>,
{<0.83.0>,supervisor_report,
[{supervisor,{local,couch_server_sup}},
{errorContext,child_terminated},
{reason,shutdown},
{offender,
[{pid,<0.29263.7321>},
{name,couch_secondary_services},
{mfargs,{couch_secondary_sup,start_link,[]}},
{restart_type,permanent},
{shutdown,infinity},
{child_type,supervisor}]}]}}
Any hints on what the problem could be? We are hosting about 19000+
database of this machine with pretty high file descriptor limits.
Thanks in advance,
Herman