It looks as though there is some sort of loop in the transition graph that is preventing the cluster from making progress. What version is this?
On Wed, Jul 30, 2008 at 19:23, Kevin Harms <[EMAIL PROTECTED]> wrote: > > I'm running into a situation where I start HB on 3 nodes and they connect up > and such but the resources never start. I'm assuming there's some type of > error but I can't find it. Here's a small porition of the ha-log below. > > The 'tengine' has all these Pending states that don't occur when things > seems to go right. > > Where should I be looking for errors? > > thanks, > kevin > > pengine[8223]: 2008/07/29_14:54:43 notice: group_print: Resource Group: fs123 > pengine[8223]: 2008/07/29_14:54:43 notice: native_print: fs123_address > (heartbeat::ocf:IPaddr): Stopped > pengine[8223]: 2008/07/29_14:54:43 notice: native_print: fs123_filesystem > (external::ocf:Filesystem2): Stopped > pengine[8223]: 2008/07/29_14:54:43 notice: native_print: fs123_daemon > (external::ocf:PVFS2): Stopped > pengine[8223]: 2008/07/29_14:54:43 notice: StartRsc: fs121 Start > fs121_address > pengine[8223]: 2008/07/29_14:54:43 notice: StartRsc: fs121 Start > fs121_filesystem > pengine[8223]: 2008/07/29_14:54:43 notice: StartRsc: fs121 Start > fs121_daemon > pengine[8223]: 2008/07/29_14:54:43 notice: RecurringOp: fs121 > fs121_daemon_monitor_20000 > pengine[8223]: 2008/07/29_14:54:43 notice: StartRsc: fs122 Start > fs122_address > pengine[8223]: 2008/07/29_14:54:43 notice: StartRsc: fs122 Start > fs122_filesystem > pengine[8223]: 2008/07/29_14:54:43 notice: StartRsc: fs122 Start > fs122_daemon > pengine[8223]: 2008/07/29_14:54:43 notice: RecurringOp: fs122 > fs122_daemon_monitor_20000 > pengine[8223]: 2008/07/29_14:54:43 notice: StartRsc: fs123 Start > fs123_address > pengine[8223]: 2008/07/29_14:54:43 notice: StartRsc: fs123 Start > fs123_filesystem > pengine[8223]: 2008/07/29_14:54:43 notice: StartRsc: fs123 Start > fs123_daemon > pengine[8223]: 2008/07/29_14:54:43 notice: RecurringOp: fs123 > fs123_daemon_monitor_20000 > crmd[8169]: 2008/07/29_14:54:43 info: do_state_transition: State transition > S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS > cause=C_IPC_MESSAGE origin=route_message ] > tengine[8222]: 2008/07/29_14:54:43 info: unpack_graph: Unpacked transition 3: > 18 actions in 18 synapses > tengine[8222]: 2008/07/29_14:54:43 info: te_pseudo_action: Pseudo action 10 > fired and confirmed > tengine[8222]: 2008/07/29_14:54:43 info: te_pseudo_action: Pseudo action 18 > fired and confirmed > tengine[8222]: 2008/07/29_14:54:43 info: te_pseudo_action: Pseudo action 26 > fired and confirmed > tengine[8222]: 2008/07/29_14:54:43 notice: run_graph: > ==================================================== > tengine[8222]: 2008/07/29_14:54:43 WARN: run_graph: Transition 3: > (Complete=3, Pending=0, Fired=0, Skipped=0, Incomplete=15) > tengine[8222]: 2008/07/29_14:54:43 ERROR: te_graph_trigger: Transition > failed: terminated > tengine[8222]: 2008/07/29_14:54:43 WARN: print_graph: Graph 3 (18 actions in > 18 synapses): batch-limit=30 jobs, network-delay=60000ms > tengine[8222]: 2008/07/29_14:54:43 WARN: print_graph: Synapse 0 was confirmed > (priority: 0) > crmd[8169]: 2008/07/29_14:54:43 info: do_state_transition: State transition > S_TRANSITION_ENGINE -> S_IDLE [ input=I_TE_SUCCESS cause=C_IPC_MESSAGE > origin=route_message ] > tengine[8222]: 2008/07/29_14:54:43 WARN: print_graph: Synapse 1 is pending > (priority: 0) > tengine[8222]: 2008/07/29_14:54:43 WARN: print_elem: [Action 11]: Pending > (id: fs121_running_0, type: pseduo, priority: 0) > tengine[8222]: 2008/07/29_14:54:43 WARN: print_elem: * [Input 6]: > Pending (id: fs121_address_start_0, loc: fs121, priority: 0) > tengine[8222]: 2008/07/29_14:54:43 WARN: print_elem: * [Input 7]: > Pending (id: fs121_filesystem_start_0, loc: fs121, priority: 0) > tengine[8222]: 2008/07/29_14:54:43 WARN: print_elem: * [Input 8]: > Pending (id: fs121_daemon_start_0, loc: fs121, priority: 0) > tengine[8222]: 2008/07/29_14:54:43 WARN: print_elem: * [Input 10]: > Completed (id: fs121_start_0, type: pseduo, priority: 0) > tengine[8222]: 2008/07/29_14:54:43 WARN: print_graph: Synapse 2 is pending > (priority: 0) > tengine[8222]: 2008/07/29_14:54:43 WARN: print_elem: [Action 6]: Pending > (id: fs121_address_start_0, loc: fs121, priority: 0) > tengine[8222]: 2008/07/29_14:54:43 WARN: print_elem: * [Input 7]: > Pending (id: fs121_filesystem_start_0, loc: fs121, priority: 0) > tengine[8222]: 2008/07/29_14:54:43 WARN: print_elem: * [Input 10]: > Completed (id: fs121_start_0, type: pseduo, priority: 0) > tengine[8222]: 2008/07/29_14:54:43 WARN: print_graph: Synapse 3 is pending > (priority: 0) > tengine[8222]: 2008/07/29_14:54:43 WARN: print_elem: [Action 7]: Pending > (id: fs121_filesystem_start_0, loc: fs121, priority: 0) > tengine[8222]: 2008/07/29_14:54:43 WARN: print_elem: * [Input 6]: > Pending (id: fs121_address_start_0, loc: fs121, priority: 0) > tengine[8222]: 2008/07/29_14:54:43 WARN: print_graph: Synapse 4 is pending > (priority: 0) > tengine[8222]: 2008/07/29_14:54:43 WARN: print_elem: [Action 8]: Pending > (id: fs121_daemon_start_0, loc: fs121, priority: 0) > tengine[8222]: 2008/07/29_14:54:43 WARN: print_elem: * [Input 6]: > Pending (id: fs121_address_start_0, loc: fs121, priority: 0) > tengine[8222]: 2008/07/29_14:54:43 WARN: print_elem: * [Input 7]: > Pending (id: fs121_filesystem_start_0, loc: fs121, priority: 0) > _______________________________________________ > Linux-HA mailing list > [email protected] > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems > _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
