Fixed.

I had wrong broadcast entry on /etc/sysconfig/networ-scripts/ifcfg-eth0

I was able to ping every node from everynode and was able to rsh but becuase of 
this entry it was not working.



> From: [email protected]
> To: [email protected]
> Subject: [Linux-HA] Node stays offline how to join it back.
> Date: Thu, 12 Mar 2009 22:16:30 +0000
> 
> 
> I have four node setup. Something happen when I rebooted all four nodes that 
> one of the node now just stays offline
> 
> [r...@hos004a heartbeat]# crm_mon
> Defaulting to one-shot mode
> You need to have curses available at compile time to enable console mode
> 
> 
> ============
> Last updated: Thu Mar 12 17:10:22 2009
> Current DC: hos002a (a5793c77-363e-4a23-80df-cb7d552f998f)
> 4 Nodes configured.
> 6 Resources configured.
> ============
> 
> Node: hos001a (f3b0907e-b907-4057-89fe-a813ff5ef021): OFFLINE
> Node: hos002a (a5793c77-363e-4a23-80df-cb7d552f998f): online
> Node: hos003a (2cff3fca-3825-4429-a204-550885f4d952): online
> Node: hos004a (eda6f411-d37a-427b-9a18-7f77a11bd93c): online
> 
> 
> 02, 03 and 04's hostcache file looks like this. I try to change that but it 
> becomes 0000 again.
> 
> [r...@hos004a heartbeat]# cat /var/lib/heartbeat/hostcache 
> hos001a 00000000-0000-0000-0000-000000000000    100
> hos002a a5793c77-363e-4a23-80df-cb7d552f998f    100
> hos003a 2cff3fca-3825-4429-a204-550885f4d952    100
> hos004a eda6f411-d37a-427b-9a18-7f77a11bd93c    100
> 
> How do I get this node back in?
> 
> I am running heartbeat 2.1.3 on redhat 4.7
> 
> All node can ping each other.
> 
> Here is log from failed node. it just does not do anything
> 
> Mar 12 17:03:09 hos001a heartbeat: [7516]: info: Link hos001a:eth0 up.
> Mar 12 17:03:15 hos001a heartbeat: [7516]: info: Link hos003a:eth0 up.
> Mar 12 17:03:15 hos001a heartbeat: [7516]: info: Status update for node 
> hos003a: status active
> Mar 12 17:03:16 hos001a heartbeat: [7516]: info: Link hos004a:eth0 up.
> Mar 12 17:03:16 hos001a heartbeat: [7516]: info: Status update for node 
> hos004a: status active
> Mar 12 17:03:17 hos001a heartbeat: [7516]: info: Link hos002a:eth0 up.
> Mar 12 17:03:17 hos001a heartbeat: [7516]: info: Status update for node 
> hos002a: status active
> 
> 
> I also check document at 
> "http://www.linux-ha.org/v2/faq#head-09fdadc641deb9ee88120bb122c49502071b0495";
> but in my case this is not new node it just rebooted.
> 
> I do not have stonith configure yet.
> 
> 
> 
> 
> 
> 
> 
> _________________________________________________________________
> Windows Live™ Contacts: Organize your contact list. 
> http://windowslive.com/connect/post/marcusatmicrosoft.spaces.live.com-Blog-cns!503D1D86EBB2B53C!2285.entry?ocid=TXT_TAGLM_WL_UGC_Contacts_032009_______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems

_________________________________________________________________
Windows Live™: Life without walls.
http://windowslive.com/explore?ocid=TXT_TAGLM_WL_allup_1a_explore_032009_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to