Fixed. I had a wrong broadcast entry in /etc/sysconfig/network-scripts/ifcfg-eth0.
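
For anyone checking for the same mistake, the following is a minimal sketch that compares the BROADCAST value in an ifcfg-style file with the one implied by IPADDR and NETMASK. The sample addresses are hypothetical (the real values from this cluster are not shown in the thread), and the function names are made up for illustration. It assumes the file uses plain KEY=value lines.

```python
import ipaddress


def expected_broadcast(ip: str, netmask: str) -> str:
    """Return the broadcast address implied by an IPv4 address and netmask."""
    iface = ipaddress.ip_interface(f"{ip}/{netmask}")
    return str(iface.network.broadcast_address)


def parse_ifcfg(text: str) -> dict:
    """Parse simple KEY=value lines, skipping blanks and comments."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        settings[key.strip()] = value.strip().strip("\"'")
    return settings


def broadcast_mismatch(text: str):
    """Return (configured, expected) if BROADCAST looks wrong, else None."""
    cfg = parse_ifcfg(text)
    if not {"IPADDR", "NETMASK", "BROADCAST"} <= cfg.keys():
        return None  # not enough information to judge
    expected = expected_broadcast(cfg["IPADDR"], cfg["NETMASK"])
    if cfg["BROADCAST"] == expected:
        return None
    return cfg["BROADCAST"], expected


# Hypothetical file contents, NOT the values from this cluster.
SAMPLE = """\
DEVICE=eth0
IPADDR=192.168.10.21
NETMASK=255.255.255.0
BROADCAST=192.168.1.255
"""

if __name__ == "__main__":
    print(broadcast_mismatch(SAMPLE))
```

A return value of None means the entry is consistent or the file does not carry all three keys. Ping and rsh use unicast, so they can keep working even when this entry is wrong.
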
I was able to ping every node from every other node and was able to rsh, but because of this entry the node would not join the cluster.

> From: [email protected]
> To: [email protected]
> Subject: [Linux-HA] Node stays offline how to join it back.
> Date: Thu, 12 Mar 2009 22:16:30 +0000
>
> I have a four-node setup. Something happened when I rebooted all four nodes,
> and one of the nodes now just stays offline.
>
> [r...@hos004a heartbeat]# crm_mon
> Defaulting to one-shot mode
> You need to have curses available at compile time to enable console mode
>
> ============
> Last updated: Thu Mar 12 17:10:22 2009
> Current DC: hos002a (a5793c77-363e-4a23-80df-cb7d552f998f)
> 4 Nodes configured.
> 6 Resources configured.
> ============
>
> Node: hos001a (f3b0907e-b907-4057-89fe-a813ff5ef021): OFFLINE
> Node: hos002a (a5793c77-363e-4a23-80df-cb7d552f998f): online
> Node: hos003a (2cff3fca-3825-4429-a204-550885f4d952): online
> Node: hos004a (eda6f411-d37a-427b-9a18-7f77a11bd93c): online
>
> The hostcache file on 02, 03 and 04 looks like this. I tried to change it,
> but it becomes 0000 again.
>
> [r...@hos004a heartbeat]# cat /var/lib/heartbeat/hostcache
> hos001a 00000000-0000-0000-0000-000000000000 100
> hos002a a5793c77-363e-4a23-80df-cb7d552f998f 100
> hos003a 2cff3fca-3825-4429-a204-550885f4d952 100
> hos004a eda6f411-d37a-427b-9a18-7f77a11bd93c 100
>
> How do I get this node back in?
>
> I am running heartbeat 2.1.3 on Red Hat 4.7.
>
> All nodes can ping each other.
>
> Here is the log from the failed node. It just does not do anything.
>
> Mar 12 17:03:09 hos001a heartbeat: [7516]: info: Link hos001a:eth0 up.
> Mar 12 17:03:15 hos001a heartbeat: [7516]: info: Link hos003a:eth0 up.
> Mar 12 17:03:15 hos001a heartbeat: [7516]: info: Status update for node hos003a: status active
> Mar 12 17:03:16 hos001a heartbeat: [7516]: info: Link hos004a:eth0 up.
> Mar 12 17:03:16 hos001a heartbeat: [7516]: info: Status update for node hos004a: status active
> Mar 12 17:03:17 hos001a heartbeat: [7516]: info: Link hos002a:eth0 up.
> Mar 12 17:03:17 hos001a heartbeat: [7516]: info: Status update for node hos002a: status active
>
> I also checked the document at
> "http://www.linux-ha.org/v2/faq#head-09fdadc641deb9ee88120bb122c49502071b0495"
> but in my case this is not a new node; it was just rebooted.
>
> I do not have stonith configured yet.
>
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
