Yes, always the same 4 nodes. -f doesn't produce anything, just returns immediately, possibly due to the fact fping is not installed.
I have performed the following sanity check: # rpower n01 off # nodestat -up n01 noping(off) # nodels n01 nodelist.appstatus nodelist.status n01: nodelist.appstatus: xend=down,sshd=up,rdp=down,https=up,pbs=down,msrpc=down n01: nodelist.status: noping(off) # tabch node=n01 nodelist.appstatus="" # nodels n01 nodelist.appstatus nodelist.status n01: nodelist.appstatus: n01: nodelist.status: noping(off) # nodestat -up n01 noping(off) # nodels n01 nodelist.appstatus nodelist.status n01: nodelist.appstatus: n01: nodelist.status: noping(off) # rpower n01 on # nodestat -up n01 sshd,https # nodels n01 nodelist.appstatus nodelist.status n01: nodelist.appstatus: xend=down,sshd=up,rdp=down,https=up,pbs=down,msrpc=down n01: nodelist.status: ping So it seems when a node is not responding (noping), the appstatus field doesn't update. I assume some kind of code optimization (why bother checking appstatus for a noping node) but it does present a slightly more complex logic to my monitoring code, How hard would it be to modify the code to update the appstatus regardless of status=noping (or even automatically set everything to 'down')? On Tue, Apr 21, 2015 at 5:34 PM, Ling Gao <[email protected]> wrote: > Do you consistently get the sshd=down for the same 4 nodes and their status > is "sshd"? Can you try "nodestat -up -f compute" and see if you get the > same result? (Can you clear the appstatus field for each try and compare the > result.? > > Ling > > Ling Gao > Poughkeepsie Unix Development Lab > IBM Systems and Technology Group > Internal: T/L 293-5692 > External: [email protected], 845-433-5692 > > "I never worry about the future. It comes soon enough." --- Albert Einstein > > > > From: Daniel Letai <[email protected]> > To: xCAT Users Mailing list <[email protected]> > Date: 04/21/2015 09:59 AM > Subject: [xcat-user] nodestat -u differs from nodels > nodelist.appstatus > ________________________________ > > > > Hi, > > I'm trying to implement some kind of basic liveness test for nodes in > the cluster based on results from nodestat -p: > 1. noping(off) - powered off node > 2. noping(on) - zombie node > 3. noping - dead node (both os and ipmi non-responsive) > 4. sshd - live node > 5. all else (installing prep etc.) > > I have put "nodestat -up" into cron, and nodelist table gets updated > as expected, however when trying to get the same results I notice some > big differences. > For example: > # nodestat -up compute | grep -v sshd | wc -l > 48 > # nodels compute nodelist.appstatus=~"sshd=down" | wc -l > 4 > > I have confirmed all 4 nodes are a subset of the 48 nodes from nodestat. > The status column does provide the ping/noping(on|off) info, but for > most nodes with status noping the appstatus field shows "sshd=up" > > Am I missing something? > > The xCAT version is 2.8.3 > An upgrade to 2.9.x is not practical at this time (it is scheduled), > but in any event I hadn't seen anything in the release notes that > might be relevant. > > ------------------------------------------------------------------------------ > BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT > Develop your own process in accordance with the BPMN 2 standard > Learn Process modeling best practices with Bonita BPM through live exercises > http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ > source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF > _______________________________________________ > xCAT-user mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/xcat-user > > > > ------------------------------------------------------------------------------ > BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT > Develop your own process in accordance with the BPMN 2 standard > Learn Process modeling best practices with Bonita BPM through live exercises > http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ > source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF > _______________________________________________ > xCAT-user mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/xcat-user > ------------------------------------------------------------------------------ BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF _______________________________________________ xCAT-user mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/xcat-user
