my current jffnms is a 0.8.4 version on rhel 4.4 and the kernel is

Linux nbsf000p5ap06 2.6.9-42.0.3.EL #1 SMP Mon Sep 25 17:22:04 EDT 2006
ppc64 ppc64 ppc64 GNU/Linux

and works fine


2014-07-22 14:32 GMT-03:00 wireless <wirel...@tampabay.rr.com>:

> On 07/21/14 08:58, Sebastian Martinez wrote:
>
>> os Red Hat Enterprise Linux Server release 6.5 (Santiago)
>> kernel Linux nbsf000p5ap06.nbsf.com.ar
>> <http://nbsf000p5ap06.nbsf.com.ar> 2.6.32-431.17.1.el6.ppc64 #1 SMP Fri
>>
>> Apr 11 17:30:35 EDT 2014 ppc64 ppc64 ppc64 GNU/Linux
>>
>
> This is a very old kernel. I'm not sure about the PPC architecture,
> but, you may want to try running jffnms on a system other than PPC
> with a kernel specifically tuned to run JFFNMS, since it is a standalone
> system for your needs.
>
>
>
>
>  the disk is a lun on dell storage vnx 5500
>>
>>   sdparm /dev/sdh
>>      /dev/sdh: DGC       VRAID             0532
>> Read write error recovery mode page:
>>    AWRE        1  [cha: n, def:  1, sav:  1]
>>    ARRE        1  [cha: n, def:  1, sav:  1]
>>    PER         1  [cha: n, def:  1, sav:  1]
>> Caching (SBC) mode page:
>>    WCE         0  [cha: y, def:  0, sav:  0]
>>    RCD         0  [cha: y, def:  1, sav:  1]
>> Control mode page:
>>    SWP         0  [cha: n, def:  0, sav:  0]
>>
>> the system is dedicated to jffnms
>>
>
> Might an SSD speed things along?  Ceratainly new, faster RAM will.
>
>  top - 10:57:31 up 41 days, 23:14,  1 user,  load average: 0.52, 0.36, 0.30
>> Tasks: 206 total,   2 running, 204 sleeping,   0 stopped,   0 zombie
>> Cpu(s):  1.5%us,  0.5%sy,  0.0%ni, 97.8%id,  0.1%wa,  0.1%hi,  0.0%si,
>>   0.0%st
>> Mem:   2052736k total,  2032832k used,    19904k free,   145664k buffers
>> Swap:  3145600k total,   155264k used,  2990336k free,   988992k cached
>>
>>    PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
>>   6734 mysql     20   0 2070m  33m 9664 S  3.0  1.7 632:36.61 mysqld
>> 23838 apache    20   0  102m  33m  16m R  2.0  1.7   0:22.14 httpd
>> 15752 jffcom    20   0  175m  46m  27m S  0.7  2.3   0:01.13 php
>>   2330 root      20   0  5632 3840 2432 R  0.3  0.2   0:00.01 top
>>   7403 jffcom    20   0  175m  46m  27m S  0.3  2.3   0:04.79 php
>> 15859 jffcom    20   0  175m  46m  27m S  0.3  2.3   0:01.35 php
>> 15860 jffcom    20   0  175m  47m  27m S  0.3  2.3   0:02.20 php
>> 19365 jffcom    20   0  175m  47m  27m S  0.3  2.4   0:07.26 php
>> 21821 jffcom    20   0  182m  41m  27m S  0.3  2.1   2:19.43 php
>>
>>
>>
>> 2014-07-21 11:51 GMT-03:00 wireless <wirel...@tampabay.rr.com
>> <mailto:wirel...@tampabay.rr.com>>:
>>
>>
>>     On 07/21/14 08:21, Sebastian Martinez wrote:
>>      > another example
>>      >
>>      > this inteface fails randomly but when i run the poller manualy
>>     works ok
>>      >
>>      > 10:01:50 CH:12 (13134):  :  H 255 :  I 1482 :  P   1 :
>>      > reachability_start:ping(): 53cd0f3d7ed74 -> buffer(): 1 (time P:
>>     3.36 | B:
>>      > 0.04)
>>      > 10:01:58 CH:12 (13134):  :  H 255 :  I 1482 :  P 123 :
>>      > reachability_values:rtt(rtt): 0 -> buffer(): 2 (time P: 0.11 | B:
>>     0.01)
>>      > 10:01:58 CH:12 (13134):  :  H 255 :  I 1482 :  P 124 :
>>      > reachability_values:packetloss(pl): 50 -> buffer(): 3 (time P:
>>     0.04 | B:
>>      > 0.01)
>>      > 10:01:58 CH:12 (13134):  :  H 255 :  I 1482 :  P 125 : status():
>>      > unreachable|100% Packet Loss -> alarm(40): Event Added: 1221844
>>     (time P:  0.01 | B: 4.02)
>>      >
>>
>
> Transient failures  on communications (packetloss) can cause problems?
> The question is what is the root cause of those dropped packets (wireshark
> is your friend, when searching for network bottlenecks).
> If the cause of the dropped packets in on the local machine, you have
> greater troubles than jffnms.....
>
>
>       >   php -q poller.php -i 1482 -o -F
>>
>
> suffice is to say,  php might not be the most robust of languages
> to use for this poller. popen is a facinating idea, that should be
> explored in craigs latest thread.
>
>
>
>      What is the Operating system you are running on ?  (version of OS?)
>>
>
> Redhat Enterprise.
>
>
>      If linux, what version of the kernel are you using?
>>
>
> 2.6.32-431.17.1.el6.ppc64
>
>
>      What is your hardrive)s) specs?  (sdparm /dev/sda ?   man sdparm)
>>
>
> An SSD would help keenly.
>
>
>>     Have you checked (htop, iotop etc) your system resources while the
>>     errors occur?
>>     What other major software do you have running on your system?
>>
>
> Delete as much as possible.
>
>
>      Do these problems correlate to other heavy load events on your system?
>>     Do these problems occur when your system is 'lightly loaded' ?
>>
>
> This ideas need to be correlated with the timestamps of your jffnms faults.
>
>
> What file systems are you using on your linux machine? Hopefully, at least
> ext4? if not something more aggressive?
> I'm suspecting the old kernel and the bloatwhere of the commercial linux
> distro are the culprits. Build up a Gentoo box, on a 64 bit (amd/intel)
> with ample ram and an SSD, and only put what you need on the machine. I'll
> bet your problems are not verifiable. (I bet they go away, otherwise other
> folks would be complaining [loudly])? PPC is a problematic linux build, for
> a variety of reasons. x86_64 is your best
> bet for a maintainable platform, imho.
>
> This one is up to you and Craig....
>
> Take all of this with a grain of salt, as your constraints may not be
> changeable.
>
>
> hth,
> James
>
>
>
------------------------------------------------------------------------------
Want fast and easy access to all the code in your enterprise? Index and
search up to 200,000 lines of code with a free copy of Black Duck
Code Sight - the same software that powers the world's largest code
search on Ohloh, the Black Duck Open Hub! Try it now.
http://p.sf.net/sfu/bds
_______________________________________________
jffnms-users mailing list
jffnms-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/jffnms-users

Reply via email to