Re: [Nut-upsuser] stopping ups services resets my network

2009-10-28 Thread Antoine Gatineau
I will give more details on the symptomes asap. It will take some time
because the issue occures only after a few days.

I had another report of malfunctioning with nut today but it is a different
issue. I'll create another thread later. (cpu load and performance issue
after several weeks).

The UPS model is MGE evolution S 1250.
Nut-2.4.1 (home made rpm)
Red Hat 4.7

I'll comme back to you soon.

Antoine


 

-Message d'origine-
De : Arnaud Quette [mailto:aquette@gmail.com] 
Envoyé : mercredi 28 octobre 2009 10:24
À : Charles Lepple
Cc : Antoine Gatineau; nut-upsuser@lists.alioth.debian.org
Objet : Re: [Nut-upsuser] stopping ups services resets my network

2009/10/27 Charles Lepple :
>
> On Oct 27, 2009, at 11:56 AM, Arnaud Quette  wrote:
>
>> 2009/10/27 Antoine Gatineau:
>>>
>>>  Hello,
>>
>> salut Antoine,
>>
>>> I'm back again on this issue.
>>> I have contacted Red Hat support but they won't do anything to help 
>>> since it is not an official Red Hat package. Unless I prove the 
>>> conflict come from RH.
>>>
>>> So I am trying to figure out what is going on.
>>>
>>> Adding -D will not add any more log at the services stop.
>>>
>>> When I look at the sources, I couldn't find the piece of code that 
>>> release properly the usb UPS device. There is just a normal SIGTERM 
>>> handling. Is this normal?
>>
>> not at all, everything is clean here ;-) what you're looking for is 
>> drivers/usbhid-ups.c->upsdrv_cleanup()
>> the comm_driver->close() will more specifically call, in your case, 
>> the libusb.c backend, and so the usb_close() function.
>>
>>> I also noticed that there are debug logs but I don't know how to 
>>> activate them... Can somebody tell me how to do it?
>>
>> I'm not sure for RHEL, but you should look around /etc/syslog.conf, 
>> man syslog.conf and /var/log/debug
>>
> Arnaud,

Hey Charles,

> Antoine may be referring to the debug information from the drivers but 
> I don't think we have narrowed it down to something that can be 
> resolved with those logs.

possible. imo, Antoine was talking about the few "upslogx(LOG_DEBUG...", but
it's quite possible that he was refering to the upsdebugx().

Antoine: if you're refering to the latter, you should launch the driver in
debug mode, ie:
$ /path/to/usbhid-ups -D -a 

cheers,
Arnaud
--
Linux / Unix Expert R&D - Eaton - http://www.eaton.com/mgeops Network UPS
Tools (NUT) Project Leader - http://www.networkupstools.org/ Debian
Developer - http://www.debian.org Free Software Developer -
http://arnaud.quette.free.fr/


___
Nut-upsuser mailing list
Nut-upsuser@lists.alioth.debian.org
http://lists.alioth.debian.org/mailman/listinfo/nut-upsuser


Re: [Nut-upsuser] stopping ups services resets my network

2009-10-28 Thread Arnaud Quette
2009/10/27 Charles Lepple :
>
> On Oct 27, 2009, at 11:56 AM, Arnaud Quette  wrote:
>
>> 2009/10/27 Antoine Gatineau:
>>>
>>>  Hello,
>>
>> salut Antoine,
>>
>>> I'm back again on this issue.
>>> I have contacted Red Hat support but they won't do anything to help since
>>> it
>>> is not an official Red Hat package. Unless I prove the conflict come from
>>> RH.
>>>
>>> So I am trying to figure out what is going on.
>>>
>>> Adding -D will not add any more log at the services stop.
>>>
>>> When I look at the sources, I couldn't find the piece of code that
>>> release
>>> properly the usb UPS device. There is just a normal SIGTERM handling. Is
>>> this normal?
>>
>> not at all, everything is clean here ;-)
>> what you're looking for is drivers/usbhid-ups.c->upsdrv_cleanup()
>> the comm_driver->close() will more specifically call, in your case,
>> the libusb.c backend, and so the usb_close() function.
>>
>>> I also noticed that there are debug logs but I don't know how to activate
>>> them... Can somebody tell me how to do it?
>>
>> I'm not sure for RHEL, but you should look around /etc/syslog.conf,
>> man syslog.conf and /var/log/debug
>>
> Arnaud,

Hey Charles,

> Antoine may be referring to the debug information from the drivers but I
> don't think we have narrowed it down to something that can be resolved with
> those logs.

possible. imo, Antoine was talking about the few
"upslogx(LOG_DEBUG...", but it's quite possible that he was refering
to the upsdebugx().

Antoine: if you're refering to the latter, you should launch the
driver in debug mode, ie:
$ /path/to/usbhid-ups -D -a 

cheers,
Arnaud
-- 
Linux / Unix Expert R&D - Eaton - http://www.eaton.com/mgeops
Network UPS Tools (NUT) Project Leader - http://www.networkupstools.org/
Debian Developer - http://www.debian.org
Free Software Developer - http://arnaud.quette.free.fr/

___
Nut-upsuser mailing list
Nut-upsuser@lists.alioth.debian.org
http://lists.alioth.debian.org/mailman/listinfo/nut-upsuser

Re: [Nut-upsuser] stopping ups services resets my network

2009-10-27 Thread Charles Lepple


On Oct 27, 2009, at 11:56 AM, Arnaud Quette   
wrote:



2009/10/27 Antoine Gatineau:

 Hello,


salut Antoine,


I'm back again on this issue.
I have contacted Red Hat support but they won't do anything to help  
since it
is not an official Red Hat package. Unless I prove the conflict  
come from

RH.

So I am trying to figure out what is going on.

Adding -D will not add any more log at the services stop.

When I look at the sources, I couldn't find the piece of code that  
release
properly the usb UPS device. There is just a normal SIGTERM  
handling. Is

this normal?


not at all, everything is clean here ;-)
what you're looking for is drivers/usbhid-ups.c->upsdrv_cleanup()
the comm_driver->close() will more specifically call, in your case,
the libusb.c backend, and so the usb_close() function.

I also noticed that there are debug logs but I don't know how to  
activate

them... Can somebody tell me how to do it?


I'm not sure for RHEL, but you should look around /etc/syslog.conf,
man syslog.conf and /var/log/debug


Arnaud,

Antoine may be referring to the debug information from the drivers but  
I don't think we have narrowed it down to something that can be  
resolved with those logs.



___
Nut-upsuser mailing list
Nut-upsuser@lists.alioth.debian.org
http://lists.alioth.debian.org/mailman/listinfo/nut-upsuser


Re: [Nut-upsuser] stopping ups services resets my network

2009-10-27 Thread Arnaud Quette
2009/10/27 Antoine Gatineau:
>  Hello,

salut Antoine,

> I'm back again on this issue.
> I have contacted Red Hat support but they won't do anything to help since it
> is not an official Red Hat package. Unless I prove the conflict come from
> RH.
>
> So I am trying to figure out what is going on.
>
> Adding -D will not add any more log at the services stop.
>
> When I look at the sources, I couldn't find the piece of code that release
> properly the usb UPS device. There is just a normal SIGTERM handling. Is
> this normal?

not at all, everything is clean here ;-)
what you're looking for is drivers/usbhid-ups.c->upsdrv_cleanup()
the comm_driver->close() will more specifically call, in your case,
the libusb.c backend, and so the usb_close() function.

> I also noticed that there are debug logs but I don't know how to activate
> them... Can somebody tell me how to do it?

I'm not sure for RHEL, but you should look around /etc/syslog.conf,
man syslog.conf and /var/log/debug

> Do you have any advice on how to debug this? I have not seen anything
> particuliar about IRQs and there is no message in syslog when stopping ups.
>
> The fact is that I don't know at all what is blocking my network card and
> even less why... :(

Charles has a point in his later mail: what if NUT is not running?

cheers,
Arnaud
-- 
Linux / Unix Expert R&D - Eaton - http://www.eaton.com/mgeops
Network UPS Tools (NUT) Project Leader - http://www.networkupstools.org/
Debian Developer - http://www.debian.org
Free Software Developer - http://arnaud.quette.free.fr/

___
Nut-upsuser mailing list
Nut-upsuser@lists.alioth.debian.org
http://lists.alioth.debian.org/mailman/listinfo/nut-upsuser

Re: [Nut-upsuser] stopping ups services resets my network

2009-10-27 Thread Charles Lepple
Also, is this the same UPS that had EMI issues in the past? What brand  
and model is it?


On Oct 27, 2009, at 6:36 AM, "Antoine Gatineau" > wrote:



Hello,

I'm back again on this issue.
I have contacted Red Hat support but they won't do anything to help  
since it
is not an official Red Hat package. Unless I prove the conflict come  
from

RH.

So I am trying to figure out what is going on.

Adding -D will not add any more log at the services stop.

When I look at the sources, I couldn't find the piece of code that  
release
properly the usb UPS device. There is just a normal SIGTERM  
handling. Is

this normal?

I also noticed that there are debug logs but I don't know how to  
activate

them... Can somebody tell me how to do it?

Do you have any advice on how to debug this? I have not seen anything
particuliar about IRQs and there is no message in syslog when  
stopping ups.


The fact is that I don't know at all what is blocking my network  
card and

even less why... :(

Thanks,
Antoine



-Message d'origine-
De : Charles Lepple [mailto:clep...@gmail.com]
Envoyé : vendredi 2 octobre 2009 01:05
À : Antoine Gatineau
Cc : nut-upsuser@lists.alioth.debian.org
Objet : Re: [Nut-upsuser] stopping ups services resets my network

On Mon, Sep 28, 2009 at 6:40 AM, Antoine Gatineau
 wrote:

Hello,


I have this issue: ups is working fine for a time but when I stop the
services or if I unplug the ups device (over usb) I get the following
in /var/log/messages

I am running nut-2.4.1 on RHEL4.

Sep 28 14:44:57 mabolas upsd[24151]: Signal 15: exiting Sep 28
14:45:06 mabolas upsmon[24156]: Signal 15: exiting Sep 28 15:04:10
mabolas sshd(pam_unix)[29190]: session closed for user root Sep 28
15:04:36 mabolas kernel: NETDEV WATCHDOG: eth0: transmit timed out  
Sep

28 15:04:36 mabolas kernel: tg3: eth0: transmit timed out, resetting
Sep 28 15:04:36 mabolas kernel: tg3: DEBUG: MAC_TX_STATUS[0008]
MAC_RX_STATUS[] Sep 28 15:04:36 mabolas kernel: tg3: DEBUG:
RDMAC_STATUS[] WDMAC_STATUS[] Sep 28 15:04:36 mabolas
kernel: tg3: eth0: Link is down.
Sep 28 15:04:38 mabolas kernel: tg3: eth0: Link is up at 1000 Mbps,
full duplex.
Sep 28 15:04:38 mabolas kernel: tg3: eth0: Flow control is off for TX
and off for RX.


This sounds like a bug in the tg3 driver in that kernel version.  
Have you

checked the RedHat bugzilla to see if anyone else sees this?

Depending on how the motherboard is laid out, moving the UPS to a  
different

USB port might avoid this problem. It could have something to do with
interrupt handling (for example, the tg3 driver expects an interrupt  
that is

stolen by the USB controller).

--
- Charles Lepple


___
Nut-upsuser mailing list
Nut-upsuser@lists.alioth.debian.org
http://lists.alioth.debian.org/mailman/listinfo/nut-upsuser


___
Nut-upsuser mailing list
Nut-upsuser@lists.alioth.debian.org
http://lists.alioth.debian.org/mailman/listinfo/nut-upsuser

Re: [Nut-upsuser] stopping ups services resets my network

2009-10-27 Thread Charles Lepple
You said this also happens when you unplug the USB cable for the UPS-  
what if NUT is not running when you unplug USB?


On Oct 27, 2009, at 6:36 AM, "Antoine Gatineau" > wrote:



Hello,

I'm back again on this issue.
I have contacted Red Hat support but they won't do anything to help  
since it
is not an official Red Hat package. Unless I prove the conflict come  
from

RH.

So I am trying to figure out what is going on.

Adding -D will not add any more log at the services stop.

When I look at the sources, I couldn't find the piece of code that  
release
properly the usb UPS device. There is just a normal SIGTERM  
handling. Is

this normal?

I also noticed that there are debug logs but I don't know how to  
activate

them... Can somebody tell me how to do it?

Do you have any advice on how to debug this? I have not seen anything
particuliar about IRQs and there is no message in syslog when  
stopping ups.


The fact is that I don't know at all what is blocking my network  
card and

even less why... :(

Thanks,
Antoine



-Message d'origine-
De : Charles Lepple [mailto:clep...@gmail.com]
Envoyé : vendredi 2 octobre 2009 01:05
À : Antoine Gatineau
Cc : nut-upsuser@lists.alioth.debian.org
Objet : Re: [Nut-upsuser] stopping ups services resets my network

On Mon, Sep 28, 2009 at 6:40 AM, Antoine Gatineau
 wrote:

Hello,


I have this issue: ups is working fine for a time but when I stop the
services or if I unplug the ups device (over usb) I get the following
in /var/log/messages

I am running nut-2.4.1 on RHEL4.

Sep 28 14:44:57 mabolas upsd[24151]: Signal 15: exiting Sep 28
14:45:06 mabolas upsmon[24156]: Signal 15: exiting Sep 28 15:04:10
mabolas sshd(pam_unix)[29190]: session closed for user root Sep 28
15:04:36 mabolas kernel: NETDEV WATCHDOG: eth0: transmit timed out  
Sep

28 15:04:36 mabolas kernel: tg3: eth0: transmit timed out, resetting
Sep 28 15:04:36 mabolas kernel: tg3: DEBUG: MAC_TX_STATUS[0008]
MAC_RX_STATUS[] Sep 28 15:04:36 mabolas kernel: tg3: DEBUG:
RDMAC_STATUS[] WDMAC_STATUS[] Sep 28 15:04:36 mabolas
kernel: tg3: eth0: Link is down.
Sep 28 15:04:38 mabolas kernel: tg3: eth0: Link is up at 1000 Mbps,
full duplex.
Sep 28 15:04:38 mabolas kernel: tg3: eth0: Flow control is off for TX
and off for RX.


This sounds like a bug in the tg3 driver in that kernel version.  
Have you

checked the RedHat bugzilla to see if anyone else sees this?

Depending on how the motherboard is laid out, moving the UPS to a  
different

USB port might avoid this problem. It could have something to do with
interrupt handling (for example, the tg3 driver expects an interrupt  
that is

stolen by the USB controller).

--
- Charles Lepple


___
Nut-upsuser mailing list
Nut-upsuser@lists.alioth.debian.org
http://lists.alioth.debian.org/mailman/listinfo/nut-upsuser


___
Nut-upsuser mailing list
Nut-upsuser@lists.alioth.debian.org
http://lists.alioth.debian.org/mailman/listinfo/nut-upsuser

Re: [Nut-upsuser] stopping ups services resets my network

2009-10-27 Thread Antoine Gatineau
 Hello,

I'm back again on this issue.
I have contacted Red Hat support but they won't do anything to help since it
is not an official Red Hat package. Unless I prove the conflict come from
RH.

So I am trying to figure out what is going on.

Adding -D will not add any more log at the services stop.

When I look at the sources, I couldn't find the piece of code that release
properly the usb UPS device. There is just a normal SIGTERM handling. Is
this normal?

I also noticed that there are debug logs but I don't know how to activate
them... Can somebody tell me how to do it?

Do you have any advice on how to debug this? I have not seen anything
particuliar about IRQs and there is no message in syslog when stopping ups.

The fact is that I don't know at all what is blocking my network card and
even less why... :(

Thanks,
Antoine



-Message d'origine-
De : Charles Lepple [mailto:clep...@gmail.com] 
Envoyé : vendredi 2 octobre 2009 01:05
À : Antoine Gatineau
Cc : nut-upsuser@lists.alioth.debian.org
Objet : Re: [Nut-upsuser] stopping ups services resets my network

On Mon, Sep 28, 2009 at 6:40 AM, Antoine Gatineau
 wrote:
> Hello,
>
>
> I have this issue: ups is working fine for a time but when I stop the 
> services or if I unplug the ups device (over usb) I get the following 
> in /var/log/messages
>
> I am running nut-2.4.1 on RHEL4.
>
> Sep 28 14:44:57 mabolas upsd[24151]: Signal 15: exiting Sep 28 
> 14:45:06 mabolas upsmon[24156]: Signal 15: exiting Sep 28 15:04:10 
> mabolas sshd(pam_unix)[29190]: session closed for user root Sep 28 
> 15:04:36 mabolas kernel: NETDEV WATCHDOG: eth0: transmit timed out Sep 
> 28 15:04:36 mabolas kernel: tg3: eth0: transmit timed out, resetting 
> Sep 28 15:04:36 mabolas kernel: tg3: DEBUG: MAC_TX_STATUS[0008] 
> MAC_RX_STATUS[] Sep 28 15:04:36 mabolas kernel: tg3: DEBUG: 
> RDMAC_STATUS[] WDMAC_STATUS[] Sep 28 15:04:36 mabolas 
> kernel: tg3: eth0: Link is down.
> Sep 28 15:04:38 mabolas kernel: tg3: eth0: Link is up at 1000 Mbps, 
> full duplex.
> Sep 28 15:04:38 mabolas kernel: tg3: eth0: Flow control is off for TX 
> and off for RX.

This sounds like a bug in the tg3 driver in that kernel version. Have you
checked the RedHat bugzilla to see if anyone else sees this?

Depending on how the motherboard is laid out, moving the UPS to a different
USB port might avoid this problem. It could have something to do with
interrupt handling (for example, the tg3 driver expects an interrupt that is
stolen by the USB controller).

--
- Charles Lepple


___
Nut-upsuser mailing list
Nut-upsuser@lists.alioth.debian.org
http://lists.alioth.debian.org/mailman/listinfo/nut-upsuser


Re: [Nut-upsuser] stopping ups services resets my network

2009-10-01 Thread Charles Lepple
On Mon, Sep 28, 2009 at 6:40 AM, Antoine Gatineau
 wrote:
> Hello,
>
>
> I have this issue: ups is working fine for a time but when I stop the
> services or if I unplug the ups device (over usb) I get the following in
> /var/log/messages
>
> I am running nut-2.4.1 on RHEL4.
>
> Sep 28 14:44:57 mabolas upsd[24151]: Signal 15: exiting
> Sep 28 14:45:06 mabolas upsmon[24156]: Signal 15: exiting
> Sep 28 15:04:10 mabolas sshd(pam_unix)[29190]: session closed for user root
> Sep 28 15:04:36 mabolas kernel: NETDEV WATCHDOG: eth0: transmit timed out
> Sep 28 15:04:36 mabolas kernel: tg3: eth0: transmit timed out, resetting
> Sep 28 15:04:36 mabolas kernel: tg3: DEBUG: MAC_TX_STATUS[0008]
> MAC_RX_STATUS[]
> Sep 28 15:04:36 mabolas kernel: tg3: DEBUG: RDMAC_STATUS[]
> WDMAC_STATUS[]
> Sep 28 15:04:36 mabolas kernel: tg3: eth0: Link is down.
> Sep 28 15:04:38 mabolas kernel: tg3: eth0: Link is up at 1000 Mbps, full
> duplex.
> Sep 28 15:04:38 mabolas kernel: tg3: eth0: Flow control is off for TX and
> off for RX.

This sounds like a bug in the tg3 driver in that kernel version. Have
you checked the RedHat bugzilla to see if anyone else sees this?

Depending on how the motherboard is laid out, moving the UPS to a
different USB port might avoid this problem. It could have something
to do with interrupt handling (for example, the tg3 driver expects an
interrupt that is stolen by the USB controller).

-- 
- Charles Lepple

___
Nut-upsuser mailing list
Nut-upsuser@lists.alioth.debian.org
http://lists.alioth.debian.org/mailman/listinfo/nut-upsuser


Re: [Nut-upsuser] stopping ups services resets my network

2009-10-01 Thread Antoine Gatineau
 Hi,

So anybody heard about this issue?
I've been trying to troubleshoot but nothing wierd in the debug output or in
syslog...

Antoine

-Message d'origine-
De : Antoine Gatineau [mailto:antoine.gatin...@alcatel-lucent.com] 
Envoyé : lundi 28 septembre 2009 12:40
À : nut-upsuser@lists.alioth.debian.org
Cc : antoine.gatin...@alcatel-lucent.com
Objet : [Nut-upsuser] stopping ups services resets my network

Hello,


I have this issue: ups is working fine for a time but when I stop the
services or if I unplug the ups device (over usb) I get the following in
/var/log/messages

I am running nut-2.4.1 on RHEL4.

Sep 28 14:44:57 mabolas upsd[24151]: Signal 15: exiting 
Sep 28 14:45:06 mabolas upsmon[24156]: Signal 15: exiting 
Sep 28 15:04:10 mabolas sshd(pam_unix)[29190]: session closed for user root 
Sep 28 15:04:36 mabolas kernel: NETDEV WATCHDOG: eth0: transmit timed out 
Sep 28 15:04:36 mabolas kernel: tg3: eth0: transmit timed out, resetting 
Sep 28 15:04:36 mabolas kernel: tg3: DEBUG: MAC_TX_STATUS[0008]
MAC_RX_STATUS[] 
Sep 28 15:04:36 mabolas kernel: tg3: DEBUG: RDMAC_STATUS[]
WDMAC_STATUS[] 
Sep 28 15:04:36 mabolas kernel: tg3: eth0: Link is down.
Sep 28 15:04:38 mabolas kernel: tg3: eth0: Link is up at 1000 Mbps, full
duplex.
Sep 28 15:04:38 mabolas kernel: tg3: eth0: Flow control is off for TX and
off for RX. 

As soon I kill usbhid-ups, I lost my ssh connexion. Logs say ssh is lost
after 15m but in fact I couldn't type anything just after the ups stop. The
network is resetted 15 minutes later. 

It appears this error occures only when nut has run for some days (4 days in
this case). When stopping ups after few hours, nothing happens.

I also have noticed some wierd behaviours on other applications that might
be related. I'm not so sure it is the same so I will make a seperate thread
when I have more debug info on that.

Can you help me fix this?

Regards,
Antoine


___
Nut-upsuser mailing list
Nut-upsuser@lists.alioth.debian.org
http://lists.alioth.debian.org/mailman/listinfo/nut-upsuser


[Nut-upsuser] stopping ups services resets my network

2009-09-28 Thread Antoine Gatineau
Hello,


I have this issue: ups is working fine for a time but when I stop the
services or if I unplug the ups device (over usb) I get the following in
/var/log/messages

I am running nut-2.4.1 on RHEL4.

Sep 28 14:44:57 mabolas upsd[24151]: Signal 15: exiting
Sep 28 14:45:06 mabolas upsmon[24156]: Signal 15: exiting
Sep 28 15:04:10 mabolas sshd(pam_unix)[29190]: session closed for user root
Sep 28 15:04:36 mabolas kernel: NETDEV WATCHDOG: eth0: transmit timed out
Sep 28 15:04:36 mabolas kernel: tg3: eth0: transmit timed out, resetting
Sep 28 15:04:36 mabolas kernel: tg3: DEBUG: MAC_TX_STATUS[0008]
MAC_RX_STATUS[]
Sep 28 15:04:36 mabolas kernel: tg3: DEBUG: RDMAC_STATUS[]
WDMAC_STATUS[]
Sep 28 15:04:36 mabolas kernel: tg3: eth0: Link is down.
Sep 28 15:04:38 mabolas kernel: tg3: eth0: Link is up at 1000 Mbps, full
duplex.
Sep 28 15:04:38 mabolas kernel: tg3: eth0: Flow control is off for TX and
off for RX. 

As soon I kill usbhid-ups, I lost my ssh connexion. Logs say ssh is lost
after 15m but in fact I couldn't type anything just after the ups stop. The
network is resetted 15 minutes later. 

It appears this error occures only when nut has run for some days (4 days in
this case). When stopping ups after few hours, nothing happens.

I also have noticed some wierd behaviours on other applications that might
be related. I'm not so sure it is the same so I will make a seperate thread
when I have more debug info on that.

Can you help me fix this?

Regards,
Antoine


___
Nut-upsuser mailing list
Nut-upsuser@lists.alioth.debian.org
http://lists.alioth.debian.org/mailman/listinfo/nut-upsuser