Re: started getting repeated "bge0: PHY read timed out" messages

2007-03-15 Thread Chad Leigh -- Shire.Net LLC


On Mar 15, 2007, at 12:48 AM, Michael K. Smith - Adhost wrote:



> Have you considered hard-setting the speed/duplex to 1000/Full instead
> of 100/Full?  There may be some issues in the autonegotiation happening
> between switch and server.  We used to see some of this early on in
> inter-vendor GigE connections; perhaps the switch vendor and the FreeBSD
> devels are reading the standards differently.


I thought of that.  However, I thought that if that were the case,  
the problem would happen at boot and not start at some random time  
after boot and work for a while first.  Is that not a valid thought?


Chad



Mike


---
Chad Leigh -- Shire.Net LLC
Your Web App and Email hosting provider
chad at shire.net



___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


RE: started getting repeated "bge0: PHY read timed out" messages

2007-03-14 Thread Michael K. Smith - Adhost
Hello Chad:

-----Original Message-----
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Chad Leigh --
Shire.Net LLC
Sent: Wednesday, March 14, 2007 11:19 PM
To: User Questions
Subject: Re: started getting repeated "bge0: PHY read timed out"
messages


On Mar 6, 2007, at 9:55 PM, Chad Leigh -- Shire.Net LLC wrote:

>
> On Mar 6, 2007, at 5:08 PM, Michael K. Smith - Adhost wrote:
>
>>
>> Have you looked at the output of 'netstat -i' to see if there are
>> interface errors?  Also, have you looked at the switch-side interface
>> for errors, buffer problems, etc. (if that's possible)?  Finally,  
>> have
>> you swapped ports/cables on the switch?
>>
>> Regards,
>>
>> Mike
>
>
> Hi Mike
>
> Right now it seems to be behaving.  netstat -i shows no issues and   
> I have not had the problem recur again once I forced it from auto  
> negotiation which ended up at 1000tx full duplex to now be 100tx at  
> full duplex.  I don't know if it is related to the problem or not  
> or is a coincidence that once I did that the problem has not  
> arisen.  It happened about 3 times today before forcing the  
> connection to be 100tx.

OK, this problem has not resurfaced since I hardwired the settings on  
the port to 100tx and full duplex.

I have another system with the same board that got the same problem  
last Fall when I upgraded from 5.4 to 6.0 or 6.1.  The problem this  
time was on 6.2.  I have another one of these boards running on a  
system with 5.4 without issue at gigabit speed.  It appears that  
something happened in the bge driver from 5.x to 6.x at gigabit  
speeds, at least with some of the supported chipsets.

Chad



Have you considered hard-setting the speed/duplex to 1000/Full instead
of 100/Full?  There may be some issues in the autonegotiation happening
between switch and server.  We used to see some of this early on in
inter-vendor GigE connections; perhaps the switch vendor and the FreeBSD
devels are reading the standards differently.

Mike


Re: started getting repeated "bge0: PHY read timed out" messages

2007-03-14 Thread Chad Leigh -- Shire.Net LLC


On Mar 6, 2007, at 9:55 PM, Chad Leigh -- Shire.Net LLC wrote:

> On Mar 6, 2007, at 5:08 PM, Michael K. Smith - Adhost wrote:
>
>> Have you looked at the output of 'netstat -i' to see if there are
>> interface errors?  Also, have you looked at the switch-side interface
>> for errors, buffer problems, etc. (if that's possible)?  Finally, have
>> you swapped ports/cables on the switch?
>>
>> Regards,
>>
>> Mike
>
> Hi Mike
>
> Right now it seems to be behaving.  netstat -i shows no issues and
> I have not had the problem recur again once I forced it from auto
> negotiation which ended up at 1000tx full duplex to now be 100tx at
> full duplex.  I don't know if it is related to the problem or not
> or is a coincidence that once I did that the problem has not
> arisen.  It happened about 3 times today before forcing the
> connection to be 100tx.


OK, this problem has not resurfaced since I hardwired the settings on  
the port to 100tx and full duplex.


I have another system with the same board that got the same problem  
last Fall when I upgraded from 5.4 to 6.0 or 6.1.  The problem this  
time was on 6.2.  I have another one of these boards running on a  
system with 5.4 without issue at gigabit speed.  It appears that  
something happened in the bge driver from 5.x to 6.x at gigabit  
speeds, at least with some of the supported chipsets.


Chad



Re: started getting repeated "bge0: PHY read timed out" messages

2007-03-06 Thread Chad Leigh -- Shire.Net LLC


On Mar 6, 2007, at 5:08 PM, Michael K. Smith - Adhost wrote:



> Have you looked at the output of 'netstat -i' to see if there are
> interface errors?  Also, have you looked at the switch-side interface
> for errors, buffer problems, etc. (if that's possible)?  Finally, have
> you swapped ports/cables on the switch?
>
> Regards,
>
> Mike



Hi Mike

Right now it seems to be behaving.  netstat -i shows no issues and  I  
have not had the problem recur again once I forced it from auto  
negotiation which ended up at 1000tx full duplex to now be 100tx at  
full duplex.  I don't know if it is related to the problem or not or  
is a coincidence that once I did that the problem has not arisen.  It  
happened about 3 times today before forcing the connection to be 100tx.


If it starts to happen again I will try and log in to the console and  
do the netstat -i then...
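
The netstat -i check discussed above could be scripted roughly as follows. This is only a sketch: the column layout (Ipkts, Ierrs, Opkts, Oerrs, Coll as the last five columns) is assumed, and the sample output, including the bge1 row and all counter values, is made up for illustration and is not from the affected server.

```python
def rows_with_errors(netstat_text):
    """Return (name, ierrs, oerrs, coll) for rows whose error counters are nonzero."""
    flagged = []
    for line in netstat_text.splitlines()[1:]:  # skip the header row
        fields = line.split()
        if len(fields) < 6:
            continue
        try:
            # Read counters from the right, because the Address column may be empty.
            ipkts, ierrs, opkts, oerrs, coll = (int(f) for f in fields[-5:])
        except ValueError:
            continue  # per-address rows may show "-" instead of a count
        if ierrs or oerrs or coll:
            flagged.append((fields[0], ierrs, oerrs, coll))
    return flagged


# Hypothetical output; only the bge0 MAC address is taken from the thread.
SAMPLE = """\
Name    Mtu Network       Address              Ipkts Ierrs    Opkts Oerrs  Coll
bge0   1500 <Link#1>      00:e0:81:61:e9:a0   120034     0    98211     0     0
bge0   1500 166.70.252    server              110020     -    90111     -     -
bge1   1500 <Link#2>      00:00:00:00:00:01     4021    17     3999     0     0
lo0   16384 <Link#3>                             512     0      512     0     0
"""

print(rows_with_errors(SAMPLE))
```

An empty result would match the "netstat -i shows no issues" observation; it would not rule out a driver or PHY problem.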


In the morning I will break out my powerbook and get the switch port  
assignments off a chart I have there so I can log in to the switch  
and check the port for errors.


I have not swapped ports/cables on the switch this time around but  
have had the exact same symptoms on a server with the same HW that  
was using different cables and a different switch...  I may swap  
cables but the machine is downtown and I am not so unless I cannot  
make progress I want to avoid going downtown :-)


I may swap the mainboard if it happens again to see if it is HW.   I  
have one or two spare mainboards of the same sort.


Thanks!
Chad




RE: started getting repeated "bge0: PHY read timed out" messages

2007-03-06 Thread Michael K. Smith - Adhost
Hello:

> -----Original Message-----
> From: [EMAIL PROTECTED] [mailto:owner-freebsd-
> [EMAIL PROTECTED] On Behalf Of Chad Leigh -- Shire.Net LLC
> Sent: Tuesday, March 06, 2007 12:05 PM
> To: User Questions
> Subject: Re: started getting repeated "bge0: PHY read timed out"
> messages
> 
> 
> On Mar 6, 2007, at 9:20 AM, Chad Leigh -- Shire.Net LLC wrote:
> 
> > Hi
> >
> > After running fine for a while, my new server running 6.2-RELEASE
> > with latest security patches as of last Thursday or Friday started
> > giving the message
> >
> > bge0: PHY read timed out
> >
> > and I found the following in the system log
> 
> ok, it started happening again about 1.5 hours after the last
> reboot.  bge0 started going down and up a few times over about an
> hour, and then the "PHY read timed out" messages started up again.
> Before the very first time that this started, the server had run for
> about 4 days since it was newly installed.
> 
> Mar  6 09:09:23 server su: chad to root on /dev/ttyp0
> Mar  6 10:26:29 server kernel: bge0: link state changed to DOWN
> Mar  6 10:26:31 server kernel: bge0: link state changed to UP
> Mar  6 10:42:33 server kernel: bge0: link state changed to DOWN
> Mar  6 10:42:35 server kernel: bge0: link state changed to UP
> Mar  6 11:31:19 server kernel: bge0: PHY read timed out
> Mar  6 11:31:19 server last message repeated 3 times
> Mar  6 11:31:19 server kernel: bge0: link state changed to DOWN
> Mar  6 11:31:21 server kernel: bge0: PHY read timed out
> Mar  6 11:31:52 server last message repeated 116 times
> Mar  6 11:33:53 server last message repeated 488 times
> Mar  6 11:43:54 server last message repeated 2356 times
> Mar  6 11:53:56 server last message repeated 2372 times
> Mar  6 12:03:57 server last message repeated 2368 times
> Mar  6 12:09:53 server last message repeated 1399 times
> Mar  6 12:09:53 server kernel: bge0: watchdog timeout -- resetting
> Mar  6 12:09:53 server kernel: bge0: PHY read timed out
> Mar  6 12:09:53 server last message repeated 4 times
> Mar  6 12:09:53 server kernel: bge0: RX CPU self-diagnostics failed!
> Mar  6 12:09:53 server kernel: bge0: flow-through queue init failed
> Mar  6 12:09:53 server kernel: bge0: initialization failure
> Mar  6 12:09:54 server kernel: bge0: PHY read timed out
> Mar  6 12:10:25 server last message repeated 152 times
> Mar  6 12:12:27 server last message repeated 616 times
> Mar  6 12:22:29 server last message repeated 2540 times
> Mar  6 12:32:30 server last message repeated 2452 times
> Mar  6 12:42:31 server last message repeated 2524 times
> Mar  6 12:46:27 server last message repeated 1127 times
> Mar  6 12:46:27 server login: ROOT LOGIN (root) ON ttyv0
> Mar  6 12:46:29 server kernel: bge0: PHY read timed out
> Mar  6 12:46:41 server last message repeated 107 times
> Mar  6 12:46:40 server reboot: rebooted by root
> 
> here is an ifconfig
> 
> bge0: flags=8843 mtu 1500
>  options=1b
>  inet 166.70.252.128 netmask 0xff00 broadcast
> 166.70.252.255
>  inet 166.70.252.120 netmask 0x broadcast
> 166.70.252.120
>  inet 166.70.252.199 netmask 0x broadcast
> 166.70.252.199
>  ether 00:e0:81:61:e9:a0
>  media: Ethernet autoselect (1000baseTX )
>  status: active
> 
> and uname
> 
> # uname -a
> FreeBSD server.shire.net 6.2-RELEASE-p2 FreeBSD 6.2-RELEASE-p2 #1:
> Sat Mar  3 13:11:00 UTC 2007 [EMAIL PROTECTED]:/usr/obj/usr/
> src/sys/server  i386
> #
> 
> It is a Tyan S2850 single-Opteron system with a 2.4 GHz single-core
> Opteron.
> 
> Its dmesg ID is seen below in the quoted section.
> 
> I had another machine with this same MB that ran for a long time fine
> until I upgraded it to 6.0 or 6.1 last Fall and then I started to
> have the same problem (a post about it is in the archives).  I
> assumed it was a HW issue and turned off the port in the BIOS and
> used the other port until I took the machine offline as the customer
> using it no longer needed it.
> 
> Now this machine is having the same symptoms and I remember reading
> in the lists something about PHY and bge and some driver problems a
> while back but cannot find it now in the archives.
> 
> Could this be a SW problem or is it a HW issue?  Could it be related
> to the port it is connected to or the cable or something?  The other
> machine that had this problem was on a different switch brand.
> 
> Thanks
> Chad
> 
> >
> >
> > This appears to be a HW problem at first look.  But when the server
> > boots, it works fine for a while (hours, days??)
> >
> > Here is the id in the boot message
> >
>

Re: started getting repeated "bge0: PHY read timed out" messages

2007-03-06 Thread Chad Leigh -- Shire.Net LLC


On Mar 6, 2007, at 9:20 AM, Chad Leigh -- Shire.Net LLC wrote:


> Hi
>
> After running fine for a while, my new server running 6.2-RELEASE
> with latest security patches as of last Thursday or Friday started
> giving the message
>
> bge0: PHY read timed out
>
> and I found the following in the system log


ok, it started happening again about 1.5 hours after the last
reboot.  bge0 started going down and up a few times over about an
hour, and then the "PHY read timed out" messages started up again.
Before the very first time that this started, the server had run for
about 4 days since it was newly installed.


Mar  6 09:09:23 server su: chad to root on /dev/ttyp0
Mar  6 10:26:29 server kernel: bge0: link state changed to DOWN
Mar  6 10:26:31 server kernel: bge0: link state changed to UP
Mar  6 10:42:33 server kernel: bge0: link state changed to DOWN
Mar  6 10:42:35 server kernel: bge0: link state changed to UP
Mar  6 11:31:19 server kernel: bge0: PHY read timed out
Mar  6 11:31:19 server last message repeated 3 times
Mar  6 11:31:19 server kernel: bge0: link state changed to DOWN
Mar  6 11:31:21 server kernel: bge0: PHY read timed out
Mar  6 11:31:52 server last message repeated 116 times
Mar  6 11:33:53 server last message repeated 488 times
Mar  6 11:43:54 server last message repeated 2356 times
Mar  6 11:53:56 server last message repeated 2372 times
Mar  6 12:03:57 server last message repeated 2368 times
Mar  6 12:09:53 server last message repeated 1399 times
Mar  6 12:09:53 server kernel: bge0: watchdog timeout -- resetting
Mar  6 12:09:53 server kernel: bge0: PHY read timed out
Mar  6 12:09:53 server last message repeated 4 times
Mar  6 12:09:53 server kernel: bge0: RX CPU self-diagnostics failed!
Mar  6 12:09:53 server kernel: bge0: flow-through queue init failed
Mar  6 12:09:53 server kernel: bge0: initialization failure
Mar  6 12:09:54 server kernel: bge0: PHY read timed out
Mar  6 12:10:25 server last message repeated 152 times
Mar  6 12:12:27 server last message repeated 616 times
Mar  6 12:22:29 server last message repeated 2540 times
Mar  6 12:32:30 server last message repeated 2452 times
Mar  6 12:42:31 server last message repeated 2524 times
Mar  6 12:46:27 server last message repeated 1127 times
Mar  6 12:46:27 server login: ROOT LOGIN (root) ON ttyv0
Mar  6 12:46:29 server kernel: bge0: PHY read timed out
Mar  6 12:46:41 server last message repeated 107 times
Mar  6 12:46:40 server reboot: rebooted by root
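
To get a feel for how many timeouts a log like the one above represents, the syslog "last message repeated N times" lines can be expanded and summed. The following is a minimal sketch, assuming each repeat line refers to the most recent non-repeat message; the function name is made up for this example, and the sample lines are copied from the log above.

```python
import re

# Matches syslog's repeat-compression line, e.g.
# "Mar  6 11:31:52 server last message repeated 116 times"
REPEAT_RE = re.compile(r"last message repeated (\d+) times")


def count_messages(lines, needle="PHY read timed out"):
    """Count occurrences of `needle`, expanding syslog repeat lines."""
    total = 0
    last_was_needle = False
    for line in lines:
        m = REPEAT_RE.search(line)
        if m:
            # A repeat line adds N more copies of the message logged before it.
            if last_was_needle:
                total += int(m.group(1))
            continue
        last_was_needle = needle in line
        if last_was_needle:
            total += 1
    return total


sample = """\
Mar  6 11:31:19 server kernel: bge0: PHY read timed out
Mar  6 11:31:19 server last message repeated 3 times
Mar  6 11:31:19 server kernel: bge0: link state changed to DOWN
Mar  6 11:31:21 server kernel: bge0: PHY read timed out
Mar  6 11:31:52 server last message repeated 116 times
Mar  6 11:33:53 server last message repeated 488 times
""".splitlines()

print(count_messages(sample))  # 1 + 3 + 1 + 116 + 488 = 609
```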

here is an ifconfig

bge0: flags=8843 mtu 1500
options=1b
inet 166.70.252.128 netmask 0xff00 broadcast 166.70.252.255
inet 166.70.252.120 netmask 0x broadcast 166.70.252.120
inet 166.70.252.199 netmask 0x broadcast 166.70.252.199
ether 00:e0:81:61:e9:a0
media: Ethernet autoselect (1000baseTX )
status: active
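
Since the thread turns on whether the port is autonegotiating or hard-set, here is a small sketch that pulls the configured and active media out of an ifconfig "media:" line. The regular expression is an assumption based only on the line format shown above (the archive has stripped the bracketed option text), and the hard-set example line is hypothetical.

```python
import re

# "media: Ethernet <configured> (<active> ...)"; the parenthesised part is
# only present when the configured media is autoselect.
MEDIA_RE = re.compile(r"media:\s+Ethernet\s+(\S+)(?:\s+\(([^)\s]*)[^)]*\))?")


def parse_media(ifconfig_text):
    """Return the configured and active media, or None if no media line is found."""
    for line in ifconfig_text.splitlines():
        m = MEDIA_RE.search(line)
        if m:
            return {
                "configured": m.group(1),
                "active": m.group(2),
                "autoselect": m.group(1) == "autoselect",
            }
    return None


# Line as it appears in the ifconfig output above.
print(parse_media("media: Ethernet autoselect (1000baseTX )"))
# Hypothetical line for a port forced to 100 Mbit/s full duplex.
print(parse_media("media: Ethernet 100baseTX <full-duplex>"))
```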

and uname

# uname -a
FreeBSD server.shire.net 6.2-RELEASE-p2 FreeBSD 6.2-RELEASE-p2 #1:  
Sat Mar  3 13:11:00 UTC 2007 [EMAIL PROTECTED]:/usr/obj/usr/ 
src/sys/server  i386

#

It is a Tyan S2850 single-Opteron system with a 2.4 GHz single-core
Opteron.


Its dmesg ID is seen below in the quoted section.

I had another machine with this same MB that ran for a long time fine  
until I upgraded it to 6.0 or 6.1 last Fall and then I started to  
have the same problem (a post about it is in the archives).  I  
assumed it was a HW issue and turned off the port in the BIOS and  
used the other port until I took the machine offline as the customer  
using it no longer needed it.


Now this machine is having the same symptoms and I remember reading
in the lists something about PHY and bge and some driver problems a
while back but cannot find it now in the archives.


Could this be a SW problem or is it a HW issue?  Could it be related  
to the port it is connected to or the cable or something?  The other  
machine that had this problem was on a different switch brand.


Thanks
Chad




> This appears to be a HW problem at first look.  But when the server
> boots, it works fine for a while (hours, days??)
>
> Here is the id in the boot message
>
> Mar  6 09:01:21 server kernel: bge0: rev. 0x3003> mem 0xfeab-0xfeab irq 16 at device 14.0 on pci1
> Mar  6 09:01:21 server kernel: miibus0:  on bge0
> Mar  6 09:01:21 server kernel: brgphy0: PHY> on miibus0
> Mar  6 09:01:21 server kernel: brgphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto
> Mar  6 09:01:21 server kernel: bge0: Ethernet address: 00:e0:81:61:e9:a0
>
> Is this some sort of SW driver issue or is it a HW issue at first
> glance?  I remember kind of reading about some BGE issues a while
> back.
>
> Thanks
> Chad





Re: started getting repeated "bge0: PHY read timed out" messages

2007-03-06 Thread Chad Leigh -- Shire.Net LLC


On Mar 6, 2007, at 9:20 AM, Chad Leigh -- Shire.Net LLC wrote:


Hi

After running fine for a while, my new server running 6.2-RELEASE  
with latest security patches as of last Thursday or Friday started  
giving the message


Probably should have said  i386 version

# uname -a
FreeBSD server.shire.net 6.2-RELEASE-p2 FreeBSD 6.2-RELEASE-p2 #1:  
Sat Mar  3 13:11:00 UTC 2007 [EMAIL PROTECTED]:/usr/obj/usr/ 
src/sys/server  i386

#

Tyan S2850 single opteron 2gb ecc

new install from ISO then cvsup'ed to latest

Chad



bge0: PHY read timed out

and I found the following in the system log

Mar  6 02:59:33 server kernel: bge0: link state changed to DOWN
Mar  6 02:59:36 server kernel: bge0: link state changed to UP
Mar  6 03:00:27 server kernel: bge0: link state changed to DOWN
Mar  6 03:00:29 server kernel: bge0: link state changed to UP
Mar  6 05:09:27 server kernel: bge0: PHY read timed out
Mar  6 05:09:27 server last message repeated 3 times
Mar  6 05:09:27 server kernel: bge0: link state changed to DOWN
Mar  6 05:09:29 server kernel: bge0: PHY read timed out
Mar  6 05:09:50 server last message repeated 79 times
Mar  6 05:09:50 server kernel: bge0: watchdog timeout -- resetting
Mar  6 05:09:50 server kernel: bge0: PHY read timed out
Mar  6 05:09:50 server last message repeated 4 times
Mar  6 05:09:50 server kernel: bge0: RX CPU self-diagnostics failed!
Mar  6 05:09:50 server kernel: bge0: flow-through queue init failed
Mar  6 05:09:50 server kernel: bge0: initialization failure
Mar  6 05:09:52 server kernel: bge0: PHY read timed out
Mar  6 05:10:23 server last message repeated 140 times
Mar  6 05:12:24 server last message repeated 616 times
Mar  6 05:22:26 server last message repeated 2580 times
Mar  6 05:32:28 server last message repeated 2460 times
Mar  6 05:42:30 server last message repeated 2540 times
Mar  6 05:52:32 server last message repeated 2572 times
Mar  6 06:02:33 server last message repeated 2652 times
Mar  6 06:12:34 server last message repeated 2532 times
Mar  6 06:22:35 server last message repeated 2516 times
Mar  6 06:32:37 server last message repeated 2452 times
Mar  6 06:42:38 server last message repeated 2520 times
Mar  6 06:52:39 server last message repeated 2544 times
Mar  6 07:02:39 server last message repeated 2919 times
Mar  6 07:12:41 server last message repeated 2521 times
Mar  6 07:22:43 server last message repeated 2524 times
Mar  6 07:32:44 server last message repeated 2440 times
Mar  6 07:42:45 server last message repeated 2524 times
Mar  6 07:52:48 server last message repeated 2528 times
Mar  6 08:02:50 server last message repeated 2656 times
Mar  6 08:12:51 server last message repeated 2552 times
Mar  6 08:22:53 server last message repeated 2528 times
Mar  6 08:32:55 server last message repeated 2436 times
Mar  6 08:42:57 server last message repeated 2536 times
Mar  6 08:52:58 server last message repeated 2544 times
Mar  6 08:58:55 server last message repeated 1631 times

This appears to be a HW problem at first look.  But when the server  
boots, it works fine for a while (hours, days??)


Here is the id in the boot message

Mar  6 09:01:21 server kernel: bge0: rev. 0x3003> mem 0xfeab-0xfeab irq 16 at device 14.0 on pci1

Mar  6 09:01:21 server kernel: miibus0:  on bge0
Mar  6 09:01:21 server kernel: brgphy0: PHY> on miibus0
Mar  6 09:01:21 server kernel: brgphy0:  10baseT, 10baseT-FDX,  
100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto
Mar  6 09:01:21 server kernel: bge0: Ethernet address:  
00:e0:81:61:e9:a0


Is this some sort of SW driver issue or is it a HW issue at first  
glance?  I remember kind of reading about some BGE issues a while  
back.


Thanks
Chad



Re: started getting repeated "bge0: PHY read timed out" messages

2007-03-06 Thread tequnix

> 
> After running fine for a while, my new server running 6.2-RELEASE  
> with latest security patches as of last Thursday or Friday started  
> giving the message
> 
> bge0: PHY read timed out
> 
> and I found the following in the system log
> 
> Mar  6 02:59:33 server kernel: bge0: link state changed to DOWN
> Mar  6 02:59:36 server kernel: bge0: link state changed to UP
> Mar  6 03:00:27 server kernel: bge0: link state changed to DOWN
> Mar  6 03:00:29 server kernel: bge0: link state changed to UP
> Mar  6 05:09:27 server kernel: bge0: PHY read timed out
> Mar  6 05:09:27 server last message repeated 3 times
> Mar  6 05:09:27 server kernel: bge0: link state changed to DOWN
> Mar  6 05:09:29 server kernel: bge0: PHY read timed out
> Mar  6 05:09:50 server last message repeated 79 times
> Mar  6 05:09:50 server kernel: bge0: watchdog timeout -- resetting
> Mar  6 05:09:50 server kernel: bge0: PHY read timed out
> Mar  6 05:09:50 server last message repeated 4 times
> Mar  6 05:09:50 server kernel: bge0: RX CPU self-diagnostics failed!
> Mar  6 05:09:50 server kernel: bge0: flow-through queue init failed
> Mar  6 05:09:50 server kernel: bge0: initialization failure
> Mar  6 05:09:52 server kernel: bge0: PHY read timed out
> Mar  6 05:10:23 server last message repeated 140 times
> Mar  6 05:12:24 server last message repeated 616 times
> Mar  6 05:22:26 server last message repeated 2580 times
> Mar  6 05:32:28 server last message repeated 2460 times
> Mar  6 05:42:30 server last message repeated 2540 times
> Mar  6 05:52:32 server last message repeated 2572 times
> Mar  6 06:02:33 server last message repeated 2652 times
> Mar  6 06:12:34 server last message repeated 2532 times
> Mar  6 06:22:35 server last message repeated 2516 times
> Mar  6 06:32:37 server last message repeated 2452 times
> Mar  6 06:42:38 server last message repeated 2520 times
> Mar  6 06:52:39 server last message repeated 2544 times
> Mar  6 07:02:39 server last message repeated 2919 times
> Mar  6 07:12:41 server last message repeated 2521 times
> Mar  6 07:22:43 server last message repeated 2524 times
> Mar  6 07:32:44 server last message repeated 2440 times
> Mar  6 07:42:45 server last message repeated 2524 times
> Mar  6 07:52:48 server last message repeated 2528 times
> Mar  6 08:02:50 server last message repeated 2656 times
> Mar  6 08:12:51 server last message repeated 2552 times
> Mar  6 08:22:53 server last message repeated 2528 times
> Mar  6 08:32:55 server last message repeated 2436 times
> Mar  6 08:42:57 server last message repeated 2536 times
> Mar  6 08:52:58 server last message repeated 2544 times
> Mar  6 08:58:55 server last message repeated 1631 times
> 
> This appears to be a HW problem at first look.  But when the server  
> boots, it works fine for a while (hours, days??)
> 
> Here is the id in the boot message
> 
> Mar  6 09:01:21 server kernel: bge0:  0x3003> mem 0xfeab-0xfeab irq 16 at device 14.0 on pci1
> Mar  6 09:01:21 server kernel: miibus0:  on bge0
> Mar  6 09:01:21 server kernel: brgphy0:  PHY> on miibus0
> Mar  6 09:01:21 server kernel: brgphy0:  10baseT, 10baseT-FDX,  
> 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto
> Mar  6 09:01:21 server kernel: bge0: Ethernet address: 00:e0:81:61:e9:a0
> 
> Is this some sort of SW driver issue or is it a HW issue at first  
> glance?  I remember kind of reading about some BGE issues a while back.
> 
> Thanks
> Chad

sorry, no solution here, just a `me too`:

FreeBSD 6.2-RELEASE #0 i386
(in my case, it started right after upgrading from 6.1-RELEASE to
6.2-RELEASE)

[..]
Mar  4 20:01:39 lyekka kernel: sk0: link state changed to DOWN
Mar  4 20:01:45 lyekka kernel: sk0: link state changed to UP
Mar  4 20:02:18 lyekka kernel: sk0: link state changed to DOWN
Mar  4 20:02:20 lyekka kernel: sk0: link state changed to UP
Mar  4 20:02:35 lyekka kernel: sk0: link state changed to DOWN
Mar  4 20:02:39 lyekka kernel: sk0: link state changed to UP
Mar  4 21:50:35 lyekka kernel: sk0: link state changed to DOWN
Mar  4 21:50:38 lyekka kernel: sk0: link state changed to UP
Mar  4 21:50:52 lyekka kernel: sk0: link state changed to DOWN
Mar  4 21:50:54 lyekka kernel: sk0: link state changed to UP
Mar  5 00:45:00 lyekka kernel: sk0: link state changed to DOWN
Mar  5 05:28:56 lyekka kernel: sk0: phy write timeout
[..]

card is:
[..]
skc0:  port 0xa800-0xa8ff mem
0xef004000-0xef007fff irq 12 at device 7.0 on pci0
skc0: SK-9521 10/100/1000Base-T Adapter rev. (0x1)
sk0:  on skc0
[..]

after some time (it differs, normally after a few days) it is not
possible to send/receive data via this interface.
`ifconfig sk0 down ; ifconfig sk0 up` helps
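
The down/up workaround could be driven from the log, for example as below. This sketch only builds the ifconfig commands and does not run them; the function name is made up, and whether bouncing the interface helps on bge0 is an open question, since the bge0 logs in this thread show the driver's own reset failing.

```python
import re

# Matches the two PHY timeout messages quoted in this thread.
TIMEOUT_RE = re.compile(r"kernel: (\w+): (?:PHY read timed out|phy write timeout)")


def bounce_commands(log_lines):
    """Return ifconfig down/up commands for each interface that logged a PHY timeout."""
    seen = []
    for line in log_lines:
        m = TIMEOUT_RE.search(line)
        if m and m.group(1) not in seen:
            seen.append(m.group(1))
    # One "down" followed by one "up" per affected interface, in log order.
    return [["ifconfig", ifname, action] for ifname in seen for action in ("down", "up")]


log = [
    "Mar  5 00:45:00 lyekka kernel: sk0: link state changed to DOWN",
    "Mar  5 05:28:56 lyekka kernel: sk0: phy write timeout",
    "Mar  6 05:09:27 server kernel: bge0: PHY read timed out",
]

for cmd in bounce_commands(log):
    print(" ".join(cmd))
```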

i notice this UP/DOWN flap when i start/shutdown another machine with
direct connection to that one (nve card on

started getting repeated "bge0: PHY read timed out" messages

2007-03-06 Thread Chad Leigh -- Shire.Net LLC

Hi

After running fine for a while, my new server running 6.2-RELEASE  
with latest security patches as of last Thursday or Friday started  
giving the message


bge0: PHY read timed out

and I found the following in the system log

Mar  6 02:59:33 server kernel: bge0: link state changed to DOWN
Mar  6 02:59:36 server kernel: bge0: link state changed to UP
Mar  6 03:00:27 server kernel: bge0: link state changed to DOWN
Mar  6 03:00:29 server kernel: bge0: link state changed to UP
Mar  6 05:09:27 server kernel: bge0: PHY read timed out
Mar  6 05:09:27 server last message repeated 3 times
Mar  6 05:09:27 server kernel: bge0: link state changed to DOWN
Mar  6 05:09:29 server kernel: bge0: PHY read timed out
Mar  6 05:09:50 server last message repeated 79 times
Mar  6 05:09:50 server kernel: bge0: watchdog timeout -- resetting
Mar  6 05:09:50 server kernel: bge0: PHY read timed out
Mar  6 05:09:50 server last message repeated 4 times
Mar  6 05:09:50 server kernel: bge0: RX CPU self-diagnostics failed!
Mar  6 05:09:50 server kernel: bge0: flow-through queue init failed
Mar  6 05:09:50 server kernel: bge0: initialization failure
Mar  6 05:09:52 server kernel: bge0: PHY read timed out
Mar  6 05:10:23 server last message repeated 140 times
Mar  6 05:12:24 server last message repeated 616 times
Mar  6 05:22:26 server last message repeated 2580 times
Mar  6 05:32:28 server last message repeated 2460 times
Mar  6 05:42:30 server last message repeated 2540 times
Mar  6 05:52:32 server last message repeated 2572 times
Mar  6 06:02:33 server last message repeated 2652 times
Mar  6 06:12:34 server last message repeated 2532 times
Mar  6 06:22:35 server last message repeated 2516 times
Mar  6 06:32:37 server last message repeated 2452 times
Mar  6 06:42:38 server last message repeated 2520 times
Mar  6 06:52:39 server last message repeated 2544 times
Mar  6 07:02:39 server last message repeated 2919 times
Mar  6 07:12:41 server last message repeated 2521 times
Mar  6 07:22:43 server last message repeated 2524 times
Mar  6 07:32:44 server last message repeated 2440 times
Mar  6 07:42:45 server last message repeated 2524 times
Mar  6 07:52:48 server last message repeated 2528 times
Mar  6 08:02:50 server last message repeated 2656 times
Mar  6 08:12:51 server last message repeated 2552 times
Mar  6 08:22:53 server last message repeated 2528 times
Mar  6 08:32:55 server last message repeated 2436 times
Mar  6 08:42:57 server last message repeated 2536 times
Mar  6 08:52:58 server last message repeated 2544 times
Mar  6 08:58:55 server last message repeated 1631 times
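
The short DOWN/UP flaps that precede the timeouts can be measured by pairing the link state messages. This is a rough sketch: syslog timestamps carry no year, so 2007 is assumed from the date of the post, and the function names are made up for illustration. The sample lines are copied from the log above.

```python
import re
from datetime import datetime

LINK_RE = re.compile(r"kernel: (\w+): link state changed to (DOWN|UP)")


def parse_stamp(line, year=2007):
    # The first 15 characters hold the timestamp, e.g. "Mar  6 02:59:33".
    return datetime.strptime(f"{year} {line[:15]}", "%Y %b %d %H:%M:%S")


def link_flaps(lines, year=2007):
    """Pair DOWN/UP events; return (flaps, still_down).

    flaps is a list of (interface, seconds_down); still_down maps each
    interface that never came back up to the time it went down.
    """
    down_since = {}
    flaps = []
    for line in lines:
        m = LINK_RE.search(line)
        if not m:
            continue
        ifname, state = m.groups()
        when = parse_stamp(line, year)
        if state == "DOWN":
            down_since[ifname] = when
        elif ifname in down_since:
            flaps.append((ifname, (when - down_since.pop(ifname)).total_seconds()))
    return flaps, down_since


sample = """\
Mar  6 02:59:33 server kernel: bge0: link state changed to DOWN
Mar  6 02:59:36 server kernel: bge0: link state changed to UP
Mar  6 03:00:27 server kernel: bge0: link state changed to DOWN
Mar  6 03:00:29 server kernel: bge0: link state changed to UP
Mar  6 05:09:27 server kernel: bge0: PHY read timed out
Mar  6 05:09:27 server kernel: bge0: link state changed to DOWN
""".splitlines()

flaps, still_down = link_flaps(sample)
print(flaps)
print(still_down)
```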

This appears to be a HW problem at first look.  But when the server  
boots, it works fine for a while (hours, days??)


Here is the id in the boot message

Mar  6 09:01:21 server kernel: bge0: 0x3003> mem 0xfeab-0xfeab irq 16 at device 14.0 on pci1

Mar  6 09:01:21 server kernel: miibus0:  on bge0
Mar  6 09:01:21 server kernel: brgphy0: PHY> on miibus0
Mar  6 09:01:21 server kernel: brgphy0:  10baseT, 10baseT-FDX,  
100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto

Mar  6 09:01:21 server kernel: bge0: Ethernet address: 00:e0:81:61:e9:a0

Is this some sort of SW driver issue or is it a HW issue at first  
glance?  I remember kind of reading about some BGE issues a while back.


Thanks
Chad



Re: bge0: PHY read timed out

2006-09-09 Thread Chad Leigh -- Shire.Net LLC


On Sep 9, 2006, at 11:52 PM, Chad Leigh -- Shire.Net LLC wrote:



> I have a machine that is acting up.  It was doing this a couple
> weeks ago and I replaced the motherboard thinking that maybe the
> physical ethernet port was going bad (connector or chip).  The new
> motherboard is starting to act up the same way after a week or two
> of running fine and I am wondering if something else might be the
> issue.


I forgot to mention that the MB at issue is the Tyan S2850 single  
opteron MB.  This particular server had its MB replaced a week or two  
ago for the same sort of issue on bge1 that got progressively worse  
and I eventually turned it off in the BIOS.  Then a while later it  
started  seeming to freeze up with this same issue as listed below  
and I replaced the MB, which is starting to show the same issue.


I have a few other servers with the same MB and they have never had  
any issues.


Chad



> After the uname I list the stuff from the syslog before it goes off
> the net and my auto rebooter power strip thinks it has crashed and
> forces a reboot (which I have now turned off so I can go in and
> look more next time it happens).  This did not happen for the last
> week or two and all of a sudden happened about 5 times today...
>
> # uname -a
> FreeBSD whitwell.shire.net 6.1-RELEASE FreeBSD 6.1-RELEASE #1: Fri
> May 19 01:50:39 MDT 2006 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/WHITWELL  i386
> #
>
> Sep  9 20:49:02 whitwell kernel: bge0: PHY read timed out
> Sep  9 20:49:02 whitwell last message repeated 3 times
> Sep  9 20:49:02 whitwell kernel: bge0: link state changed to DOWN
> Sep  9 20:49:04 whitwell kernel: bge0: PHY read timed out
>
> What does this message mean?  PHY read timed out?   I did a google
> on it but the issues were various other things.
>
> Could this be a bad cable?
>
> Thanks
> Chad




bge0: PHY read timed out

2006-09-09 Thread Chad Leigh -- Shire.Net LLC


I have a machine that is acting up.  It was doing this a couple weeks  
ago and I replaced the motherboard thinking that maybe the physical  
ethernet port was going bad (connector or chip).  The new motherboard  
is starting to act up the same way after a week or two of running  
fine and I am wondering if something else might be the issue.


After the uname I list the stuff from the syslog before it goes off  
the net and my auto rebooter power strip thinks it has crashed and  
forces a reboot (which I have now turned off so I can go in and look  
more next time it happens).  This did not happen for the last week or  
two and all of a sudden happened about 5 times today...


# uname -a
FreeBSD whitwell.shire.net 6.1-RELEASE FreeBSD 6.1-RELEASE #1: Fri  
May 19 01:50:39 MDT 2006 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/ 
WHITWELL  i386

#


Sep  9 20:49:02 whitwell kernel: bge0: PHY read timed out
Sep  9 20:49:02 whitwell last message repeated 3 times
Sep  9 20:49:02 whitwell kernel: bge0: link state changed to DOWN
Sep  9 20:49:04 whitwell kernel: bge0: PHY read timed out

What does this message mean?  PHY read timed out?   I did a google on  
it but the issues were various other things.


Could this be a bad cable?

Thanks
Chad

