Re: Troubleshooting a slowdown problem?

KEVIN ZEMBOWER Fri, 18 Jun 2004 08:41:13 -0700

Frank, thanks, again, for your analysis.

When you mentioned the connection speed, I remembered that I had to ask our network 
administrators to change the speed and auto-negotiation properties on the Cisco switch 
that the old centernet host was plugged into to fix the speed at 100baseTx-FD and turn 
off auto-negoiation. I've forgotten to do that for the new centernet host. On the 
admin host, it was okay:


admin:~ # mii-diag
Using the default interface 'eth0'.
Basic registers of MII PHY #1:  2100 780d 02a8 0154 05e1 0000 0000 0000.
 Basic mode control register 0x2100: Auto-negotiation disabled, with
 Speed fixed at 100 mbps, full-duplex.
 You have link beat, and everything is working OK.
 Link partner information is not exchanged when in fixed speed mode.
   End of basic transceiver information.

admin:~ # 
admin:~ # ifconfig eth0
eth0      Link encap:Ethernet  HWaddr 00:90:27:B6:FB:E7  
          inet addr:172.16.2.7  Bcast:172.16.255.255  Mask:255.255.0.0
          inet6 addr: fe80::290:27ff:feb6:fbe7/10 Scope:Link
          inet6 addr: fe80::90:27b6:fbe7/10 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:22539412 errors:0 dropped:0 overruns:0 frame:0
          TX packets:16228564 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:100 
          RX bytes:768887232 (733.2 Mb)  TX bytes:1301570470 (1241.2 Mb)
          Interrupt:21 Base address:0x8000 

admin:~ # uptime
 11:00am  up 2 days, 18:06,  1 user,  load average: 2.01, 1.97, 1.59
admin:~ # date
Fri Jun 18 11:00:23 EDT 2004
admin:~ # 
 
Even thought the admin host has only been up 2 days, there are zero collisions and 
carrier errors. When I tested the file transfer speed between centernet and admin, 
before making any changes, I also noticed that auto-negoiation was on, and that it was 
not set to full duplex:

cn2:~# mii-diag
Using the default interface 'eth0'.
Basic registers of MII PHY #1:  3000 782d 02a8 0154 05e1 4081 0003 0000.
 The autonegotiated capability is 0080.
The autonegotiated media type is 100baseTx.
 Basic mode control register 0x3000: Auto-negotiation enabled.
 You have link beat, and everything is working OK.
 Your link partner advertised 4081: 100baseTx.
   End of basic transceiver informaion.

cn2:~# 

[EMAIL PROTECTED]:~/dblogs$ ncftpput -u kevinz -p xxxxxx admin ~/ 
20040610.popline..wpd 
20040610.popline.wpd:                                  988.09 MB   36.13 kB/s  
ncftpput 20040610.popline.wpd: data transfer aborted by local user.
[EMAIL PROTECTED]:~/dblogs$ 

After fixing the speed to 100baseTx-FD and turning auto-negoiation off, the speed 
improved 300 times:

cn2:~# mii-diag -F 100baseTx-FD
Using the default interface 'eth0'.
Setting the speed to "fixed", Control register 2100.
Basic registers of MII PHY #1:  2100 780d 02a8 0154 05e1 4081 0001 0000.
 The autonegotiated capability is 0080.
The autonegotiated media type is 100baseTx.
 Basic mode control register 0x2100: Auto-negotiation disabled, with
 Speed fixed at 100 mbps, full-duplex.
 You have link beat, and everything is working OK.
 Your link partner advertised 4081: 100baseTx.
   End of basic transceiver informaion.

cn2:~#

cn2:~# mii-diag   
Using the default interface 'eth0'.
Basic registers of MII PHY #1:  2100 780d 02a8 0154 05e1 4081 0001 0000.
 The autonegotiated capability is 0080.
The autonegotiated media type is 100baseTx.
 Basic mode control register 0x2100: Auto-negotiation disabled, with
 Speed fixed at 100 mbps, full-duplex.
 You have link beat, and everything is working OK.
 Your link partner advertised 4081: 100baseTx.
   End of basic transceiver informaion.

[EMAIL PROTECTED]:~/dblogs$ ncftpput -u kevinz -p xxxxxx admin ~/ 
20040610.popline..wpd 
20040610.popline.wpd:                                  988.09 MB   11.04 MB/s  
ncftpput 20040610.popline.wpd: data transfer aborted by local user.
[EMAIL PROTECTED]:~/dblogs$ 

I should have also noticed the large number of collision and carrier errors on 
centernet:

cn2:~# ifconfig eth0
eth0      Link encap:Ethernet  HWaddr 00:B0:D0:49:55:20  
          inet addr:172.16.2.4  Bcast:172.16.255.255  Mask:255.255.0.0
          IPX/Ethernet 802.3 addr:00000958:00B0D0495520
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:43966452 errors:0 dropped:0 overruns:0 frame:0
          TX packets:50236123 errors:0 dropped:0 overruns:25 carrier:10736563
          collisions:11145079 txqueuelen:100 
          RX bytes:1206617378 (1.1 GiB)  TX bytes:4000069366 (3.7 GiB)
          Interrupt:16 Base address:0x5000 

cn2:~# uptime
 11:00:06 up 45 days, 13 min,  1 user,  load average: 0.00, 0.04, 0.13
cn2:~# date
Fri Jun 18 11:00:07 EDT 2004
cn2:~# 

I compared this with the OLD centernet host, which I still have up and has been up 
much longer than either admin or the new centernet:
OLD centernet:
centernet:~ # ifconfig eth0
eth0      Link encap:Ethernet  HWaddr 00:A0:C9:E4:5B:D5  
          inet addr:172.16.2.6  Bcast:172.16.255.255  Mask:255.255.0.0
          inet6 addr: fe80::a0:c9e4:5bd5/10 Scope:Link
          inet6 addr: fe80::2a0:c9ff:fee4:5bd5/10 Scope:Link
          IPX/Ethernet 802.3 addr:00000958:00A0C9E45BD5
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:244996240 errors:0 dropped:0 overruns:0 frame:0
          TX packets:178882777 errors:0 dropped:0 overruns:0 carrier:2
          collisions:2 txqueuelen:100 
          Interrupt:14 Base address:0x2000 

centernet:~ # uptime
 11:06am  up 228 days, 5 min,  1 user,  load average: 0.99, 0.97, 0.91
centernet:~ # date
Fri Jun 18 11:06:55 EDT 2004
centernet:~ # 

So, based on all this, I think the main problem was the transfer speed. I've taken 
these steps to correct that, although I wish I could remember what I need to do to set 
the NIC to this speed and no auto-negoiation on boot-up. I'm also going to increase 
the interface le0 speed in amanda.conf from 400 to 4000kbps, as the backup runs at 
night when the network is lightly loaded. I don't know why the 33G of holding disk 
space doesn't seem like enough or isn't getting used. I'll reverse the order of 
holding disk hd1, which is only 8G, with hd2, 33G, so maybe it'll use the larger one 
first.

Thanks, again, for all your help. I'll address the problem you spotted with the last 
full dump of admin://db/f$ being overwritten in another message to the list, as I'm 
confused about this, too.

-Kevin

Re: Troubleshooting a slowdown problem?

Reply via email to