Re: Packet-corruption with re(4)

2008-05-18 Thread Peter Ankerstål


On Apr 29, 2008, at 2:08 PM, Jeremy Chadwick wrote:





I'd recommend staying away from Realtek NICs.  Pick up an Intel Pro/ 
1000

GT or PT.  Realtek has a well-known history of issues.


Just wanted to tell you guys that so far a em(4) seems to have fixed  
the problem.

--
Peter Ankerstål
[EMAIL PROTECTED]


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Packet-corruption with re(4)

2008-05-01 Thread Alexandre Sunny Kovalenko
On Tue, 2008-04-29 at 22:58 +0200, Torfinn Ingolfsen wrote:
 On Tue, 29 Apr 2008 05:08:34 -0700
 Jeremy Chadwick [EMAIL PROTECTED] wrote:
 
  I'd recommend staying away from Realtek NICs.  Pick up an Intel
  Pro/1000 GT or PT.  Realtek has a well-known history of issues.
 
 I hear that story very often, so often that I almost think it's a fairy
 tale. :-)
 Most of the times the RealTek NIC story is told, it isn't backed up
 with any references to evidence. Draw your own conclusions.
Well, my latest encounter with Netgear GA311 Rev. A1 (RTL8169S-32)
Gigabit Ethernet falls neatly into the fairy tale category -- it has
jittery bit in the chip ID register and will advertise itself as 8129 or
8169 depending on the phase of the moon. While it is not fair to judge
the whole product line based on the single specimen, it took me some
time to figure it out, and, on top of that, I have wasted time of some
people, valuable to the FreeBSD project, who were kind enough to review
submitted PR and close it. I am not planning on buying more of these to
get representative statistics, though ;)

The card in question was purchased as the replacement for the 'fxp',
which was sitting in that system for a long while, so I think I am in my
right to blame the card and not the system.

-- 
Alexandre Sunny Kovalenko (Олександр Коваленко)

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Packet-corruption with re(4)

2008-04-29 Thread Peter Ankerstål

Hi,

Im using a built in re(4) card and it seems like its causes data- 
corruption as soon as it gets some load (or after a few hours online)



The machine is running FreeBSD 7.0R:

FreeBSD ninja 7.0-RELEASE FreeBSD 7.0-RELEASE #0: Wed Apr 16 22:49:15  
CEST 2008 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/NINJA  amd64


(using a RELENG_7 kernel updated today)


If I download alot of files with wget or something everything works  
fine for a while until the download just stalls and tcpdump reports  
bad cksum


13:04:25.839408 IP (tos 0x0, ttl 64, id 23912, offset 0, flags [DF],  
proto TCP (6), length 1500, bad cksum 0 (-f40a)!) host1.http   
host2.53895: . 115065:116513(1448) ack 1421 win 8326  
nop,nop,timestamp 2316944721 2174233192

and so on

again and again until the connection times out.

re0: RealTek 8168/8111B PCIe Gigabit Ethernet port 0xb800-0xb8ff mem  
0xff9ff000-0xff9f irq 19 at device 0.0 on pci3

re0: Using 2 MSI messages
miibus0: MII bus on re0
rgephy0: RTL8169S/8110S/8211B media interface PHY 1 on miibus0
rgephy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT,  
1000baseT-FDX, auto

re0: Ethernet address: 00:1d:60:74:be:e3
re0: [FILTER]
re0: [FILTER]


[EMAIL PROTECTED]:3:0:0:	class=0x02 card=0x81aa1043 chip=0x816810ec  
rev=0x01 hdr=0x00

vendor = 'Realtek Semiconductor'
device = 'RTL8168/8111 PCI-E Gigabit Ethernet NIC'
class  = network
subclass   = ethernet


A strange thing is that encrypted data seems to work better than  
plaintext. https works for a longer time (after boot) then http for  
example.



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Packet-corruption with re(4)

2008-04-29 Thread Jeremy Chadwick
On Tue, Apr 29, 2008 at 01:18:10PM +0200, Peter Ankerstål wrote:
 Im using a built in re(4) card and it seems like its causes data-corruption 
 as soon as it gets some load (or after a few hours online)

 The machine is running FreeBSD 7.0R:

 FreeBSD ninja 7.0-RELEASE FreeBSD 7.0-RELEASE #0: Wed Apr 16 22:49:15 CEST 
 2008 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/NINJA  amd64

 (using a RELENG_7 kernel updated today)


 If I download alot of files with wget or something everything works fine 
 for a while until the download just stalls and tcpdump reports bad cksum

 13:04:25.839408 IP (tos 0x0, ttl 64, id 23912, offset 0, flags [DF], proto 
 TCP (6), length 1500, bad cksum 0 (-f40a)!) host1.http  host2.53895: . 
 115065:116513(1448) ack 1421 win 8326 nop,nop,timestamp 2316944721 
 2174233192
 and so on
 again and again until the connection times out.

tcpdump reporting bad cksum can occur due to TX/RX checksum
offloading.  Do you not see this message normally, but only when the
problem begins?

Have you tried turning off TX/RX offloading to see if the erroneous
behaviour goes away?

Have you tried disabling RFC1323 to see if that's actually what's
responsible for the stalls you're seeing?  sysctl net.inet.tcp.rfc1323=0

I'd recommend staying away from Realtek NICs.  Pick up an Intel Pro/1000
GT or PT.  Realtek has a well-known history of issues.

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Packet-corruption with re(4)

2008-04-29 Thread Peter Ankerstål


On Apr 29, 2008, at 2:08 PM, Jeremy Chadwick wrote:




tcpdump reporting bad cksum can occur due to TX/RX checksum
offloading.  Do you not see this message normally, but only when the
problem begins?

Have you tried turning off TX/RX offloading to see if the erroneous
behaviour goes away?

Have you tried disabling RFC1323 to see if that's actually what's
responsible for the stalls you're seeing?  sysctl  
net.inet.tcp.rfc1323=0


I'd recommend staying away from Realtek NICs.  Pick up an Intel Pro/ 
1000

GT or PT.  Realtek has a well-known history of issues.



Now Ive tried:
ninja# sysctl net.inet.tcp.rfc1323=0
net.inet.tcp.rfc1323: 1 - 0
ninja# ifconfig re0 -rxcsum -txcsum -tso -lro

The problem remains, but when I have -rxcsum/tx tcpdump does not say  
anything

about bad checksums but it send out the same ack over and over.
and it seems like bigger files are the most effected (images rather  
then html-files)


I know, I have em in all other machines..  :/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Packet-corruption with re(4)

2008-04-29 Thread Jeremy Chadwick
On Tue, Apr 29, 2008 at 02:30:27PM +0200, Peter Ankerstål wrote:

 On Apr 29, 2008, at 2:08 PM, Jeremy Chadwick wrote:


 tcpdump reporting bad cksum can occur due to TX/RX checksum
 offloading.  Do you not see this message normally, but only when the
 problem begins?

 Have you tried turning off TX/RX offloading to see if the erroneous
 behaviour goes away?

 Have you tried disabling RFC1323 to see if that's actually what's
 responsible for the stalls you're seeing?  sysctl net.inet.tcp.rfc1323=0

 I'd recommend staying away from Realtek NICs.  Pick up an Intel Pro/1000
 GT or PT.  Realtek has a well-known history of issues.


 Now Ive tried:
 ninja# sysctl net.inet.tcp.rfc1323=0
 net.inet.tcp.rfc1323: 1 - 0
 ninja# ifconfig re0 -rxcsum -txcsum -tso -lro

 The problem remains, but when I have -rxcsum/tx tcpdump does not say 
 anything
 about bad checksums but it send out the same ack over and over.
 and it seems like bigger files are the most effected (images rather then 
 html-files)

Okay, so the problem is likely not with checksums, but more of a problem
with network I/O suddenly stopping for no apparent reason.

Have you tried disabling MSI/MSI-X via /boot/loader.conf to see if that
makes any difference?  I wonder if somehow interrupts are no longer
firing for the re(4) card.  Try putting these in loader.conf and
rebooting:

hw.pci.enable_msi=0
hw.pci.enable_msix=0

Otherwise, I think Pyun YongHyeon might have some better ideas.

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Packet-corruption with re(4)

2008-04-29 Thread Torfinn Ingolfsen
On Tue, 29 Apr 2008 13:18:10 +0200
Peter Ankerstål [EMAIL PROTECTED] wrote:

 Hi,
 
 Im using a built in re(4) card and it seems like its causes data- 
 corruption as soon as it gets some load (or after a few hours online)

IIRC, this is a known issue with if_re and 7.0-R. It is fixed in
7.0-stable, but I don't remember the date. 
You can find patches here: http://people.freebsd.org/~yongari/re/

In my case, the problem raised its ugly head after some (unknown) amount
of data had been transferred.

HTH
-- 
Regards,
Torfinn Ingolfsen

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Packet-corruption with re(4)

2008-04-29 Thread Torfinn Ingolfsen
On Tue, 29 Apr 2008 05:08:34 -0700
Jeremy Chadwick [EMAIL PROTECTED] wrote:

 I'd recommend staying away from Realtek NICs.  Pick up an Intel
 Pro/1000 GT or PT.  Realtek has a well-known history of issues.

I hear that story very often, so often that I almost think it's a fairy
tale. :-)
Most of the times the RealTek NIC story is told, it isn't backed up
with any references to evidence. Draw your own conclusions.

I can only tell in my experience (several motherboards from
different manufacturers) with integrated RealTek NICs, both 10/100 and
10/100/1000), RealTek NICs have mostly been working as advertised,
without trouble.
-- 
Regards,
Torfinn Ingolfsen

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Packet-corruption with re(4)

2008-04-29 Thread Jeremy Chadwick
On Tue, Apr 29, 2008 at 10:58:55PM +0200, Torfinn Ingolfsen wrote:
 On Tue, 29 Apr 2008 05:08:34 -0700
 Jeremy Chadwick [EMAIL PROTECTED] wrote:
 
  I'd recommend staying away from Realtek NICs.  Pick up an Intel
  Pro/1000 GT or PT.  Realtek has a well-known history of issues.
 
 I hear that story very often, so often that I almost think it's a fairy
 tale. :-)
 Most of the times the RealTek NIC story is told, it isn't backed up
 with any references to evidence. Draw your own conclusions.

First, I'd check out the BUGS section of the re(4) and rl(4) manpages.

Second, I'd look at some of the CVS commit messages for said drivers;
you'll find a plethora of bugs which aren't mentioned in the manpages
either:

http://www.freebsd.org/cgi/cvsweb.cgi/src/sys/pci/if_rl.c
http://www.freebsd.org/cgi/cvsweb.cgi/src/sys/dev/re/if_re.c

Third, have you tried reading any of their technical/designer docs for
their ICs?  They're horrible.  Very hard to read, and lack concise
descriptions.  There's a lot of missing data.

 I can only tell in my experience (several motherboards from
 different manufacturers) with integrated RealTek NICs, both 10/100 and
 10/100/1000), RealTek NICs have mostly been working as advertised,
 without trouble.

And in my experiences, Realtek PHYs are absolutely horrible when it
comes to auto-negotiation.  I've only had the pleasure of dealing with
Realtek trash on Windows, where occasionally you'll find a Changelog
that comes with the driver set.  They play around with auto-neg
methodologies in the code quite often.

If you go through the -stable archives, you'll see Realtek is
continually complained about from an end-user perspective.  The state of
affairs is sad, since many consumer motherboard vendors use Realtek
exclusively (I'm looking at you, Gigabyte).  It's sad that most (not
all) consumer boards lack Intel PHY/NICs these days.

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Packet-corruption with re(4)

2008-04-29 Thread Sean C. Farley

On Tue, 29 Apr 2008, Jeremy Chadwick wrote:


I'd recommend staying away from Realtek NICs.  Pick up an Intel
Pro/1000 GT or PT.  Realtek has a well-known history of issues.


Just a note about the Intel Pro/1000 GT:  it is on-sale[1] at NewEgg
with free shipping.  I just bought my third one to replace an on-board
NIC (Marvell Yukon 88E8056) with auto-negotiation problems and an
inability to take advantage of jumbo frames at the same time as checksum
offloading.

Sean
  1. http://www.newegg.com/Product/Product.aspx?Item=N82E16833106121
--
[EMAIL PROTECTED]
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]