I'm a user of gentoo linux, and would like to raise a bug report. I hope these
are the correct channels; if not, I apologise in advance.

I'm using a realtek-based USB3 to RJ45 gigabit adapter. This plugs
directly into my laptop, which is a Toshiba Radius P20W-C-103, skylake
based, with the following controller:

```
00:14.0 USB controller: Intel Corporation Sunrise Point-LP USB 3.0
xHCI Controller (rev 21)
```

I am experiencing this on 4.9.79-r1, and also 4.14.22.

When I plug the device in, unless I disable power management on USB
hubs 3 and 4, I get errors saying 'root hub lost power or was reset'.
However, if I disable PM using powertop, I get the device to work
seemingly well. But, as soon as I start heavy transfers (in my case
distributed compile), the network device stops responding

The error messages that I'm receiving are very similar to a bug described
here: https://bugs.launchpad.net/dell-sputnik/+bug/1729674
But I've been told that it's unrelated.

First, these are the logs showing the dongle being plugged in.
```
Feb 26 20:17:09 nizuc kernel: usb usb3: root hub lost power or was reset
Feb 26 20:17:09 nizuc kernel: usb usb4: root hub lost power or was reset
Feb 26 20:17:41 nizuc kernel: usb 4-1: new SuperSpeed USB device
number 2 using xhci_hcd
Feb 26 20:17:41 nizuc kernel: usb 4-1: New USB device found,
idVendor=0bda, idProduct=8153
Feb 26 20:17:41 nizuc kernel: usb 4-1: New USB device strings: Mfr=1,
Product=2, SerialNumber=6
Feb 26 20:17:41 nizuc kernel: usb 4-1: Product: USB 10/100/1000 LAN
Feb 26 20:17:41 nizuc kernel: usb 4-1: Manufacturer: Realtek
Feb 26 20:17:41 nizuc kernel: usb 4-1: SerialNumber: 000001
Feb 26 20:17:41 nizuc kernel: usb 4-1: reset SuperSpeed USB device
number 2 using xhci_hcd
Feb 26 20:17:41 nizuc NetworkManager[2049]: <info>  [1519676261.9009]
manager: (eth0): new Ethernet device
(/org/freedesktop/NetworkManager/Devices/5)
Feb 26 20:17:41 nizuc kernel: r8152 4-1:1.0 eth0: v1.09.9
Feb 26 20:17:42 nizuc mtp-probe[3673]: checking bus 4, device 2:
"/sys/devices/pci0000:00/0000:00:1c.0/0000:01:00.0/usb4/4-1"
Feb 26 20:17:42 nizuc mtp-probe[3673]: bus: 4, device: 2 was not an MTP device
Feb 26 20:17:42 nizuc upowerd[2168]: unhandled action 'bind' on
/sys/devices/pci0000:00/0000:00:1c.0/0000:01:00.0/usb4/4-1
Feb 26 20:17:42 nizuc systemd-udevd[3676]: link_config:
autonegotiation is unset or enabled, the speed and duplex are not
writable.
Feb 26 20:17:42 nizuc kernel: r8152 4-1:1.0 enp1s0u1: renamed from eth0
Feb 26 20:17:42 nizuc NetworkManager[2049]: <info>  [1519676262.2645]
device (eth0): interface index 4 renamed iface from 'eth0' to
'enp1s0u1'
Feb 26 20:17:42 nizuc kernel: IPv6: ADDRCONF(NETDEV_UP): enp1s0u1:
link is not ready
Feb 26 20:17:42 nizuc NetworkManager[2049]: <info>  [1519676262.2829]
device (enp1s0u1): state change: unmanaged -> unavailable (reason
'managed', internal state 'external')
Feb 26 20:17:42 nizuc upowerd[2168]: unhandled action 'bind' on
/sys/devices/pci0000:00/0000:00:1c.0/0000:01:00.0/usb4/4-1/4-1:1.0
Feb 26 20:17:42 nizuc kernel: IPv6: ADDRCONF(NETDEV_UP): enp1s0u1:
link is not ready
Feb 26 20:17:46 nizuc kernel: r8152 4-1:1.0 enp1s0u1: carrier on
Feb 26 20:17:46 nizuc kernel: IPv6: ADDRCONF(NETDEV_CHANGE): enp1s0u1:
link becomes ready
```

but once I begin distrubted compile, the network drops, with the following
messages:

```
Feb 26 20:25:32 nizuc distccd[13928]: (dcc_r_token_int) got ARGV0000000f
Feb 26 20:25:32 nizuc distccd[13928]: (dcc_r_token_string) got '-Wa,-mtune=i686'
Feb 26 20:25:32 nizuc distccd[13928]: (dcc_r_argv) argv[22] = "-Wa,-mtune=i686"
Feb 26 20:25:32 nizuc distccd[13928]: (dcc_r_token_int) got ARGV00000011
Feb 26 20:25:32 nizuc kernel: xhci_hcd 0000:01:00.0: ERROR Transfer
event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
Feb 26 20:25:32 nizuc kernel: xhci_hcd 0000:01:00.0: Looking for
event-dma 000000020c8aea40 trb-start 000000020c8aea20 trb-end
000000020c8aea20 seg-start 000000020c8ae000 s>
Feb 26 20:25:32 nizuc kernel: xhci_hcd 0000:01:00.0: ERROR Transfer
event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
Feb 26 20:25:32 nizuc kernel: xhci_hcd 0000:01:00.0: Looking for
event-dma 000000020c8aea50 trb-start 000000020c8aea20 trb-end
000000020c8aea20 seg-start 000000020c8ae000 s>
Feb 26 20:25:32 nizuc distccd[14319]: (dcc_check_client) connection
from 192.168.2.1:42290
Feb 26 20:25:32 nizuc kernel: xhci_hcd 0000:01:00.0: ERROR Transfer
event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
Feb 26 20:25:32 nizuc kernel: xhci_hcd 0000:01:00.0: Looking for
event-dma 000000020c8aea60 trb-start 000000020c8aea20 trb-end
000000020c8aea20 seg-start 000000020c8ae000 s>
Feb 26 20:25:32 nizuc kernel: xhci_hcd 0000:01:00.0: ERROR Transfer
event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
Feb 26 20:25:32 nizuc kernel: xhci_hcd 0000:01:00.0: Looking for
event-dma 000000020c8aea70 trb-start 000000020c8aea20 trb-end
000000020c8aea20 seg-start 000000020c8ae000 s>
Feb 26 20:25:32 nizuc kernel: xhci_hcd 0000:01:00.0: ERROR Transfer
event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
Feb 26 20:25:32 nizuc kernel: xhci_hcd 0000:01:00.0: Looking for
event-dma 000000020c8aea80 trb-start 000000020c8aea20 trb-end
000000020c8aea20 seg-start 000000020c8ae000 s>
Feb 26 20:25:32 nizuc distccd[14110]: (dcc_check_client) connection
from 192.168.2.1:42292
Feb 26 20:25:32 nizuc kernel: xhci_hcd 0000:01:00.0: ERROR Transfer
event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
Feb 26 20:25:32 nizuc kernel: xhci_hcd 0000:01:00.0: Looking for
event-dma 000000020c8aea90 trb-start 000000020c8aea20 trb-end
000000020c8aea20 seg-start 000000020c8ae000 s>
Feb 26 20:25:32 nizuc kernel: xhci_hcd 0000:01:00.0: ERROR Transfer
event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
Feb 26 20:25:32 nizuc kernel: xhci_hcd 0000:01:00.0: Looking for
event-dma 000000020c8aeaa0 trb-start 000000020c8aea20 trb-end
000000020c8aea20 seg-start 000000020c8ae000 s>
Feb 26 20:25:32 nizuc kernel: xhci_hcd 0000:01:00.0: ERROR Transfer
event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
Feb 26 20:25:32 nizuc kernel: xhci_hcd 0000:01:00.0: Looking for
event-dma 000000020c8aeab0 trb-start 000000020c8aea20 trb-end
000000020c8aea20 seg-start 000000020c8ae000 s>
Feb 26 20:25:32 nizuc distccd[13928]: compile from check_addrinfo.c to
check_addrinfo.oS
Feb 26 20:25:32 nizuc distccd[14319]: compile from check_dns_packet.c
to check_dns_packet.oS
Feb 26 20:25:32 nizuc distccd[14110]: compile from check_hostent.c to
check_hostent.oS
Feb 26 20:26:23 nizuc kernel: r8152 4-1:1.0 enp1s0u1: Stop submitting
intr, status -71
Feb 26 20:26:24 nizuc kernel: xhci_hcd 0000:01:00.0: Cannot set link state.
Feb 26 20:26:24 nizuc kernel: usb usb4-port1: cannot disable (err = -32)
Feb 26 20:26:24 nizuc kernel: usb 4-1: USB disconnect, device number 2
Feb 26 20:26:26 nizuc kernel: r8152 4-1:1.0 enp1s0u1: linking down
Feb 26 20:26:26 nizuc NetworkManager[2049]: <info>  [1519676786.9555]
device (enp1s0u1): state change: activated -> unmanaged (reason
'removed', internal state 'removed')
```

The ubuntu bug which I believe to be related, had a link to the kernel mailing
list, where a patch was discussed, and the following makes be believe this may
be the same problem (https://lkml.org/lkml/2018/1/18/567):

>  > >> Excuse me. I don't understand why this patch is for specific USB nic 
> rather than
>  > xHCI.
>  > >> It seems to make the specific USB nic working and the other ones 
> keeping error.
>  > >
>  > > Well, are we sure that the device being in the TB16 dock doesn't
>  > > contribute to the issue as well?
>
>  Previous version of this patch checked the parent device to ensure it was in 
> TB16.
>  I believe there was negative feedback to that approach, which prompted the 
> discussion
>  to check bcdDevice and iSerialNumber with all vendors involved.
>
>  If it's still desirable to analyze parentage tree, I suppose bcdDevice, 
> iSerialNumber
>  and parent's USB device VID/PID can be analyzed  all at the same time.
>
>  >
>  > This is what vendors concluded for now. The very same NIC on WD15 doesn’t
>  > have the issue.
>
>  And just so it's extra clear to everyone on this list - WD15 has different 
> bcdDevice
>  iSerialNumber, and doesn't connect to ASMedia host controller.

I hope somebody here can direct me as to how to best proceed. Should I try
modifying the aforementioned patch and see if it helps (note the patch only
activates for certain usb vendor/device IDs). Also, the ubuntu link suggests
that this may be the result of "offloading checksuming" which can be disabled
with ethtool. Should I try disabling this and see if it makes a difference?
Many thanks for your patience. Please let me know if there is any more
information I can provide.
--
To unsubscribe from this list: send the line "unsubscribe linux-usb" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to