Dear all, we purchased six PE-740xd systems with integrated Broadcom 57416 dual-port 10 Gbit network daughter cards. The systems are running Scientific Linux 7.4, already with the 7.5 kernel. Even so, the problem has shown up repeatedly ever since we put the systems into production (with SL 7.3 at that time).
Current kernel version:
[pascal05] /root # uname -a
Linux pascal05.zeuthen.desy.de 3.10.0-862.3.3.el7.x86_64 #1 SMP Thu Jun 14 15:28:39 CDT 2018 x86_64 x86_64 x86_64 GNU/Linux
All six systems lose network connectivity sporadically, although
one or two of them seem to be more affected than the others. When this
happens, syslog is full of errors like these:
Jun 21 08:12:51 pascal05 kernel: bnxt_en 0000:19:00.0 em1: Error (timeout: 500) msg {0x51 0x11f} len:0
Jun 21 08:12:51 pascal05 kernel: bnxt_en 0000:19:00.0 em1: hwrm_ring_free cp failed. rc:-1
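To compare how badly each host or port is affected, the timeout messages can be counted per interface. The following is only a minimal sketch: the function name count_timeouts and the regular expression are illustrative assumptions derived from the two log lines quoted here, and it expects each syslog entry to be on a single line.

```python
import re
from collections import Counter

# Assumption: matches the "Error (timeout: N)" lines emitted by bnxt_en,
# in the format quoted in this post. Other driver versions may differ.
TIMEOUT_RE = re.compile(
    r"kernel: bnxt_en (?P<pci>\S+) (?P<ifname>\S+): Error \(timeout: \d+\)"
)

def count_timeouts(lines):
    """Count bnxt_en timeout messages per (PCI address, interface name)."""
    counts = Counter()
    for line in lines:
        match = TIMEOUT_RE.search(line)
        if match:
            counts[(match.group("pci"), match.group("ifname"))] += 1
    return counts
```

On an SL 7 host this could be fed from the syslog file, for example count_timeouts(open("/var/log/messages", errors="replace")), assuming the default log location.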
Googling the problem, I found only one other report, on pastebin:
https://pastebin.com/DBhMSsaG
It matches our problem exactly, although it is an R640 in that case.
Anybody else seeing this? Solutions / mitigations to overcome this nasty
issue are most welcome!
Thanks in advance,
Andreas
--
| Andreas Haupt | E-Mail: [email protected]
| DESY Zeuthen | WWW: http://www-zeuthen.desy.de/~ahaupt
| Platanenallee 6 | Phone: +49/33762/7-7359
| D-15738 Zeuthen | Fax: +49/33762/7-7216
_______________________________________________
Linux-PowerEdge mailing list
[email protected]
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
