Hello all,

Linux (stable) 3.10.27 built from the git sources crashes right at
boot with a "skb_over_panic" error. This only happens during boot
(probably related to the sysvinit sequence), it does not happen when I
modprobe the driver. It seems to be some race condition because
sometimes I do not hit the bug.

I attached gdb to the kernel via serial, here is a meaningful backtrace:
(gdb) bt
#0  skb_panic (skb=skb@entry=0xe874f200, sz=<optimized out>,
addr=<optimized out>, msg=msg@entry=0xc14
75e3c "skb_over_panic") at net/core/skbuff.c:126
#1  0xc13851db in skb_over_panic (addr=<optimized out>, sz=<optimized
out>, skb=0xe874f200) at net/core/skbuff.c:131
#2  skb_put (skb=skb@entry=0xe874f200, len=len@entry=46969) at
net/core/skbuff.c:1283
#3  0xc12f7e7e in e1000_clean_rx_irq (rx_ring=0xe8f11340,
work_done=0xe8cfdf84, work_to_do=64) at driv
ers/net/ethernet/intel/e1000e/netdev.c:1011
#4  0xc12fd5c7 in e1000e_poll (napi=0xe8e78788, weight=64) at
drivers/net/ethernet/intel/e1000e/netdev
.c:2654
#5  0xc138edb9 in net_rx_action (h=<optimized out>) at net/core/dev.c:4197
#6  0xc10302e9 in __do_softirq () at kernel/softirq.c:253
#7  0xc10030ec in call_on_stack (stack=0xe8cbfecc, func=<optimized
out>) at arch/x86/kernel/irq_32.c:7
1
#8  do_softirq () at arch/x86/kernel/irq_32.c:173
#9  0xc1030466 in invoke_softirq () at kernel/softirq.c:342
#10 irq_exit () at kernel/softirq.c:376
#11 0xc1002fa5 in do_IRQ (regs=<optimized out>) at arch/x86/kernel/irq.c:198
#12 <signal handler called>
#13 0xc1367d19 in cpuidle_enter_state (dev=dev@entry=0xeafe6600,
drv=drv@entry=0xc1574420, index=index
@entry=8) at drivers/cpuidle/cpuidle.c:92
#14 0xc1367e03 in cpuidle_idle_call () at drivers/cpuidle/cpuidle.c:157
#15 0xc1008485 in arch_cpu_idle () at arch/x86/kernel/process.c:301
#16 0xc1054b4f in cpu_idle_loop () at kernel/cpu/idle.c:98
#17 cpu_startup_entry (state=state@entry=CPUHP_ONLINE) at kernel/cpu/idle.c:133
#18 0xc15c6938 in start_secondary (unused=<optimized out>) at
arch/x86/kernel/smpboot.c:287
#19 0x00000000 in ?? ()

The relevant section of lspci:
lspci -t -v
-[0000:00]-+-1c.0-[01]----00.0  Intel Corporation 82574L Gigabit
Network Connection

lspci -s 01:00.0 -vvv
01:00.0 Ethernet controller: Intel Corporation 82574L Gigabit Network Connection
    Subsystem: Intel Corporation Device 0000
    Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop-
ParErr- Stepping- SERR- FastB2B- DisINTx+
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort-
<TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0, Cache Line Size: 64 bytes
    Interrupt: pin A routed to IRQ 16
    Region 0: Memory at b4600000 (32-bit, non-prefetchable) [size=128K]
    Region 2: I/O ports at 7000 [size=32]
    Region 3: Memory at b4620000 (32-bit, non-prefetchable) [size=16K]
    Capabilities: [c8] Power Management version 2
        Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA
PME(D0+,D1-,D2-,D3hot+,D3cold-)
        Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=1 PME-
    Capabilities: [d0] MSI: Enable- Count=1/1 Maskable- 64bit+
        Address: 0000000000000000  Data: 0000
    Capabilities: [e0] Express (v1) Endpoint, MSI 00
        DevCap:    MaxPayload 256 bytes, PhantFunc 0, Latency L0s
<512ns, L1 <64us
            ExtTag- AttnBtn- AttnInd- PwrInd- RBE+ FLReset-
        DevCtl:    Report errors: Correctable+ Non-Fatal+ Fatal+ Unsupported+
            RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+
            MaxPayload 128 bytes, MaxReadReq 512 bytes
        DevSta:    CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend-
        LnkCap:    Port #0, Speed 2.5GT/s, Width x1, ASPM L0s L1,
Latency L0 <128ns, L1 <64us
            ClockPM- Surprise- LLActRep- BwNot-
        LnkCtl:    ASPM Disabled; RCB 64 bytes Disabled- Retrain- CommClk+
            ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
        LnkSta:    Speed 2.5GT/s, Width x1, TrErr- Train- SlotClk+
DLActive- BWMgmt- ABWMgmt-
    Capabilities: [a0] MSI-X: Enable+ Count=5 Masked-
        Vector table: BAR=3 offset=00000000
        PBA: BAR=3 offset=00002000
    Capabilities: [100 v1] Advanced Error Reporting
        UESta:    DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt-
RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
        UEMsk:    DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt-
RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
        UESvrt:    DLP+ SDES- TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt-
RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol-
        CESta:    RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr-
        CEMsk:    RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr+
        AERCap:    First Error Pointer: 00, GenCap- CGenEn- ChkCap- ChkEn-
    Capabilities: [140 v1] Device Serial Number 00-11-1f-ff-ff-00-01-11
    Kernel driver in use: e1000e

The relevant section of dmesg when the kernel does not crash:
(...)
pci 0000:01:00.0: [8086:10d3] type 00 class 0x020000
pci 0000:01:00.0: reg 10: [mem 0xb4600000-0xb461ffff]
pci 0000:01:00.0: reg 18: [io  0x7000-0x701f]
pci 0000:01:00.0: reg 1c: [mem 0xb4620000-0xb4623fff]
pci 0000:01:00.0: PME# supported from D0 D3hot
pci 0000:00:1c.0: PCI bridge to [bus 01]
pci 0000:00:1c.0:   bridge window [io  0x7000-0x7fff]
pci 0000:00:1c.0:   bridge window [mem 0xb4600000-0xb46fffff]
(...)
e1000e: Intel(R) PRO/1000 Network Driver - 2.3.2-k
e1000e: Copyright(c) 1999 - 2013 Intel Corporation.
e1000e 0000:01:00.0: Disabling ASPM L0s L1
e1000e 0000:01:00.0: Interrupt Throttling Rate (ints/sec) set to
dynamic conservative mode
e1000e 0000:01:00.0: irq 63 for MSI/MSI-X
e1000e 0000:01:00.0: irq 64 for MSI/MSI-X
e1000e 0000:01:00.0: irq 65 for MSI/MSI-X
ACPI Warning: 0x00008040-0x0000805f SystemIO conflicts with Region
\_SB_.PCI0.SBUS.SMBI 1 (20130328/utaddress-251)
ACPI: If an ACPI driver is available for this device, you should use
it instead of the native driver
e1000e 0000:01:00.0 eth0: registered PHC clock
e1000e 0000:01:00.0 eth0: (PCI Express:2.5GT/s:Width x1) 00:11:1f:00:01:11
e1000e 0000:01:00.0 eth0: Intel(R) PRO/1000 Network Connection
e1000e 0000:01:00.0 eth0: MAC: 3, PHY: 8, PBA No: FFFFFF-0FF

Regards,
-- 
Martin Boutin

------------------------------------------------------------------------------
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments & Everything In Between.
Get a Quote or Start a Free Trial Today. 
http://pubads.g.doubleclick.net/gampad/clk?id=119420431&iu=/4140/ostg.clktrk
_______________________________________________
E1000-devel mailing list
E1000-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/e1000-devel
To learn more about Intel&#174; Ethernet, visit 
http://communities.intel.com/community/wired

Reply via email to