Bug#996667: kernel panic with kernel upgrade on diskless PXE + NFS system

2021-10-17 Thread Mike
Thanks, Salvatore, for replying.

> From the booting kernel could you please add the boot log

Attached as boot.log.

> any of the information which
> would be collected by reportbug kernel, so we can understand what NIC
> you have on eth0.

Attached "lspci -vv" output.  LMK if there's more that you need.
Linux version 4.19.0-16-amd64 (debian-ker...@lists.debian.org) (gcc version 8.3.0 (Debian 8.3.0-6)) #1 SMP Debian 4.19.181-1 (2021-03-19)
Command line: vga=795 root=/dev/nfs nfsroot=__.__.__.__:/__ ip=dhcp rw initrd=___/initrd.img-4.19.0-16-amd64 BOOT_IMAGE=vmlinuz-4.19.0-16-amd64
x86/fpu: x87 FPU will use FXSAVE
BIOS-provided physical RAM map:
BIOS-e820: [mem 0x-0x0009fbff] usable
BIOS-e820: [mem 0x0009fc00-0x0009] reserved
BIOS-e820: [mem 0x000e4000-0x000f] reserved
BIOS-e820: [mem 0x0010-0x7f68] usable
BIOS-e820: [mem 0x7f69-0x7f69dfff] ACPI data
BIOS-e820: [mem 0x7f69e000-0x7f6c] ACPI NVS
BIOS-e820: [mem 0x7f6d-0x7f6ddfff] reserved
BIOS-e820: [mem 0x7f6e-0x7f6f] reserved
BIOS-e820: [mem 0xfee0-0xfee00fff] reserved
BIOS-e820: [mem 0xfff0-0x] reserved
NX (Execute Disable) protection: active
SMBIOS 2.5 present.
DMI: System manufacturer System Product Name/P5KPL-CM, BIOS 0602 02/24/2009
tsc: Fast TSC calibration using PIT
tsc: Detected 1814.842 MHz processor
last_pfn = 0x7f690 max_arch_pfn = 0x4
x86/PAT: Configuration [0-7]: WB  WC  UC- UC  WB  WP  UC- WT  
found SMP MP-table at [mem 0x000ff780-0x000ff78f]
RAMDISK: [mem 0x7de3-0x7f66efff]
ACPI: Early table checksum verification disabled
ACPI: RSDP 0x000FB7B0 14 (v00 ACPIAM)
ACPI: RSDT 0x7F69 3C (v01 A_M_I_ OEMRSDT  02000924 MSFT 0097)
ACPI: FACP 0x7F690200 84 (v02 A_M_I_ OEMFACP  02000924 MSFT 0097)
ACPI: DSDT 0x7F6905C0 007C10 (v01 A0968  A0968000  INTL 20051117)
ACPI: FACS 0x7F69E000 40
ACPI: APIC 0x7F690390 6C (v01 A_M_I_ OEMAPIC  02000924 MSFT 0097)
ACPI: MCFG 0x7F690400 3C (v01 A_M_I_ OEMMCFG  02000924 MSFT 0097)
ACPI: OEMB 0x7F69E040 80 (v01 A_M_I_ AMI_OEM  02000924 MSFT 0097)
ACPI: HPET 0x7F6981D0 38 (v01 A_M_I_ OEMHPET  02000924 MSFT 0097)
ACPI: GSCI 0x7F69E0C0 002024 (v01 A_M_I_ GMCHSCI  02000924 MSFT 0097)
No NUMA configuration found
Faking a node at [mem 0x-0x7f68]
NODE_DATA(0) allocated [mem 0x7f68b000-0x7f68]
Zone ranges:
  DMA      [mem 0x1000-0x00ff]
  DMA32    [mem 0x0100-0x7f68]
  Normal   empty
  Device   empty
Movable zone start for each node
Early memory node ranges
  node   0: [mem 0x1000-0x0009efff]
  node   0: [mem 0x0010-0x7f68]
Zeroed struct page in unavailable ranges: 2514 pages
Initmem setup node 0 [mem 0x1000-0x7f68]
Reserving Intel graphics memory at [mem 0x7f80-0x7fff]
ACPI: PM-Timer IO Port: 0x808
IOAPIC[0]: apic_id 1, version 32, address 0xfec0, GSI 0-23
ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
Using ACPI (MADT) for SMP configuration information
ACPI: HPET id: 0x8086a201 base: 0xfed0
smpboot: Allowing 4 CPUs, 3 hotplug CPUs
PM: Registered nosave memory: [mem 0x-0x0fff]
PM: Registered nosave memory: [mem 0x0009f000-0x0009]
PM: Registered nosave memory: [mem 0x000a-0x000e3fff]
PM: Registered nosave memory: [mem 0x000e4000-0x000f]
[mem 0x8000-0xfedf] available for PCI devices
Booting paravirtualized kernel on bare hardware
clocksource: refined-jiffies: mask: 0x max_cycles: 0x, max_idle_ns: 7645519600211568 ns
random: get_random_bytes called from start_kernel+0x93/0x52a with crng_init=0
setup_percpu: NR_CPUS:512 nr_cpumask_bits:512 nr_cpu_ids:4 nr_node_ids:1
percpu: Embedded 45 pages/cpu s144536 r8192 d31592 u524288
Built 1 zonelists, mobility grouping on.  Total pages: 513598
Policy zone: DMA32
Kernel command line: vga=795 root=/dev/nfs nfsroot=__.__.__.__:/__ ip=dhcp rw initrd=__/initrd.img-4.19.0-16-amd64 BOOT_IMAGE=vmlinuz-4.19.0-16-amd64
Memory: 2002128K/2087096K available (10252K kernel code, 1242K rwdata, 3328K rodata, 1600K init, 2260K bss, 84968K reserved, 0K cma-reserved)
SLUB: HWalign=64, Order=0-3, MinObjects=0, CPUs=4, Nodes=1
Kernel/User page tables isolation: enabled
ftrace: allocating 31978 entries in 125 pages
rcu: Hierarchical RCU implementation.
rcu: RCU restricting CPUs from NR_CPUS=512 to nr_cpu_ids=4.
rcu: Adjusting geometry for rcu_fanout_leaf=16, nr_cpu_ids=4
NR_IRQS: 33024, nr_irqs: 456, preallocated irqs: 16
Console: colour dummy device 80x25
console [tty0] enabled
ACPI: Core revision 20180810
clocksource: hpet: ma

Bug#996667: kernel panic with kernel upgrade on diskless PXE + NFS system

2021-10-17 Thread Salvatore Bonaccorso
Control: tags -1 + moreinfo
Control: severity -1 important 

On Sun, Oct 17, 2021 at 02:52:31AM +, deb...@good-with-numbers.com wrote:
> Package: linux-image-4.19.0-17-amd64
> Version: 4.19.194-3
> Severity: critical
> 
> 
> I have the following packages installed:
> 
> linux-image-4.19.0-16-amd64
> linux-image-4.19.0-17-amd64
> linux-image-4.19.0-18-amd64
> 
> The system boots PXE on NFS--completely diskless.  eth0 connects to the
> PXE/NFS server.  The -16 release runs just fine.  I tried to upgrade to
> the -18, and then -17, releases by switching the PXE configuration, but
> both failed.  The -18 release failed at:
> 
> --
> Begin: Loading essential drivers ... done.
> Begin: Running /scripts/init-premount ... done.
> Begin: Mounting root file system ... Begin: Running /scripts/nfs-top ... done.
> Begin: Running /scripts/nfs-premount ... done.
> ipconfig: eth0: SIOCGIFINDEX: No such device
> ipconfig: no devices to configure
> ipconfig: eth0: SIOCGIFINDEX: No such device
> ipconfig: no devices to configure
> ipconfig: eth0: SIOCGIFINDEX: No such device
> ipconfig: no devices to configure
> ipconfig: eth0: SIOCGIFINDEX: No such device
> ipconfig: no devices to configure
> ipconfig: eth0: SIOCGIFINDEX: No such device
> ipconfig: no devices to configure
> ipconfig: eth0: SIOCGIFINDEX: No such device
> ipconfig: no devices to configure
> ipconfig: eth0: SIOCGIFINDEX: No such device
> ipconfig: no devices to configure
> ipconfig: eth0: SIOCGIFINDEX: No such device
> ipconfig: no devices to configure
> ipconfig: eth0: SIOCGIFINDEX: No such device
> ipconfig: no devices to configure
> ipconfig: eth0: SIOCGIFINDEX: No such device
> ipconfig: no devices to configure
> /init: .: line 275: can't open '/run/net-eth0.conf': No such file or directory
> [4.xx] Kernel panic - not syncing: Attempted to kill init! exitcode=0x0200
> --
> 
> etc.  (Retyped from a photo.)  The -17 release output is the same.
> 
> I don't know what file "line 275" refers to.
> 
> There were a lot of updates from 4.19.181-1 to 4.19.194-3.

We have too little information here. From the booting kernel could you
please add the boot log, and as well any of the information which
would be collected by reportbug kernel, so we can understand what NIC
you have on eth0.
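
For reference, a minimal sketch of one way to find out which driver is bound to eth0 on the kernel that still boots. It follows the "driver" symlink that sysfs exposes for the interface. The function name nic_driver and the SYSFS_ROOT override are made up for this illustration and are not part of any tool.

```shell
#!/bin/sh
# Print the kernel driver bound to a network interface by following
# the sysfs "driver" symlink of the underlying device.
# SYSFS_ROOT is a hypothetical override (not a standard variable) so the
# function can be tried against a fake directory tree.
nic_driver() {
    iface=$1
    root=${SYSFS_ROOT:-/sys}
    link="$root/class/net/$iface/device/driver"
    if [ -L "$link" ]; then
        # The link target ends in the driver name.
        basename "$(readlink "$link")"
    else
        echo "no driver bound to $iface" >&2
        return 1
    fi
}

# Example: run on the working -16 kernel.
nic_driver eth0 || true
```

"lspci -nnk" (the "Kernel driver in use" line) or "ethtool -i eth0" should give the same answer where those tools are installed.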

Regards,
Salvatore



Bug#996667: kernel panic with kernel upgrade on diskless PXE + NFS system

2021-10-16 Thread debian
Package: linux-image-4.19.0-17-amd64
Version: 4.19.194-3
Severity: critical


I have the following packages installed:

linux-image-4.19.0-16-amd64
linux-image-4.19.0-17-amd64
linux-image-4.19.0-18-amd64

The system boots PXE on NFS--completely diskless.  eth0 connects to the
PXE/NFS server.  The -16 release runs just fine.  I tried to upgrade to
the -18, and then -17, releases by switching the PXE configuration, but
both failed.  The -18 release failed at:

--
Begin: Loading essential drivers ... done.
Begin: Running /scripts/init-premount ... done.
Begin: Mounting root file system ... Begin: Running /scripts/nfs-top ... done.
Begin: Running /scripts/nfs-premount ... done.
ipconfig: eth0: SIOCGIFINDEX: No such device
ipconfig: no devices to configure
ipconfig: eth0: SIOCGIFINDEX: No such device
ipconfig: no devices to configure
ipconfig: eth0: SIOCGIFINDEX: No such device
ipconfig: no devices to configure
ipconfig: eth0: SIOCGIFINDEX: No such device
ipconfig: no devices to configure
ipconfig: eth0: SIOCGIFINDEX: No such device
ipconfig: no devices to configure
ipconfig: eth0: SIOCGIFINDEX: No such device
ipconfig: no devices to configure
ipconfig: eth0: SIOCGIFINDEX: No such device
ipconfig: no devices to configure
ipconfig: eth0: SIOCGIFINDEX: No such device
ipconfig: no devices to configure
ipconfig: eth0: SIOCGIFINDEX: No such device
ipconfig: no devices to configure
ipconfig: eth0: SIOCGIFINDEX: No such device
ipconfig: no devices to configure
/init: .: line 275: can't open '/run/net-eth0.conf': No such file or directory
[4.xx] Kernel panic - not syncing: Attempted to kill init! exitcode=0x0200
--
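
A note on the exitcode in the panic line: the kernel prints it in wait-status format, so the exit code of init is in bits 8-15 and a terminating signal, if any, is in the low 7 bits. A minimal sketch of the decoding, using the value exactly as it was retyped above (the helper name decode_wait_status is made up for this illustration):

```shell
#!/bin/sh
# Split a wait-status value into exit code and terminating signal.
# decode_wait_status is a hypothetical helper, not an existing tool.
decode_wait_status() {
    status=$(( $1 ))
    echo "exit=$(( (status >> 8) & 0xff )) signal=$(( status & 0x7f ))"
}

decode_wait_status 0x0200   # prints "exit=2 signal=0"
```

So init was not killed by a signal; the /init script itself exited with status 2, which fits a shell aborting on the error printed just before the panic.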

etc.  (Retyped from a photo.)  The -17 release output is the same.

I don't know what file "line 275" refers to.
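
A possible reading, not verified against this exact initramfs: the message comes from the "." (source) builtin, so the line number probably belongs to a script that /init sources rather than to /init itself. In initramfs-tools the network setup is done by a shell function that retries ipconfig several times and then sources /run/net-<device>.conf. The sketch below is a simplified simulation of that pattern, not a copy of the real script. fake_ipconfig, configure_networking_sketch, RUN_DIR, and the timeout values are stand-ins chosen for illustration.

```shell
#!/bin/sh
# Simplified simulation of a retry-then-source network setup, as an
# initramfs script might do it. NOT a verbatim copy of initramfs-tools.
# fake_ipconfig, RUN_DIR and the timeout list are hypothetical stand-ins.
RUN_DIR=$(mktemp -d)
DEVICE=eth0

fake_ipconfig() {
    # Behave like the failing boot: the interface does not exist,
    # so no net-<device>.conf file is ever written.
    echo "ipconfig: $1: SIOCGIFINDEX: No such device"
    echo "ipconfig: no devices to configure"
    return 1
}

configure_networking_sketch() {
    attempts=0
    # One ipconfig call per round, each with a longer timeout.
    for timeout in 2 3 4 6 9 16 25 36 64 100; do
        attempts=$((attempts + 1))
        fake_ipconfig "$DEVICE" "$timeout" && break
    done
    conf="$RUN_DIR/net-$DEVICE.conf"
    if [ -e "$conf" ]; then
        . "$conf"
    else
        # A real script that sources the file unconditionally would abort
        # here, because a failed "." is fatal in a non-interactive POSIX
        # shell. The sketch returns instead so that it can be run safely.
        echo "can't open '$conf': No such file or directory" >&2
        return 2
    fi
}

rc=0
configure_networking_sketch || rc=$?
echo "attempts=$attempts status=$rc"
rm -rf "$RUN_DIR"
```

With ten rounds the sketch prints the same ten pairs of ipconfig messages as the log above. When the shell running as PID 1 aborts at that point, the kernel reports "Attempted to kill init!".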

There were a lot of updates from 4.19.181-1 to 4.19.194-3.