Bug#1022806: linux-image-5.10.0-19-amd64: amggpu unbootable problem persists

2022-11-09 Thread Andreas Wirooks

I can also confirm that an unmodified 5.10.153 from kernel.org with
Debian's .config file works.


Bug#1022806: linux-image-5.10.0-19-amd64: amggpu unbootable problem persists

2022-11-04 Thread Alexis Huxley
Hi All,

Bernhard informed me that you might no longer want me to
do the kernel building, due to messages on the bug's mailing
list that didn't get sent to me. 

Please advise; ideally "please test deb at " :-)

Regards,

Alexis



Bug#1022806: linux-image-5.10.0-19-amd64: amggpu unbootable problem persists

2022-11-04 Thread Alexis Huxley
> I will try to do this, but owing to my schedule, it will
> be only Tuesday before I can try.

Everything slipped. Building ten kernel packages today. Further
feedback some time over the weekend.

Alexis



Bug#1022806: linux-image-5.10.0-19-amd64: amggpu unbootable problem persists

2022-10-31 Thread Andreas Wirooks

On Fri, 28 Oct 2022 20:26:55 +0200 Gert  wrote:
> 5.19.0-0.deb11.2-amd64 works fine at first glance.

I am also affected. I tried this kernel too, and it works, as you can
read here:

https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1022042#160

I also filed a new drm bug here:

https://gitlab.freedesktop.org/drm/amd/-/issues/2237

Kind regards,

Andreas



Bug#1022806: linux-image-5.10.0-19-amd64: amggpu unbootable problem persists

2022-10-28 Thread Gert

I suspect I have this same bug.

In case it's useful, I got kernel logs via serial console, see attachment.
Therein I have marked where VGA console output stops.
Serial console goes a bit further.
SysRq+B still works.

My system: MSI 785GM-E51 (AMD 785G+SB710) + Dell Radeon R5 240 OEM (Oland).
I use radeon.si_support=0 amdgpu.si_support=1 amdgpu.dc=0.
I put the drivers inside the initramfs. Loading them from outside the
initramfs doesn't work either, but it hangs a bit later.
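
For readers comparing setups, here is a minimal Python sketch of how the three options above could be read out of a kernel command line. The function names are hypothetical, and the fallback defaults (radeon handling Southern Islands hardware unless told otherwise) are an assumption about the 5.10 series, not something stated in this thread.

```python
def parse_module_params(cmdline):
    """Collect module.param=value tokens from a kernel command line."""
    params = {}
    for token in cmdline.split():
        key, sep, value = token.partition("=")
        # Only tokens of the form module.param=value are module options.
        if sep and "." in key:
            module, _, name = key.partition(".")
            params.setdefault(module, {})[name] = value
    return params


def si_driver(cmdline):
    """Guess which driver is asked to handle Southern Islands (SI) GPUs.

    Assumed defaults when an option is absent: radeon.si_support=1 and
    amdgpu.si_support=0.
    """
    params = parse_module_params(cmdline)
    radeon_si = params.get("radeon", {}).get("si_support", "1")
    amdgpu_si = params.get("amdgpu", {}).get("si_support", "0")
    if amdgpu_si == "1" and radeon_si == "0":
        return "amdgpu"
    if radeon_si == "1" and amdgpu_si == "0":
        return "radeon"
    return "ambiguous"


cmdline = "ro radeon.si_support=0 amdgpu.si_support=1 amdgpu.dc=0"
print(si_driver(cmdline))
print(parse_module_params(cmdline)["amdgpu"])
```

The same options may also be set through modprobe configuration rather than the command line, in which case this sketch would not see them.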

5.19.0-0.deb11.2-amd64 works fine at first glance.

Currently I have little time for "big" things like git-bisect, sorry.
But I'm happy to test smaller things.
Thanks, Gert

Serial console log, also visible on VGA console:

[0.00] Linux version 5.10.0-19-amd64 (debian-ker...@lists.debian.org) (gcc-10 (Debian 10.2.1-6) 10.2.1 20210110, GNU ld (GNU Binutils for Debian) 2.35.2) #1 SMP Debian 5.10.149-2 (2022-10-21)
[0.00] Command line: BOOT_IMAGE=/vmlinuz-5.10.0-19-amd64 root=UUID=4594ae40-1c56-4123-b65a-bfad982b83b3 ro rootflags=subvol=debian nmi_watchdog=0 clocksource=hpet ipv6.disable=1 nokaiser nopti pti=off usbhid.quirks=0x2341:0x8037:0x040 usbhid.jspoll=1 console=tty0 console=ttyS0,115200n8 text
[0.00] x86/fpu: x87 FPU will use FXSAVE
[0.00] BIOS-provided physical RAM map:
[0.00] BIOS-e820: [mem 0x-0x0009f3ff] usable
[0.00] BIOS-e820: [mem 0x0009f400-0x0009] reserved
[0.00] BIOS-e820: [mem 0x000e6000-0x000f] reserved
[0.00] BIOS-e820: [mem 0x0010-0xcfe8] usable
[0.00] BIOS-e820: [mem 0xcfe9-0xcfe9dfff] ACPI data
[0.00] BIOS-e820: [mem 0xcfe9e000-0xcfed] ACPI NVS
[0.00] BIOS-e820: [mem 0xcfee-0xcfef] reserved
[0.00] BIOS-e820: [mem 0xfff0-0x] reserved
[0.00] BIOS-e820: [mem 0x0001-0x00022fff] usable
[0.00] NX (Execute Disable) protection: active
[0.00] SMBIOS 2.5 present.
[0.00] DMI: MICRO-STAR INTERNATIONAL CO.,LTD MS-7596/785GM-E51 (MS-7596), BIOS V2.12 02/18/2011
[0.00] tsc: Fast TSC calibration using PIT
[0.00] tsc: Detected 3599.944 MHz processor
[0.013438] AGP: No AGP bridge found
[0.013513] last_pfn = 0x23 max_arch_pfn = 0x4
[0.013756] x86/PAT: Configuration [0-7]: WB  WC  UC- UC  WB  WP  UC- WT  
[0.013902] last_pfn = 0xcfe90 max_arch_pfn = 0x4
[0.017282] found SMP MP-table at [mem 0x000ff780-0x000ff78f]
[0.024303] Using GB pages for direct mapping
[0.024641] RAMDISK: [mem 0x33e2b000-0x35f0cfff]
[0.024647] ACPI: Early table checksum verification disabled
[0.024652] ACPI: RSDP 0x000FAD00 14 (v00 ACPIAM)
[0.024655] ACPI: RSDT 0xCFE9 40 (v01 7596MS A7596200 20110218 MSFT 0097)
[0.024661] ACPI: FACP 0xCFE90200 84 (v01 7596MS A7596200 20110218 MSFT 0097)
[0.024665] ACPI BIOS Warning (bug): Optional FADT field Pm2ControlBlock has valid Length but zero Address: 0x/0x1 (20200925/tbfadt-615)
[0.024670] ACPI: DSDT 0xCFE905D0 0093B2 (v01 A7596  A7596200 0200 INTL 20051117)
[0.024673] ACPI: FACS 0xCFE9E000 40
[0.024676] ACPI: APIC 0xCFE90390 7C (v01 7596MS A7596200 20110218 MSFT 0097)
[0.024679] ACPI: MCFG 0xCFE90410 3C (v01 7596MS OEMMCFG  20110218 MSFT 0097)
[0.024682] ACPI: OEMB 0xCFE9E040 72 (v01 7596MS A7596200 20110218 MSFT 0097)
[0.024685] ACPI: SRAT 0xCFE9A5D0 C8 (v03 AMDFAM_F_10 0002 AMD  0001)
[0.024688] ACPI: HPET 0xCFE9A6A0 38 (v01 7596MS OEMHPET  20110218 MSFT 0097)
[0.024691] ACPI: SSDT 0xCFE9A6E0 000458 (v01 A M I  POWERNOW 0001 AMD  0001)
[0.024694] ACPI: Reserving FACP table memory at [mem 0xcfe90200-0xcfe90283]
[0.024696] ACPI: Reserving DSDT table memory at [mem 0xcfe905d0-0xcfe99981]
[0.024697] ACPI: Reserving FACS table memory at [mem 0xcfe9e000-0xcfe9e03f]
[0.024698] ACPI: Reserving APIC table memory at [mem 0xcfe90390-0xcfe9040b]
[0.024699] ACPI: Reserving MCFG table memory at [mem 0xcfe90410-0xcfe9044b]
[0.024700] ACPI: Reserving OEMB table memory at [mem 0xcfe9e040-0xcfe9e0b1]
[0.024702] ACPI: Reserving SRAT table memory at [mem 0xcfe9a5d0-0xcfe9a697]
[0.024703] ACPI: Reserving HPET table memory at [mem 0xcfe9a6a0-0xcfe9a6d7]
[0.024704] ACPI: Reserving SSDT table memory at [mem 0xcfe9a6e0-0xcfe9ab37]
[0.024742] SRAT: PXM 0 -> APIC 0x00 -> Node 0
[0.024743] SRAT: PXM 0 -> APIC 0x01 -> Node 0
[0.024746] ACPI: SRAT: Node 0 PXM 0 [mem 0x-0x0009]
[0.024747] ACPI: SRAT: Node 0 PXM 0 [mem 0x0010-0xcfff]
[0.024748] ACPI: SRAT: Node 0 PXM 0 [mem 0x1-0x22fff]
[0.024751] NUMA: Node 0 [mem 0x-0x0009] + [mem 

Bug#1022806: linux-image-5.10.0-19-amd64: amggpu unbootable problem persists

2022-10-27 Thread Felix Miata
Oland [Radeon HD 8570 / R5 430 OEM R7 240/340 Radeon 520 OEM]
vendor: Dell driver: radeon v: kernel arch: GCN-1 pcie: speed: 2.5 GT/s
lanes: 8 ports: active: DP-1,DVI-I-1 empty: none bus-ID: 01:00.0
chip-ID: 1002:6611

Same failure as in my previous comment #45 about GCN2. In Bullseye on both PCs,
the workaround is to omit 'plymouth.enable=0 radeon.si_support=1' from the linux
line in Grub, which, absent installation of the radeon DDX, forces use of the
modesetting DIX instead of the amdgpu DDX, which was working just fine until
linux-image-5.10.0-19-amd64 was installed. The resulting change in display output
names requires the xrandr script locating all the displays to be rewritten. On
this one I have not yet attempted installation of the backport kernel.
-- 
Evolution as taught in public schools is, like religion,
based on faith, not based on science.

 Team OS/2 ** Reg. Linux User #211409 ** a11y rocks!

Felix Miata



Bug#1022806: linux-image-5.10.0-19-amd64: amggpu unbootable problem persists

2022-10-27 Thread Felix Miata
5.10.149-2 produced no improvement for my A10-7850K Radeon R7.

5.19.0-0.deb11.2-amd64 solves the problem for me, which is no surprise given
Bookworm has been working fine on the same PC using its latest 5.19 kernel.
-- 
Evolution as taught in public schools is, like religion,
based on faith, not based on science.

 Team OS/2 ** Reg. Linux User #211409 ** a11y rocks!

Felix Miata



Bug#1022806: linux-image-5.10.0-19-amd64: amggpu unbootable problem persists

2022-10-27 Thread Diederik de Haas
On woensdag 26 oktober 2022 12:43:02 CEST Alexis Huxley wrote:
> Following other reports of post-grub kernel hangs on systems with
> amdgpu, I waited for new release of linux-image-5.10.0-19-amd64,
> which came quickly, but it did not solve the problem for me.
> 
> Symptoms are: grub loads kernel and a few seconds into the
> scrolling messages from the kernel the system hangs. The screen
> is blank. The system is not accessible over the network.

AFAICT, which may be incorrect, the issue is slightly different from the other 
related bug report (#1022042) in that here it is a system which does not boot.
The other problem that is fixed, is where the system does boot, but the 
graphics don't work correctly.

https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1022042#105 is where Andreas
Wirooks (see also
https://gitlab.freedesktop.org/drm/amd/-/issues/2216#note_1601632) reported on
that/this issue, and in msg 110 'inasprecali' reported the same issue (AFAICT).
See also the msgs that follow, which may give hints as to where the non-boot
issue may come from.

It _may_ also be related to older chipset variants.

HTH,
  Diederik



Bug#1022806: linux-image-5.10.0-19-amd64: amggpu unbootable problem persists

2022-10-27 Thread Alexis Huxley
Hi All,

> Would you be able to start your own kernel builds, confirming it does not
> happen with 5.10.140 upstream but with 5.10.149, and isolate the
> breaking change?
> 
> (Are you able to boot the kernel from bullseye-backports?)

I will try to do this, but owing to my schedule, it will
be only Tuesday before I can try.

If you did the kernel building and sent me download links
then I can test them, which is obviously a lot less work
for me, but a lot more for you. Otherwise I'll let you
know how I get on next week.

Sorry to have submitted the report and then immediately not
be available to do the follow-up.

Regards,

Alexis



Bug#1022806: linux-image-5.10.0-19-amd64: amggpu unbootable problem persists

2022-10-27 Thread Salvatore Bonaccorso
Hi Alexis,

[Cc'ing Alex Deucher who addressed the previous regressions as well]

On Wed, Oct 26, 2022 at 12:43:02PM +0200, Alexis Huxley wrote:
> Package: src:linux
> Version: 5.10.149-2
> Severity: serious
> Justification: 1022025, 1022051, 1022062, 1022070, 1022097, 1022147 marked 
> serious so this marked serious too
> 
> Dear Maintainer,
> 
> Following other reports of post-grub kernel hangs on systems with
> amdgpu, I waited for new release of linux-image-5.10.0-19-amd64,
> which came quickly, but it did not solve the problem for me.
> 
> Symptoms are: grub loads kernel and a few seconds into the 
> scrolling messages from the kernel the system hangs. The screen
> is blank. The system is not accessible over the network.
> 
> I reverted to linux-image-5.10.0-18-amd64 and all is okay again.
> 
> The crash happens pretty early on: I believe X has not yet tried
> to start. Both /var/log/syslog and /var/log/messages contain
> no entries pertaining to the hanging boot (only messages from
> where the earlier shutdown of 18 and the later start of 18).
> 
> Output from lscpu is below.
> 
> Automatically included output (e.g. kernel version) pertains
> to linux-image-5.10.0-18-amd64, as I am unable to boot
> linux-image-5.10.0-19-amd64.  I don't remove it in case it contains
> other pertinent information.
> 
> I'm happy to test with other kernels or provide any requested
> files/output.

Would you be able to start your own kernel builds, confirming it does not
happen with 5.10.140 upstream but with 5.10.149, and isolate the
breaking change?

(Are you able to boot the kernel from bullseye-backports?)
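
The isolation step requested above amounts to a binary search between a known-good and a known-bad kernel. A minimal Python sketch follows, assuming the boot result flips exactly once between 5.10.140 and 5.10.149. The function name `first_bad` and the culprit version in the example are purely hypothetical; the thread does not identify the breaking release.

```python
def first_bad(versions, boots_ok):
    """Binary search for the first version where boots_ok() returns False.

    Assumes versions[0] is known good, versions[-1] is known bad, and that
    the result flips exactly once in between.
    """
    good, bad = 0, len(versions) - 1
    tested = 0
    while bad - good > 1:
        mid = (good + bad) // 2
        tested += 1  # one kernel build and boot test per iteration
        if boots_ok(versions[mid]):
            good = mid
        else:
            bad = mid
    return versions[bad], tested


# Upstream stable releases between the two versions named in the request.
stable = [f"5.10.{n}" for n in range(140, 150)]

# Hypothetical stand-in for a manual boot test: pretend 5.10.146 broke it.
def pretend_boot(version):
    return int(version.split(".")[2]) < 146

print(first_bad(stable, pretend_boot))
```

With ten candidate releases, at most four builds are needed to find the first failing one. `git bisect` applies the same idea to individual commits once the failing release is known.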

Regards,
Salvatore



Bug#1022806: linux-image-5.10.0-19-amd64: amggpu unbootable problem persists

2022-10-26 Thread Alexis Huxley
Package: src:linux
Version: 5.10.149-2
Severity: serious
Justification: 1022025, 1022051, 1022062, 1022070, 1022097, 1022147 marked 
serious so this marked serious too

Dear Maintainer,

Following other reports of post-grub kernel hangs on systems with
amdgpu, I waited for new release of linux-image-5.10.0-19-amd64,
which came quickly, but it did not solve the problem for me.

Symptoms are: grub loads kernel and a few seconds into the 
scrolling messages from the kernel the system hangs. The screen
is blank. The system is not accessible over the network.

I reverted to linux-image-5.10.0-18-amd64 and all is okay again.

The crash happens pretty early on: I believe X has not yet tried
to start. Both /var/log/syslog and /var/log/messages contain
no entries pertaining to the hanging boot (only messages from
the earlier shutdown of 18 and the later start of 18).

Output from lscpu is below.

Automatically included output (e.g. kernel version) pertains
to linux-image-5.10.0-18-amd64, as I am unable to boot
linux-image-5.10.0-19-amd64.  I don't remove it in case it contains
other pertinent information.

I'm happy to test with other kernels or provide any requested
files/output.

Alexis

sugo# lscpu
Architecture:x86_64
CPU op-mode(s):  32-bit, 64-bit
Byte Order:  Little Endian
Address sizes:   48 bits physical, 48 bits virtual
CPU(s):  2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):   1
NUMA node(s):1
Vendor ID:   AuthenticAMD
CPU family:  21
Model:   112
Model name:  AMD A9-9425 RADEON R5, 5 COMPUTE CORES 2C+3G
Stepping:0
Frequency boost: enabled
CPU MHz: 1396.583
CPU max MHz: 3100.
CPU min MHz: 1400.
BogoMIPS:6187.95
Virtualization:  AMD-V
L1d cache:   64 KiB
L1i cache:   128 KiB
L2 cache:2 MiB
NUMA node0 CPU(s):   0,1
Vulnerability Itlb multihit: Not affected
Vulnerability L1tf:  Not affected
Vulnerability Mds:   Not affected
Vulnerability Meltdown:  Not affected
Vulnerability Mmio stale data:   Not affected
Vulnerability Retbleed:  Mitigation; untrained return thunk; SMT disabled
Vulnerability Spec store bypass: Mitigation; Speculative Store Bypass disabled via prctl and seccomp
Vulnerability Spectre v1:Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Vulnerability Spectre v2:Mitigation; Retpolines, IBPB conditional, STIBP disabled, RSB filling, PBRSB-eIBRS Not affected
Vulnerability Srbds: Not affected
Vulnerability Tsx async abort:   Not affected
Flags:   fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good acc_power nopl nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 movbe popcnt aes xsave avx f16c lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs xop skinit wdt lwp fma4 tce nodeid_msr tbm perfctr_core perfctr_nb bpext ptsc mwaitx cpb hw_pstate ssbd ibpb vmmcall fsgsbase bmi1 avx2 smep bmi2 xsaveopt arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic v_vmsave_vmload vgif overflow_recov
sugo# 



-- Package-specific info:
** Kernel log: boot messages should be attached


** Model information
sys_vendor: HP
product_name: HP Slim Desktop 290-a0xxx
product_version: 
chassis_vendor: HP
chassis_version: 
bios_vendor: AMI
bios_version: F.10
board_vendor: HP
board_name: 8459
board_version: 00

** Network interface configuration:
*** /etc/network/interfaces:
auto lo
iface lo inet loopback

iface enp3s0 inet manual

source /etc/network/interfaces.d/br0


*** /etc/network/interfaces.d/br0:

auto br0
iface br0 inet static
address 192.168.1.16
netmask 255.255.255.0
gateway 192.168.1.51
bridge_ports enp3s0

** PCI devices:
00:00.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Root Complex [1022:1576]
Subsystem: Hewlett-Packard Company Family 15h (Models 60h-6fh) Processor Root Complex [103c:8459]
Control: I/O- Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR-