Re: [gentoo-user] [OT] AER: Corrected error received: 0000:00:03.0

2018-09-30 Thread mad.scientist.at.large
If you've been playing with the bios there can be all kinds of problems, I've 
learned this through trial and error (knowing i  was learning and likely to 
learn by experiance).  I'd suggest resetting the bios to "Optimal" or whatever 
your' bios calls it and seeing if that solves it.  You may have to change the 
bios in terms of boot drive and possibly other settings, but best to change as 
little as possible.
I assume this machine previously worked well?  have you tried booting from an 
optical drive or thumb drive with another os or utilities disk to eliminate 
kernel issues?
finally, if it worked till recently you could have a dying pci card in which 
case you'll have to try pulling them all out (other than graphics if it's not 
built in) and then reinstalling one card at a time, rebooting to see if things 
have changed.
failing that, you might rebuild the kernel just in case that was corrupted in 
some way.  A lot of subtle hardware problems can corrupt the compiler but not 
show up in less intensive use.
especially when learning, it's best to change one thing at a time, reboot and 
see what happens.  this way if something breaks you know it was probably your' 
last change.  I hope this is useful to you.  It is also always a good idea to 
ask others who may see what we ourselves are not seeing.  Trouble shooting can 
be very difficult and not everyone is good at it but most can get a lot better 
with practice and patience.

Democracynow.org


27. Sep 2018 16:02 by michaelkintz...@gmail.com 
:


> On Thursday, 27 September 2018 21:51:42 BST Håkon Alstadheim wrote:
>> (Sorry for the OT, don't know where to go for generic hardware questions)
>>
>> I'm wondering if my main board dying? On sept. 1 I started getting error
>> messages like the ones at the  end of this mail. I just noticed them. I
>> am including logs from as far back as I have, just for completeness.
>>
>> This mainboard:
>>
>> # smbios-sys-info
>> Libsmbios version:  2.3.2
>> Product Name:   Z10PE-D8 WS
>> Vendor: ASUSTeK COMPUTER INC.
>> BIOS Version:   3703
>>
>> ... has given me a lot of grief, so I've been mucking about with it
>> quite a lot. I don't know if I did anything on september 1. Could it be
>> some bios-setting I've changed, or som firmware corruption ? Other
>> weirdness happening recently is sensors have started having episodes of
>> all values totally bogus, and one pwm fan-sensor seems to be permanently
>> slightly loony. 
>>
>> OS running is kept up-to-date ~amd64 gentoo-sources running as Dom0
>> under Xen-4.11 latest.
>>
>> --
>>
>>
>> 0:root@gentoo log # zgrep 'pcieport :00:03.0' kern.log* | sed -e
>> 's/^[^:]*://' | sort -M
>> Aug 19 19:38:55 gentoo kernel: [1.181466] pcieport :00:03.0: AER
>> enabled with IRQ 134
>> Aug 19 19:38:55 gentoo kernel: [1.181700] pcieport :00:03.0:
>> Signaling PME with IRQ 134
>
> Did you enable CONFIG_PCIEAER_INJECT in your kernel?
>
> What do you get when you boot with the kernel option:
>
> pcie_ports=auto
>
> -- 
> Regards,
> Mick

Re: [gentoo-user] [OT] AER: Corrected error received: 0000:00:03.0

2018-09-28 Thread Mick
On Friday, 28 September 2018 10:26:09 BST zless wrote:
> În ziua de vineri, 28 septembrie 2018, la 10:05:28 EEST, Håkon Alstadheim a
> 
> scris:
> > Den 28. sep. 2018 00:02, skrev Mick:
> > > On Thursday, 27 September 2018 21:51:42 BST Håkon Alstadheim wrote:
> > >> (Sorry for the OT, don't know where to go for generic hardware
> > >> questions)
> > >> 
> > >> I'm wondering if my main board dying? On sept. 1 I started getting
> > >> error
> > >> messages like the ones at the  end of this mail. I just noticed them. I
> > >> am including logs from as far back as I have, just for completeness.
> > >> 
> > >> This mainboard:
> > >> 
> > >> # smbios-sys-info
> > >> Libsmbios version:  2.3.2
> > >> Product Name:   Z10PE-D8 WS
> > >> Vendor: ASUSTeK COMPUTER INC.
> > >> BIOS Version:   3703
> > >> 
> > >> ... has given me a lot of grief, so I've been mucking about with it
> > >> quite a lot. I don't know if I did anything on september 1. Could it be
> > >> some bios-setting I've changed, or som firmware corruption ? Other
> > >> weirdness happening recently is sensors have started having episodes of
> > >> all values totally bogus, and one pwm fan-sensor seems to be
> > >> permanently
> > >> slightly loony.
> > >> 
> > >> OS running is kept up-to-date ~amd64 gentoo-sources running as Dom0
> > >> under Xen-4.11 latest.
> > >> 
> > >> --
> > >> 
> > >> 
> > >> 0:root@gentoo log # zgrep 'pcieport :00:03.0' kern.log* | sed -e
> > >> 's/^[^:]*://' | sort -M
> > >> Aug 19 19:38:55 gentoo kernel: [1.181466] pcieport :00:03.0:
> > >> AER
> > >> enabled with IRQ 134
> > >> Aug 19 19:38:55 gentoo kernel: [1.181700] pcieport :00:03.0:
> > >> Signaling PME with IRQ 134
> > > 
> > > Did you enable CONFIG_PCIEAER_INJECT in your kernel?
> > 
> > Yes, enabled already.
> > 
> > > What do you get when you boot with the kernel option:
> > > 
> > > pcie_ports=auto
> > 
> > Still happens I'm afraid.
> > 
> > # cat /proc/cmdline
> > placeholder root=LABEL=SAS-GENTOO ro
> > xen-pciback.hide=(02:00.*)(04:00.*)(05:00.*)(08:00.*)(81:00.*)(82:00.*)
> > usbcore.autosuspend=-1 scsi_mod.use_blk_mq=1 xen-netback.max-queues=8
> > xen-pciback.permissive=1 xen-pciback.verbose_request=1
> > xen-blkback.max_persistent_grants=1024 intel_iommu=on domodules domdadm
> > console=tty1 pcie_ports=auto
> 
> I "solved" a similar case with 'pci=noaer' on the kernel command line.
> 
> I quote solved above because even to this day I have no idea what those
> messages were trying to tell me.
> 
> I always had CONFIG_PCIEAER_INJECT not enabled though.

This is not an option on my kernel, I assume these options change between 
kernel releases.  Have a look at your /usr/src/linux/Documentation/admin-
guide/kernel-parameters.txt for pcie_ports options.  I suggested trying 
'auto', because it will only use PICe services if allowed by the BIOS and   
Håkon mentioned he'd been tweaking his BIOS settings.
-- 
Regards,
Mick

signature.asc
Description: This is a digitally signed message part.


Re: [gentoo-user] [OT] AER: Corrected error received: 0000:00:03.0

2018-09-28 Thread zless
În ziua de vineri, 28 septembrie 2018, la 10:05:28 EEST, Håkon Alstadheim a 
scris:
> 
> Den 28. sep. 2018 00:02, skrev Mick:
> > On Thursday, 27 September 2018 21:51:42 BST Håkon Alstadheim wrote:
> >> (Sorry for the OT, don't know where to go for generic hardware questions)
> >>
> >> I'm wondering if my main board dying? On sept. 1 I started getting error
> >> messages like the ones at the  end of this mail. I just noticed them. I
> >> am including logs from as far back as I have, just for completeness.
> >>
> >> This mainboard:
> >>
> >> # smbios-sys-info
> >> Libsmbios version:  2.3.2
> >> Product Name:   Z10PE-D8 WS
> >> Vendor: ASUSTeK COMPUTER INC.
> >> BIOS Version:   3703
> >>
> >> ... has given me a lot of grief, so I've been mucking about with it
> >> quite a lot. I don't know if I did anything on september 1. Could it be
> >> some bios-setting I've changed, or som firmware corruption ? Other
> >> weirdness happening recently is sensors have started having episodes of
> >> all values totally bogus, and one pwm fan-sensor seems to be permanently
> >> slightly loony. 
> >>
> >> OS running is kept up-to-date ~amd64 gentoo-sources running as Dom0
> >> under Xen-4.11 latest.
> >>
> >> --
> >>
> >>
> >> 0:root@gentoo log # zgrep 'pcieport :00:03.0' kern.log* | sed -e
> >> 's/^[^:]*://' | sort -M
> >> Aug 19 19:38:55 gentoo kernel: [1.181466] pcieport :00:03.0: AER
> >> enabled with IRQ 134
> >> Aug 19 19:38:55 gentoo kernel: [1.181700] pcieport :00:03.0:
> >> Signaling PME with IRQ 134
> > Did you enable CONFIG_PCIEAER_INJECT in your kernel?
> Yes, enabled already.
> >
> > What do you get when you boot with the kernel option:
> >
> > pcie_ports=auto
> >
> Still happens I'm afraid.
> 
> # cat /proc/cmdline
> placeholder root=LABEL=SAS-GENTOO ro
> xen-pciback.hide=(02:00.*)(04:00.*)(05:00.*)(08:00.*)(81:00.*)(82:00.*)
> usbcore.autosuspend=-1 scsi_mod.use_blk_mq=1 xen-netback.max-queues=8
> xen-pciback.permissive=1 xen-pciback.verbose_request=1
> xen-blkback.max_persistent_grants=1024 intel_iommu=on domodules domdadm 
> console=tty1 pcie_ports=auto

I "solved" a similar case with 'pci=noaer' on the kernel command line.

I quote solved above because even to this day I have no idea what those 
messages were trying to tell me.

I always had CONFIG_PCIEAER_INJECT not enabled though.





Re: [gentoo-user] [OT] AER: Corrected error received: 0000:00:03.0

2018-09-28 Thread Håkon Alstadheim



Den 28. sep. 2018 00:02, skrev Mick:
> On Thursday, 27 September 2018 21:51:42 BST Håkon Alstadheim wrote:
>> (Sorry for the OT, don't know where to go for generic hardware questions)
>>
>> I'm wondering if my main board dying? On sept. 1 I started getting error
>> messages like the ones at the  end of this mail. I just noticed them. I
>> am including logs from as far back as I have, just for completeness.
>>
>> This mainboard:
>>
>> # smbios-sys-info
>> Libsmbios version:  2.3.2
>> Product Name:   Z10PE-D8 WS
>> Vendor: ASUSTeK COMPUTER INC.
>> BIOS Version:   3703
>>
>> ... has given me a lot of grief, so I've been mucking about with it
>> quite a lot. I don't know if I did anything on september 1. Could it be
>> some bios-setting I've changed, or som firmware corruption ? Other
>> weirdness happening recently is sensors have started having episodes of
>> all values totally bogus, and one pwm fan-sensor seems to be permanently
>> slightly loony. 
>>
>> OS running is kept up-to-date ~amd64 gentoo-sources running as Dom0
>> under Xen-4.11 latest.
>>
>> --
>>
>>
>> 0:root@gentoo log # zgrep 'pcieport :00:03.0' kern.log* | sed -e
>> 's/^[^:]*://' | sort -M
>> Aug 19 19:38:55 gentoo kernel: [1.181466] pcieport :00:03.0: AER
>> enabled with IRQ 134
>> Aug 19 19:38:55 gentoo kernel: [1.181700] pcieport :00:03.0:
>> Signaling PME with IRQ 134
> Did you enable CONFIG_PCIEAER_INJECT in your kernel?
Yes, enabled already.
>
> What do you get when you boot with the kernel option:
>
> pcie_ports=auto
>
Still happens I'm afraid.

# cat /proc/cmdline
placeholder root=LABEL=SAS-GENTOO ro
xen-pciback.hide=(02:00.*)(04:00.*)(05:00.*)(08:00.*)(81:00.*)(82:00.*)
usbcore.autosuspend=-1 scsi_mod.use_blk_mq=1 xen-netback.max-queues=8
xen-pciback.permissive=1 xen-pciback.verbose_request=1
xen-blkback.max_persistent_grants=1024 intel_iommu=on domodules domdadm 
console=tty1 pcie_ports=auto

 # dmesg -c
[13701.614600] pcieport :00:03.0: AER: Corrected error received:
:00:03.0
[13701.614607] pcieport :00:03.0: PCIe Bus Error:
severity=Corrected, type=Data Link Layer, (Transmitter ID)
[13701.614609] pcieport :00:03.0:   device [8086:2f08] error
status/mask=1000/2000
[13701.614610] pcieport :00:03.0:    [12] Replay Timer Timeout 
[24039.831939] pcieport :00:03.0: AER: Corrected error received:
:00:03.0
[24039.831952] pcieport :00:03.0: PCIe Bus Error:
severity=Corrected, type=Data Link Layer, (Transmitter ID)
[24039.831955] pcieport :00:03.0:   device [8086:2f08] error
status/mask=1000/2000
[24039.831958] pcieport :00:03.0:    [12] Replay Timer Timeout 
0:root@gentoo ~ #



Re: [gentoo-user] [OT] AER: Corrected error received: 0000:00:03.0

2018-09-27 Thread Mick
On Thursday, 27 September 2018 21:51:42 BST Håkon Alstadheim wrote:
> (Sorry for the OT, don't know where to go for generic hardware questions)
> 
> I'm wondering if my main board dying? On sept. 1 I started getting error
> messages like the ones at the  end of this mail. I just noticed them. I
> am including logs from as far back as I have, just for completeness.
> 
> This mainboard:
> 
> # smbios-sys-info
> Libsmbios version:  2.3.2
> Product Name:   Z10PE-D8 WS
> Vendor: ASUSTeK COMPUTER INC.
> BIOS Version:   3703
> 
> ... has given me a lot of grief, so I've been mucking about with it
> quite a lot. I don't know if I did anything on september 1. Could it be
> some bios-setting I've changed, or som firmware corruption ? Other
> weirdness happening recently is sensors have started having episodes of
> all values totally bogus, and one pwm fan-sensor seems to be permanently
> slightly loony. 
> 
> OS running is kept up-to-date ~amd64 gentoo-sources running as Dom0
> under Xen-4.11 latest.
> 
> --
> 
> 
> 0:root@gentoo log # zgrep 'pcieport :00:03.0' kern.log* | sed -e
> 's/^[^:]*://' | sort -M
> Aug 19 19:38:55 gentoo kernel: [1.181466] pcieport :00:03.0: AER
> enabled with IRQ 134
> Aug 19 19:38:55 gentoo kernel: [1.181700] pcieport :00:03.0:
> Signaling PME with IRQ 134

Did you enable CONFIG_PCIEAER_INJECT in your kernel?

What do you get when you boot with the kernel option:

pcie_ports=auto

-- 
Regards,
Mick

signature.asc
Description: This is a digitally signed message part.