Hi Daniel,

your problem is that the system BIOS is buggy and doesn't assign resources to the card:
    Region 0: Memory at <ignored> (64-bit, prefetchable)
    Region 2: Memory at <ignored> (64-bit, prefetchable)
    Region 4: I/O ports at 9000 [size=256]
    Region 5: Memory at <ignored> (32-bit, non-prefetchable)
    Expansion ROM at <ignored> [disabled]

The kernel actually tries to assign resources to the bridges, but fails as well because the BIOS didn't reserved any during startup.
[    0.179743] pci 0000:12:00.0: can't claim BAR 14 [mem 0x01c00000-0xef0fffff]: no compatible bridge window [    0.179745] pci 0000:12:00.0: [mem 0x01c00000-0xef0fffff] clipped to [mem 0xef000000-0xef0fffff] [    0.179747] pci 0000:12:00.0:   bridge window [mem 0xef000000-0xef0fffff] [    0.179751] pci 0000:13:01.0: can't claim BAR 14 [mem 0x01c00000-0x01ffffff]: no compatible bridge window [    0.179753] pci 0000:14:00.0: can't claim BAR 14 [mem 0x01c00000-0x01ffffff]: no compatible bridge window [    0.179754] pci 0000:15:00.0: can't claim BAR 14 [mem 0x01d00000-0x01dfffff]: no compatible bridge window [    0.179756] pci 0000:08:04.0: can't claim BAR 13 [io 0xb000-0xcfff]: address conflict with PCI Bus 0000:12 [io 0x9000-0xbfff] [    0.179782] pci 0000:14:00.0: can't claim BAR 0 [mem 0x01c00000-0x01c03fff]: no compatible bridge window [    0.179789] pci 0000:16:00.0: can't claim BAR 0 [mem 0xd0000000-0xdfffffff 64bit pref]: no compatible bridge window [    0.179791] pci 0000:16:00.0: can't claim BAR 2 [mem 0xe0200000-0xe03fffff 64bit pref]: no compatible bridge window [    0.179793] pci 0000:16:00.0: can't claim BAR 5 [mem 0x01d00000-0x01d7ffff]: no compatible bridge window [    0.179798] pci 0000:16:00.1: can't claim BAR 0 [mem 0x01da0000-0x01da3fff]: no compatible bridge window

There isn't much you can do except for trying to update the BIOS and if that doesn't help replace your motherboard.

Regards,
Christian.


Am 09.04.2018 um 15:33 schrieb Daniel Moran:
Christian,
Andrey,

Thank you for the responses.
Here's the requested dmesg/lspci. Also pulled journalctl just in case but didn't see anything that stands out.

I'll take another look at the BIOS settings to see if anything else may explain the memory error. I've got 16GB in the system at the moment, can bump up to 32 - also added a larger swap just in case that was the issue. (No change.)

As always thank you for your continued time and support.

Respectfully,
Daniel S. Moran (garwynn)
PC Hardware Editor - XDA-Developers
Phone: 1-559-316-0760/+81-90-5484-4155
Article Links: http://www.xda-developers.com/author/garwynn
E-mail: xdagarw...@gmail.com <mailto:xdagarw...@gmail.com> | Twitter: @xdagarwynn

On Mon, Apr 9, 2018 at 3:52 PM, Christian König <christian.koe...@amd.com <mailto:christian.koe...@amd.com>> wrote:

    Please provide the full dmesg of the system as well as the output
    of "lspci -s 0000:16:00.0 -vvvv" as attachment.

    Thanks,
    Christian.

    Am 09.04.2018 um 06:00 schrieb Andrey Grodzovsky:

    Just from a quick look it seems to fail in
    amdgpu_device_init->ioremap with ENOMEM, that would explain why
    you don't see any more prints - this failure is very early in the
    device init process.

    No idea why ioremap would fail in this case and not even sure
    which implementation of ioremap to look into for your case.

    Adding Christian for this.

    Andrey


    On 04/07/2018 03:16 AM, Daniel Moran wrote:
    Also, to clarify... if I move it into a regular slot, turn off
    the eGPU it works as expected.
    Tested with Intel iGPU enabled and disabled, made sure i915
    loaded without error and can connect display to it.



    Again, thank you in advance for any time/support offered.

    Respectfully,
    Daniel S. Moran (garwynn)
    PC Hardware Editor - XDA-Developers
    Phone: 1-559-316-0760/+81-90-5484-4155
    Article Links: http://www.xda-developers.com/author/garwynn
    <http://www.xda-developers.com/author/garwynn>
    E-mail: xdagarw...@gmail.com <mailto:xdagarw...@gmail.com> |
    Twitter: @xdagarwynn

    On Sat, Apr 7, 2018 at 3:58 PM, Daniel Moran
    <xdagarw...@gmail.com <mailto:xdagarw...@gmail.com>> wrote:

        Hello all,

        I've got a Powercolor Red Devil Vega 56 here that I'm trying
        to get working in eGPU mode.
        I think on the BIOS/hardware side it's now all fleshed out.
        Now I'm at a point where amdgpu tries to init and reaches a
        fatal error.

        Set loglevel=8 doesn't get any additional messages.
        Here's what it does report (full dmesg attached):

        [  429.005909] [drm] amdgpu kernel modesetting enabled.
        [  429.006080] [drm] initializing kernel modesetting (VEGA10
        0x1002:0x687F 0x148C:0x2388 0xC3).
        [  429.006082] amdgpu 0000:16:00.0: Fatal error during GPU init
        [  429.006155] amdgpu: probe of 0000:16:00.0 failed with
        error -12

        Using the following commands to unload & reload for testing.
        Since it's as an eGPU I'm using the i7-7700K iGPU (i915
        module) as the primary and these commands work in terminal
        without requiring a reboot.

        sudo rmmod amdgpu
        sudo modprobe -v amgpu

        Pulled the UMR and tried to make, fails on Cmake. I'll
        attach log in a text.
        Also will attach a full dmesg and lspci dump. uname -a below:
        /Linux testbox 4.15.15-041515-generic #201803311331 SMP Sat
        Mar 31 17:34:21 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux/

        Any other ideas on how I can debug this further? Feel I'm so
        close, don't want to let this go.
        Thank you in advance for your time.

        Respectfully,
        Daniel S. Moran (garwynn)
        PC Hardware Editor - XDA-Developers
        Phone: 1-559-316-0760/+81-90-5484-4155
        Article Links: http://www.xda-developers.com/author/garwynn
        <http://www.xda-developers.com/author/garwynn>
        E-mail: xdagarw...@gmail.com <mailto:xdagarw...@gmail.com> |
        Twitter: @xdagarwynn




    _______________________________________________
    amd-gfx mailing list
    amd-gfx@lists.freedesktop.org <mailto:amd-gfx@lists.freedesktop.org>
    https://lists.freedesktop.org/mailman/listinfo/amd-gfx
    <https://lists.freedesktop.org/mailman/listinfo/amd-gfx>




_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

Reply via email to