On 4/19/21 3:42 PM, Peter Maydell wrote: > On Thu, 25 Mar 2021 at 18:14, Philippe Mathieu-Daudé <f4...@amsat.org> wrote: >> >> Hi Peter, >> >> On 3/25/21 5:33 PM, Peter Maydell wrote: >>> Currently the gpex PCI controller implements no special behaviour for >>> guest accesses to areas of the PIO and MMIO where it has not mapped >>> any PCI devices, which means that for Arm you end up with a CPU >>> exception due to a data abort. >>> >>> Most host OSes expect "like an x86 PC" behaviour, where bad accesses >>> like this return -1 for reads and ignore writes. In the interests of >>> not being surprising, make host CPU accesses to these windows behave >>> as -1/discard where there's no mapped PCI device. >>> >>> The old behaviour generally didn't cause any problems, because >>> almost always the guest OS will map the PCI devices and then only >>> access where it has mapped them. One corner case where you will see >>> this kind of access is if Linux attempts to probe legacy ISA >>> devices via a PIO window access. So far the only case where we've >>> seen this has been via the syzkaller fuzzer. >>> >>> Reported-by: Dmitry Vyukov <dvyu...@google.com> >>> Fixes: https://bugs.launchpad.net/qemu/+bug/1918917 >>> Signed-off-by: Peter Maydell <peter.mayd...@linaro.org> >>> --- >>> v1->v2 changes: put in the hw_compat machinery. >>> >>> Still not sure if I want to put this in 6.0 or not. >>> >>> include/hw/pci-host/gpex.h | 4 +++ >>> hw/core/machine.c | 1 + >>> hw/pci-host/gpex.c | 56 ++++++++++++++++++++++++++++++++++++-- >>> 3 files changed, 58 insertions(+), 3 deletions(-) >>> >>> diff --git a/include/hw/pci-host/gpex.h b/include/hw/pci-host/gpex.h >>> index d48a020a952..fcf8b638200 100644 >>> --- a/include/hw/pci-host/gpex.h >>> +++ b/include/hw/pci-host/gpex.h >>> @@ -49,8 +49,12 @@ struct GPEXHost { >>> >>> MemoryRegion io_ioport; >>> MemoryRegion io_mmio; >>> + MemoryRegion io_ioport_window; >>> + MemoryRegion io_mmio_window; >>> qemu_irq irq[GPEX_NUM_IRQS]; >>> int irq_num[GPEX_NUM_IRQS]; >>> + >>> + bool allow_unmapped_accesses; >>> }; >>> >>> struct GPEXConfig { >>> diff --git a/hw/core/machine.c b/hw/core/machine.c >>> index 257a664ea2e..9750fad7435 100644 >>> --- a/hw/core/machine.c >>> +++ b/hw/core/machine.c >>> @@ -41,6 +41,7 @@ GlobalProperty hw_compat_5_2[] = { >>> { "PIIX4_PM", "smm-compat", "on"}, >>> { "virtio-blk-device", "report-discard-granularity", "off" }, >>> { "virtio-net-pci", "vectors", "3"}, >>> + { "gpex-pcihost", "allow-unmapped-accesses", "false" }, >>> }; >>> const size_t hw_compat_5_2_len = G_N_ELEMENTS(hw_compat_5_2); >>> >>> diff --git a/hw/pci-host/gpex.c b/hw/pci-host/gpex.c >>> index 2bdbe7b4561..a6752fac5e8 100644 >>> --- a/hw/pci-host/gpex.c >>> +++ b/hw/pci-host/gpex.c >>> @@ -83,12 +83,51 @@ static void gpex_host_realize(DeviceState *dev, Error >>> **errp) >>> int i; >>> >>> pcie_host_mmcfg_init(pex, PCIE_MMCFG_SIZE_MAX); >>> + sysbus_init_mmio(sbd, &pex->mmio); >>> + >>> + /* >>> + * Note that the MemoryRegions io_mmio and io_ioport that we pass >>> + * to pci_register_root_bus() are not the same as the >>> + * MemoryRegions io_mmio_window and io_ioport_window that we >>> + * expose as SysBus MRs. The difference is in the behaviour of >>> + * accesses to addresses where no PCI device has been mapped. >>> + * >>> + * io_mmio and io_ioport are the underlying PCI view of the PCI >>> + * address space, and when a PCI device does a bus master access >>> + * to a bad address this is reported back to it as a transaction >>> + * failure. >>> + * >>> + * io_mmio_window and io_ioport_window implement "unmapped >>> + * addresses read as -1 and ignore writes"; this is traditional >>> + * x86 PC behaviour, which is not mandated by the PCI spec proper >>> + * but expected by much PCI-using guest software, including Linux. >> >> I suspect PCI-ISA bridges to provide an EISA bus. > > I'm not sure what you mean here -- there isn't an ISA bridge > or an EISA bus involved here. This is purely about the behaviour > of the memory window the PCI host controller exposes to the CPU > (and in particular the window for when a PCI device's BAR is > set to "IO" rather than "MMIO"), though we change both here.
I guess I always interpreted the IO BAR were here to address ISA backward compatibility. I don't know well PCI so I'll study it more. Sorry for my confused comment. >>> + * In the interests of not being unnecessarily surprising, we >>> + * implement it in the gpex PCI host controller, by providing the >>> + * _window MRs, which are containers with io ops that implement >>> + * the 'background' behaviour and which hold the real PCI MRs as >>> + * subregions. >>> + */ >>> memory_region_init(&s->io_mmio, OBJECT(s), "gpex_mmio", UINT64_MAX); >>> memory_region_init(&s->io_ioport, OBJECT(s), "gpex_ioport", 64 * 1024); >>> >>> - sysbus_init_mmio(sbd, &pex->mmio); >>> - sysbus_init_mmio(sbd, &s->io_mmio); >>> - sysbus_init_mmio(sbd, &s->io_ioport); >>> + if (s->allow_unmapped_accesses) { >>> + memory_region_init_io(&s->io_mmio_window, OBJECT(s), >>> + &unassigned_io_ops, OBJECT(s), >>> + "gpex_mmio_window", UINT64_MAX); >> >> EISA -> 4 * GiB >> >> unassigned_io_ops allows 64-bit accesses. Here we want up to 32. >> >> Maybe we don't care. >> >>> + memory_region_init_io(&s->io_ioport_window, OBJECT(s), >>> + &unassigned_io_ops, OBJECT(s), >>> + "gpex_ioport_window", 64 * 1024); >> >> Ditto, unassigned_io_ops accepts 64-bit accesses. > > These are just using the same sizes as the io_mmio and io_ioport > MRs which the existing code creates. > >>> static void gpex_host_class_init(ObjectClass *klass, void *data) >>> { >>> DeviceClass *dc = DEVICE_CLASS(klass); >>> @@ -117,6 +166,7 @@ static void gpex_host_class_init(ObjectClass *klass, >>> void *data) >>> dc->realize = gpex_host_realize; >>> set_bit(DEVICE_CATEGORY_BRIDGE, dc->categories); >>> dc->fw_name = "pci"; >>> + device_class_set_props(dc, gpex_host_properties); >> >> IMO this change belongs to the parent bridges, >> TYPE_PCI_HOST_BRIDGE and TYPE_PCIE_HOST_BRIDGE. > > Arnd had a look through the kernel sources and apparently not > all PCI host controllers do this -- there are a few SoCs where the > kernel has to put in special case code to allow for the fact that > it will get a bus error for accesses to unmapped parts of the > window. So I concluded that the specific controller implementation > was the right place for it. Yes the changes are simple. I'm certainly not NAcking the patch, but can't review it neither :( So please ignore my comments.