On 21 December 2017 at 09:59, Ni, Ruiyu <[email protected]> wrote: > On 12/21/2017 5:52 PM, Ard Biesheuvel wrote: >> >> On 21 December 2017 at 09:48, Ni, Ruiyu <[email protected]> wrote: >>> >>> On 12/21/2017 5:14 PM, Guo Heyi wrote: >>>> >>>> >>>> On Thu, Dec 21, 2017 at 08:32:37AM +0000, Ard Biesheuvel wrote: >>>>> >>>>> >>>>> On 21 December 2017 at 08:27, Guo Heyi <[email protected]> wrote: >>>>>> >>>>>> >>>>>> On Wed, Dec 20, 2017 at 03:26:45PM +0000, Ard Biesheuvel wrote: >>>>>>> >>>>>>> >>>>>>> On 20 December 2017 at 15:17, gary guo <[email protected]> wrote: >>>>>>>> >>>>>>>> >>>>>>>> On Wed, Dec 20, 2017 at 09:13:58AM +0000, Ard Biesheuvel wrote: >>>>>>>>> >>>>>>>>> >>>>>>>>> Hi Heyi, >>>>>>>>> >>>>>>>>> On 20 December 2017 at 08:21, Heyi Guo <[email protected]> wrote: >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> PCIe on some ARM platforms requires address translation, not only >>>>>>>>>> for >>>>>>>>>> legacy IO access, but also for 32bit memory BAR access as well. >>>>>>>>>> There >>>>>>>>>> will be "Address Translation Unit" or something similar in PCI >>>>>>>>>> host >>>>>>>>>> bridges to translation CPU address to PCI address and vice versa. >>>>>>>>>> So >>>>>>>>>> we think it may be useful to add address translation support to >>>>>>>>>> the >>>>>>>>>> generic PCI host bridge driver. >>>>>>>>>> >>>>>>>>> >>>>>>>>> I agree. While unusual on a PC, it is quite common on other >>>>>>>>> architectures to have more complex non 1:1 topologies, which >>>>>>>>> currently >>>>>>>>> require a forked PciHostBridgeDxe driver with local changes >>>>>>>>> applied. >>>>>>>>> >>>>>>>>>> This RFC only contains one minor change to the definition of >>>>>>>>>> PciHostBridgeLib, and there certainly will be a lot of other >>>>>>>>>> changes >>>>>>>>>> to make it work, including: >>>>>>>>>> >>>>>>>>>> 1. Use CPU address for GCD space add and allocate operations, >>>>>>>>>> instead >>>>>>>>>> of PCI address; also IO space will be changed to memory space if >>>>>>>>>> translation exists. >>>>>>>>>> >>>>>>>>> >>>>>>>>> For I/O space, the translation should simply be applied to the I/O >>>>>>>>> range. I don't think it makes sense to use memory space here, given >>>>>>>>> that it is specific to architectures that lack native port I/O. >>>>>>>>> >>>>>>>> >>>>>>>> I made an assumption here that platforms supporting real port IO >>>>>>>> space >>>>>>>> do not need address translation, like IA32 and X64, and port IO >>>>>>>> translation implies the platform does not support real port IO >>>>>>>> space. >>>>>>>> >>>>>>> >>>>>>> This may be a reasonable assumption. But I still think it is better >>>>>>> not to encode any assumptions in the first place. >>>>>>> >>>>>>>> Indeed the assumption is not so "generic", so I'll agree if you >>>>>>>> recommend to support IO to IO translation as well. But I still hope >>>>>>>> to >>>>>>>> have IO to memory translation support in PCI host bridge driver, >>>>>>>> rather than in CPU IO protocol, since the faked IO space might only >>>>>>>> be >>>>>>>> used for PCI host bridge and we may have overlapping IO ranges for >>>>>>>> each host bridge, which is compatible with PCIe specification and >>>>>>>> PCIe >>>>>>>> ACPI description. >>>>>>>> >>>>>>> >>>>>>> That is fine. Under UEFI, these will translate to non-overlapping I/O >>>>>>> spaces in the CPU's view. Under the OS, this could be totally >>>>>>> different. >>>>>>> >>>>>>> >>>>>>> For example, >>>>>>> >>>>>>> RC0 IO 0x0000 .. 0xffff -> CPU 0x00000 .. 0x0ffff >>>>>>> RC1 IO 0x0000 .. 0xffff -> CPU 0x10000 .. 0x1ffff >>>>>>> >>>>>>> This is very similar to how MMIO translation works, and makes I/O >>>>>>> devices behind the host bridges uniquely addressable for drivers. >>>>>>> >>>>>>> For our understanding, could you share the host bridge configuration >>>>>>> that you are targetting? >>>>>> >>>>>> >>>>>> >>>>>> IO translation on one of our platforms is like below: >>>>>> >>>>>> PCI IO space CPU memory space >>>>>> 0x0000 .. 0xffff -> 0xafff0000 .. 0xafffffff >>>>>> (The sizes are always 0x10000 so I will omit the limit for others) >>>>>> 0x0000 .. 0xffff -> 0x8abff0000 >>>>>> 0x0000 .. 0xffff -> 0x8b7ff0000 >>>>>> ...... >>>>>> >>>>>> The translated addresses may be beyond 32bit address, will this >>>>>> violate IO space limitation? From EDK2 code, I didn't see such >>>>>> limitation for IO space. >>>>>> >>>>> >>>>> The MMIO address will not be used for I/O port addressing by the CPU. >>>>> The MMIO to IO translation is an implementation detail of your CpuIo2 >>>>> protocol implementation. >>>>> >>>>> So there will be two stacked translations, one for PCI I/O to CPU I/O, >>>>> and one for CPU I/O to CPU MMIO. The latter is transparent to the PCI >>>>> host bridge driver. >>>> >>>> >>>> >>>> Yes this should work. >>>> >>>> Hi Star, Eric and Ruiyu, >>>> >>>> Any comments on this RFC? >>> >>> >>> >>> Let me confirm my understanding: >>> The PciHostBridge core driver/library interface changes only >>> take care of the MMIO translation. >>> >>> Heyi you will implement a special CpuIo driver in your >>> platform code to take care of the IO to MMIO translation. >>> >>> But let me confirm, will you need to additional translate >>> the MMIO (translated from IO) to another MMIO using an offset? >>> If yes, will you handle that translation in your CpuIo driver? >>> >> >> Hi Ray, >> >> The issue is that several PCIe root complexes have colliding I/O ranges: > > > Ard, > The IO-MMIO mapping needs CPU support. I am not sure whether IA32 or > x64 supports. > But I guess ARM supports. right? >
Yes. Exposing PCI I/O ranges via MMIO translation is specific to the CpuIo2 implementations we have for ARM. > Will all the IO part be implemented in ARM CpuIo2 protocol? > No. The CPU to PCI I/O translation needs to be in PciHostBridgeDxe, because it is in charge of allocating the GCD space, and without translation, those allocations will collide. Also, while perhaps non-existent in reality, it is imaginable that a host bridge could translate port I/O addresses between the two sides of the bridge, similar to how non 1:1 mapped PCI MMIO is handled. So just like MMIO, the port I/O address used by the CPU, and the port I/O address programmed into the device BAR could be subject to translation. This is not solvable in the CpuIo2 protocol, because without translation at the host bridge driver level, a CPU port I/O address is ambiguous: e.g., address 0x1000 may apply to each of the RCs. >> >>>>>> PCI IO space CPU memory space >>>>>> 0x0000 .. 0xffff -> 0xafff0000 .. 0xafffffff >>>>>> (The sizes are always 0x10000 so I will omit the limit for others) >>>>>> 0x0000 .. 0xffff -> 0x8abff0000 >>>>>> 0x0000 .. 0xffff -> 0x8b7ff0000 >> >> >> So the CPU view is different from the PCI view, and to create a single >> CPU I/O space where all I/O port ranges behind all host bridges are >> accessible, we need I/O translation for the CPU. This will result in >> an intermediate representation >> >>>>>> PCI IO space CPU IO space >>>>>> 0x0000 .. 0xffff -> 0x00000 .. 0x0ffff >>>>>> 0x0000 .. 0xffff -> 0x10000 .. 0x1ffff >>>>>> 0x0000 .. 0xffff -> 0x20000 .. 0x2ffff >> >> >> On top of that, given that ARM has no native port I/O instructions, we >> will need to implement MMIO to IO translation, but this can be >> implemented in the CpuIo2 protocol. >> _______________________________________________ >> edk2-devel mailing list >> [email protected] >> https://lists.01.org/mailman/listinfo/edk2-devel >> > > > -- > Thanks, > Ray _______________________________________________ edk2-devel mailing list [email protected] https://lists.01.org/mailman/listinfo/edk2-devel

