On Tue, Jan 02, 2018 at 03:56:14PM +0800, Ni, Ruiyu wrote:
> On 12/26/2017 2:50 PM, Guo Heyi wrote:
> > Hi Ard, Ray,
> > 
> > Have we come to the final conclusion? Or are we still waiting for more 
> > comments on this?
> 
> Heyi,
> I think you can send out a draft version of changes for better
> understanding.

Sure, we can do that.
Thanks,

Gary

> 
> > 
> > Thanks,
> > 
> > Gary
> > 
> > On Thu, Dec 21, 2017 at 10:07:51AM +0000, Ard Biesheuvel wrote:
> > > On 21 December 2017 at 09:59, Ni, Ruiyu <ruiyu...@intel.com> wrote:
> > > > On 12/21/2017 5:52 PM, Ard Biesheuvel wrote:
> > > > > 
> > > > > On 21 December 2017 at 09:48, Ni, Ruiyu <ruiyu...@intel.com> wrote:
> > > > > > 
> > > > > > On 12/21/2017 5:14 PM, Guo Heyi wrote:
> > > > > > > 
> > > > > > > 
> > > > > > > On Thu, Dec 21, 2017 at 08:32:37AM +0000, Ard Biesheuvel wrote:
> > > > > > > > 
> > > > > > > > 
> > > > > > > > On 21 December 2017 at 08:27, Guo Heyi <heyi....@linaro.org> 
> > > > > > > > wrote:
> > > > > > > > > 
> > > > > > > > > 
> > > > > > > > > On Wed, Dec 20, 2017 at 03:26:45PM +0000, Ard Biesheuvel 
> > > > > > > > > wrote:
> > > > > > > > > > 
> > > > > > > > > > 
> > > > > > > > > > On 20 December 2017 at 15:17, gary guo 
> > > > > > > > > > <heyi....@linaro.org> wrote:
> > > > > > > > > > > 
> > > > > > > > > > > 
> > > > > > > > > > > On Wed, Dec 20, 2017 at 09:13:58AM +0000, Ard Biesheuvel 
> > > > > > > > > > > wrote:
> > > > > > > > > > > > 
> > > > > > > > > > > > 
> > > > > > > > > > > > Hi Heyi,
> > > > > > > > > > > > 
> > > > > > > > > > > > On 20 December 2017 at 08:21, Heyi Guo 
> > > > > > > > > > > > <heyi....@linaro.org> wrote:
> > > > > > > > > > > > > 
> > > > > > > > > > > > > 
> > > > > > > > > > > > > PCIe on some ARM platforms requires address 
> > > > > > > > > > > > > translation, not only
> > > > > > > > > > > > > for
> > > > > > > > > > > > > legacy IO access, but also for 32bit memory BAR 
> > > > > > > > > > > > > access as well.
> > > > > > > > > > > > > There
> > > > > > > > > > > > > will be "Address Translation Unit" or something 
> > > > > > > > > > > > > similar in PCI
> > > > > > > > > > > > > host
> > > > > > > > > > > > > bridges to translation CPU address to PCI address and 
> > > > > > > > > > > > > vice versa.
> > > > > > > > > > > > > So
> > > > > > > > > > > > > we think it may be useful to add address translation 
> > > > > > > > > > > > > support to
> > > > > > > > > > > > > the
> > > > > > > > > > > > > generic PCI host bridge driver.
> > > > > > > > > > > > > 
> > > > > > > > > > > > 
> > > > > > > > > > > > I agree. While unusual on a PC, it is quite common on 
> > > > > > > > > > > > other
> > > > > > > > > > > > architectures to have more complex non 1:1 topologies, 
> > > > > > > > > > > > which
> > > > > > > > > > > > currently
> > > > > > > > > > > > require a forked PciHostBridgeDxe driver with local 
> > > > > > > > > > > > changes
> > > > > > > > > > > > applied.
> > > > > > > > > > > > 
> > > > > > > > > > > > > This RFC only contains one minor change to the 
> > > > > > > > > > > > > definition of
> > > > > > > > > > > > > PciHostBridgeLib, and there certainly will be a lot 
> > > > > > > > > > > > > of other
> > > > > > > > > > > > > changes
> > > > > > > > > > > > > to make it work, including:
> > > > > > > > > > > > > 
> > > > > > > > > > > > > 1. Use CPU address for GCD space add and allocate 
> > > > > > > > > > > > > operations,
> > > > > > > > > > > > > instead
> > > > > > > > > > > > > of PCI address; also IO space will be changed to 
> > > > > > > > > > > > > memory space if
> > > > > > > > > > > > > translation exists.
> > > > > > > > > > > > > 
> > > > > > > > > > > > 
> > > > > > > > > > > > For I/O space, the translation should simply be applied 
> > > > > > > > > > > > to the I/O
> > > > > > > > > > > > range. I don't think it makes sense to use memory space 
> > > > > > > > > > > > here, given
> > > > > > > > > > > > that it is specific to architectures that lack native 
> > > > > > > > > > > > port I/O.
> > > > > > > > > > > > 
> > > > > > > > > > > 
> > > > > > > > > > > I made an assumption here that platforms supporting real 
> > > > > > > > > > > port IO
> > > > > > > > > > > space
> > > > > > > > > > > do not need address translation, like IA32 and X64, and 
> > > > > > > > > > > port IO
> > > > > > > > > > > translation implies the platform does not support real 
> > > > > > > > > > > port IO
> > > > > > > > > > > space.
> > > > > > > > > > > 
> > > > > > > > > > 
> > > > > > > > > > This may be a reasonable assumption. But I still think it 
> > > > > > > > > > is better
> > > > > > > > > > not to encode any assumptions in the first place.
> > > > > > > > > > 
> > > > > > > > > > > Indeed the assumption is not so "generic", so I'll agree 
> > > > > > > > > > > if you
> > > > > > > > > > > recommend to support IO to IO translation as well. But I 
> > > > > > > > > > > still hope
> > > > > > > > > > > to
> > > > > > > > > > > have IO to memory translation support in PCI host bridge 
> > > > > > > > > > > driver,
> > > > > > > > > > > rather than in CPU IO protocol, since the faked IO space 
> > > > > > > > > > > might only
> > > > > > > > > > > be
> > > > > > > > > > > used for PCI host bridge and we may have overlapping IO 
> > > > > > > > > > > ranges for
> > > > > > > > > > > each host bridge, which is compatible with PCIe 
> > > > > > > > > > > specification and
> > > > > > > > > > > PCIe
> > > > > > > > > > > ACPI description.
> > > > > > > > > > > 
> > > > > > > > > > 
> > > > > > > > > > That is fine. Under UEFI, these will translate to 
> > > > > > > > > > non-overlapping I/O
> > > > > > > > > > spaces in the CPU's view. Under the OS, this could be 
> > > > > > > > > > totally
> > > > > > > > > > different.
> > > > > > > > > > 
> > > > > > > > > > 
> > > > > > > > > > For example,
> > > > > > > > > > 
> > > > > > > > > > RC0 IO 0x0000 .. 0xffff -> CPU 0x00000 .. 0x0ffff
> > > > > > > > > > RC1 IO 0x0000 .. 0xffff -> CPU 0x10000 .. 0x1ffff
> > > > > > > > > > 
> > > > > > > > > > This is very similar to how MMIO translation works, and 
> > > > > > > > > > makes I/O
> > > > > > > > > > devices behind the host bridges uniquely addressable for 
> > > > > > > > > > drivers.
> > > > > > > > > > 
> > > > > > > > > > For our understanding, could you share the host bridge 
> > > > > > > > > > configuration
> > > > > > > > > > that you are targetting?
> > > > > > > > > 
> > > > > > > > > 
> > > > > > > > > 
> > > > > > > > > IO translation on one of our platforms is like below:
> > > > > > > > > 
> > > > > > > > > PCI IO space        CPU memory space
> > > > > > > > > 0x0000 .. 0xffff -> 0xafff0000 .. 0xafffffff
> > > > > > > > > (The sizes are always 0x10000 so I will omit the limit for 
> > > > > > > > > others)
> > > > > > > > > 0x0000 .. 0xffff -> 0x8abff0000
> > > > > > > > > 0x0000 .. 0xffff -> 0x8b7ff0000
> > > > > > > > > ......
> > > > > > > > > 
> > > > > > > > > The translated addresses may be beyond 32bit address, will 
> > > > > > > > > this
> > > > > > > > > violate IO space limitation? From EDK2 code, I didn't see such
> > > > > > > > > limitation for IO space.
> > > > > > > > > 
> > > > > > > > 
> > > > > > > > The MMIO address will not be used for I/O port addressing by 
> > > > > > > > the CPU.
> > > > > > > > The MMIO to IO translation is an implementation detail of your 
> > > > > > > > CpuIo2
> > > > > > > > protocol implementation.
> > > > > > > > 
> > > > > > > > So there will be two stacked translations, one for PCI I/O to 
> > > > > > > > CPU I/O,
> > > > > > > > and one for CPU I/O to CPU MMIO. The latter is transparent to 
> > > > > > > > the PCI
> > > > > > > > host bridge driver.
> > > > > > > 
> > > > > > > 
> > > > > > > 
> > > > > > > Yes this should work.
> > > > > > > 
> > > > > > > Hi Star, Eric and Ruiyu,
> > > > > > > 
> > > > > > > Any comments on this RFC?
> > > > > > 
> > > > > > 
> > > > > > 
> > > > > > Let me confirm my understanding:
> > > > > > The PciHostBridge core driver/library interface changes only
> > > > > > take care of the MMIO translation.
> > > > > > 
> > > > > > Heyi you will implement a special CpuIo driver in your
> > > > > > platform code to take care of the IO to MMIO translation.
> > > > > > 
> > > > > > But let me confirm, will you need to additional translate
> > > > > > the MMIO (translated from IO) to another MMIO using an offset?
> > > > > > If yes, will you handle that translation in your CpuIo driver?
> > > > > > 
> > > > > 
> > > > > Hi Ray,
> > > > > 
> > > > > The issue is that several PCIe root complexes have colliding I/O 
> > > > > ranges:
> > > > 
> > > > 
> > > > Ard,
> > > > The IO-MMIO mapping needs CPU support. I am not sure whether IA32 or
> > > > x64 supports.
> > > > But I guess ARM supports. right?
> > > > 
> > > 
> > > Yes. Exposing PCI I/O ranges via MMIO translation is specific to the
> > > CpuIo2 implementations we have for ARM.
> > > 
> > > > Will all the IO part be implemented in ARM CpuIo2 protocol?
> > > > 
> > > 
> > > No. The CPU to PCI I/O translation needs to be in PciHostBridgeDxe,
> > > because it is in charge of allocating the GCD space, and without
> > > translation, those allocations will collide. Also, while perhaps
> > > non-existent in reality, it is imaginable that a host bridge could
> > > translate port I/O addresses between the two sides of the bridge,
> > > similar to how non 1:1 mapped PCI MMIO is handled.
> > > 
> > > So just like MMIO, the port I/O address used by the CPU, and the port
> > > I/O address programmed into the device BAR could be subject to
> > > translation.
> > > 
> > > This is not solvable in the CpuIo2 protocol, because without
> > > translation at the host bridge driver level, a CPU port I/O address is
> > > ambiguous: e.g., address 0x1000 may apply to each of the RCs.
> > > 
> > > 
> > > > > 
> > > > > > > > > PCI IO space        CPU memory space
> > > > > > > > > 0x0000 .. 0xffff -> 0xafff0000 .. 0xafffffff
> > > > > > > > > (The sizes are always 0x10000 so I will omit the limit for 
> > > > > > > > > others)
> > > > > > > > > 0x0000 .. 0xffff -> 0x8abff0000
> > > > > > > > > 0x0000 .. 0xffff -> 0x8b7ff0000
> > > > > 
> > > > > 
> > > > > So the CPU view is different from the PCI view, and to create a single
> > > > > CPU I/O space where all I/O port ranges behind all host bridges are
> > > > > accessible, we need I/O translation for the CPU. This will result in
> > > > > an intermediate representation
> > > > > 
> > > > > > > > > PCI IO space        CPU IO space
> > > > > > > > > 0x0000 .. 0xffff -> 0x00000 .. 0x0ffff
> > > > > > > > > 0x0000 .. 0xffff -> 0x10000 .. 0x1ffff
> > > > > > > > > 0x0000 .. 0xffff -> 0x20000 .. 0x2ffff
> > > > > 
> > > > > 
> > > > > On top of that, given that ARM has no native port I/O instructions, we
> > > > > will need to implement MMIO to IO translation, but this can be
> > > > > implemented in the CpuIo2 protocol.
> > > > > _______________________________________________
> > > > > edk2-devel mailing list
> > > > > edk2-devel@lists.01.org
> > > > > https://lists.01.org/mailman/listinfo/edk2-devel
> > > > > 
> > > > 
> > > > 
> > > > --
> > > > Thanks,
> > > > Ray
> > _______________________________________________
> > edk2-devel mailing list
> > edk2-devel@lists.01.org
> > https://lists.01.org/mailman/listinfo/edk2-devel
> > 
> 
> 
> -- 
> Thanks,
> Ray
_______________________________________________
edk2-devel mailing list
edk2-devel@lists.01.org
https://lists.01.org/mailman/listinfo/edk2-devel

Reply via email to