On 2025-09-04 at 05:45 +1000, John Hubbard <jhubb...@nvidia.com> wrote...
> On 9/1/25 4:55 PM, Alistair Popple wrote:
> > On 2025-08-30 at 09:55 +1000, John Hubbard <jhubb...@nvidia.com> wrote...
> > > On 8/27/25 1:19 AM, Alistair Popple wrote:
> ...
> > > > +        // SAFETY: No DMA allocations have been made yet
> > > > +        unsafe { pdev.dma_set_mask_and_coherent(DmaMask::new::<48>())? 
> > > > };
> > > 
> > > Eventually, should be 52 bits wide, rather than 48. Or so I believe from
> > > looking at various drivers, including Nouveau (which uses 52-bit for
> > > Blackwell) and mlx* (which use a 64-bit mask).
> > > 
> > > However, it works for the current series, because this series only 
> > > supports
> > > Ampere GPUs, and 48 bits suffices for those.
> > 
> > Actually based on both Nouveau and our internal docs this should be 
> > 47-bits. I
> 
> Yes. Which is why I wrote "48 bits suffices".

Sufficies most of the time, but is wrong none the less! Knowing my luck I would
run into just such a unicorn system where this would cause problems :-) So
thanks for looking at this.

> > suspect I just chose 48 during initial bring-up because that's what most 
> > CPUs
> > support but neglected to add the TODO to actually go and check this. So 
> > will fix
> > for v2.
> > 
> > > So, you could leave this patch as-is, and I'll change 48 --> 52 in the
> > > upcoming Hopper/Blackwell series. Or you can change it now.
> > 
> > We can't of course just change this to 52 bits because this needs to reflect
> > what the GPU HW supports. So ideally this needs to come from the HAL. I left
> 
> I should have been more precise. I meant, "use 52 bits, via HAL, just
> like Nouveau does".

Aha. I wasn't sure so thanks for the clarification.

> > this hard-coded because in the short-term leaving it as 47 bits even for
> > Blackwell won't cause any issues. It may force usage of an IOMMU to address
> > physical addresses greater than 47-bits when it otherwise wouldn't for
> > Hopper/Blackwell (it would always have to for Ampere/Turing), but 
> > short-term I
> > doubt many systems actually have physical memory above 47-bits anyway.
> > 
> > In other words you could leave this as 47 bits in the upcoming 
> > Hopper/Blackwell
> > series or use the HAL we have come up with (if that is available) to obtain 
> > the
> > optimal value.
> 
> Yes. I'm planning to match Nouveau's HAL approach for this, in the
> upcoming Hopper/Blackwell series.

Excellent, that sounds perfect.

> 
> thanks,
> -- 
> John Hubbard
> 

Reply via email to