Timothy Normand Miller wrote:
We're hoping to get the GPU to exceed 100MHz in the FPGA.  Maybe we
Here, you're referring to the Spartan, right? And the DMA controller (aka nanocontroller?) is separately constrained by the PCI bandwidth?

If you're targeting 66MHz 32-bit PCI (or equivalent) and the nanocontroller can do one 32-bit read or write per cycle, am I right in understanding that the nanocontroller would then need to run at well over 133MHz to keep the PCI bandwidth satisfied? I say well over because there will obviously be other instructions executed besides just reads and writes (though obviously you don't have to saturate the peak /theoretical/ bandwidth of the interface...).

BTW, the only reason to do this is because without it, a multiply
would take at least 4 times longer due to the overhead of explicit
shifts and branches.  Do we care?

I would imagine Lattice's tools still infer multipliers and I believe
there is a fast(ish) multiplier implementation for the Lattice XP
architecture using LUTs and carry chains (it's alluded to in the datasheet).

Well, it won't be fast in one cycle, and if it's pipelined, it'll be a
lot more logic than we can probably justify.

Right -- see results below.

not sure if Lattice registers have Enable inputs, or they may be
mutually exclusive with async resets.
At a glance, it looks like the asynchronous set/reset and the clock enable are independent (and there is a clock enable). I'm taking this from page 6 of http://www.latticesemi.com/documents/DS1001.pdf.

I'm going to try to get a feel for the cost of multipliers on the
Lattice part.  They haven't sent me a Synplify license yet, though.

Isn't all that free?

Yes, but it took a while for the license email to arrive.

I'm still getting to know the tools, but it looks like a 32x32->64 multiplier with registers at the inputs and outputs requires 567 slices (12% of the device) and operates at 45MHz post-PAR on LFXP10C,5 (using ispLEVER 7 & Synplify 8.8 under wine). This is using the Project Navigator defaults (Synplify targets 200MHz and achieves 61MHz; PAR targets 56MHz and achieves 45MHz -- I have no idea where or how those constraints were specified).
_______________________________________________
Open-graphics mailing list
[email protected]
http://lists.duskglow.com/mailman/listinfo/open-graphics
List service provided by Duskglow Consulting, LLC (www.duskglow.com)

Reply via email to