At high performance systems:

Wikipedia says (#1):

As of 2010, the fastest six-core PC processor reaches 109 gigaFLOPS (Intel Core i7 980 XE)[16] in double precision calculations. GPUs are considerably more powerful. For example, Nvidia Tesla C2050 GPU computing processors perform around 515 gigaFLOPS[17] in double precision calculations, and the AMD FireStream 9270 peaks at 240 gigaFLOPS.


i7-980X @ 32nm = 239 mm sq. (#2) 109 GFlops/d... 130W max
  C2050 @ 40nm = 526 mm sq. (#3) 515 GFlops/d... 247W max


The power consumption scales with chip, the performance at GPU is however the double of the CPU's (that might be due to limited, but massively parallel architecture).

I bet today you can get multiple of that performance in the GPU - the limit is the way the heat is removed, if somebody makes a cooler capable of dissipating 1kW from 2000 mm.sq area, I bet there would be such chips :)

By taking into account that nvidia is doing its best to design a power efficient core (their primary target is high-performance) and they are ahead of Intel in terms of efficiency, it leaves only a little chance to you.

The question is how can you do better then them?

Maybe just make a small GPU with low core count - there are already to be found in devices targetting the mobile market (powervr, mali, etc) so in reality what you can do is just to make yours easier to use and well documented. There is a big chance that none of the embedded cores will have public documentation in near future.

From my point of view (as a board designer), I would really welcome a discrete PCIe graphics, which has its ddr memory dies integrated in the package. That saves a lot of design time, and you can guarantee the performance/behaviour of the combination.

And there are then interfaces like C2C (found in Omap5 from TI) that let you share the processors ram for the sub-processor cores (like the baseband processor) so there is no need for a separate memory die. But I would not go this way, an integrated ddr3/gddr die would be nicer.

Daniel



[1] http://en.wikipedia.org/wiki/FLOPS

[2] http://ark.intel.com/products/47932/Intel-Core-i7-980X-Processor-
Extreme-Edition-12M-Cache-3_33-GHz-6_40-GTs-Intel-QPI

[3] http://www.techpowerup.com/gpuz/7rgnu/

[4] http://www.siliconmechanics.com/files/C2050Benchmarks.pdf





On 12/10/2012 10:50 PM, Timothy Normand Miller wrote:
To motivate the energy efficiency problem with GPUs, it would be good to
have some statistics, and I know that some people on this list have
their fingers on this information.  I'm trying to finish my CCF proposal
due Monday, and this could REALLY help.

Given state of the art CPUs and GPUs with the same process tech, what
are their transistor counts, per die?  How about total area (mm^2)?

What is the relative power draw per die?  (Keep in mind that some GPUs
are multi-chip)

The more clearly and specifically I can show that GPUs have urgent power
draw problems, the more reviewers are likely to fund the work.

Thanks.

--
Timothy Normand Miller, PhD
Assistant Professor of Computer Science, Binghamton University
http://www.cs.binghamton.edu/~millerti/
<http://www.cse.ohio-state.edu/~millerti>
Open Graphics Project



_______________________________________________
Open-graphics mailing list
[email protected]
http://lists.duskglow.com/mailman/listinfo/open-graphics
List service provided by Duskglow Consulting, LLC (www.duskglow.com)

_______________________________________________
Open-graphics mailing list
[email protected]
http://lists.duskglow.com/mailman/listinfo/open-graphics
List service provided by Duskglow Consulting, LLC (www.duskglow.com)

Reply via email to