Hi Zach,

On 06/05/2016 10:43, Zach Davis wrote:
> Thanks Freddie, I had device-id set to local-rank, and I intended a
> value of 1 here. All is well. It appears that throwing the entire
> grid as a single partition on my discrete GPU via OpenCL runs much
> slower than partitioning the grid into 4 parts and running on the
> CPU. Does this make sense to you? Any idea as to why I’m not
> getting solution output files?

In general 2D cases are not great from a benchmark standpoint. They
tend to be highly limited by available memory bandwidth and are often
too small to sufficiently load a capable GPU. Moreover, on a good day
it is possible to fit an extremely large portion of the working set into
the last level cache of a modern CPU; at this point it really is no contest.
Hence, it is not unusual for modest CPUs to substantially outperform
high end GPUs in 2D cases.
For file output: rename the section to [soln-plugin-writer] and make
sure that the basename uses {} for variable substitution.
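
As a rough illustration of what the {} substitution means, here is a small Python sketch. It is not PyFR's own code: the option values, the "cylinder" prefix, the .pyfrs suffix handling and the output_name helper are all assumptions made up for the example. Only the section name and the use of {} in the basename come from the advice above.

```python
from configparser import ConfigParser

# Illustrative config fragment. The section name and the {} style of the
# basename follow the advice above; the option values are assumptions.
CFG = """
[soln-plugin-writer]
dt-out = 0.5
basedir = .
basename = cylinder-{t:.2f}
"""


def output_name(cfg_text, t):
    """Hypothetical helper: expand the basename for solution time t."""
    # Disable %-interpolation so the value is read back exactly as written.
    cfg = ConfigParser(interpolation=None)
    cfg.read_string(cfg_text)
    basename = cfg.get('soln-plugin-writer', 'basename')
    # The {t:.2f} field is filled in with str.format-style substitution.
    return basename.format(t=t) + '.pyfrs'


print(output_name(CFG, 1.5))
```

A basename without a {} field would expand to the same name at every output time, so it is worth checking that the field is present.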
Regards, Freddie.
