Hi Peter,

Thanks for the links and pointers.

From the information provided by Microway:

   - 9x Intel 6540 = 11.25 TFlops (CPU taken at median flops)
   - 2x V100 = 14-16 TFlops.  

So theoretically, the 2 GPUs should offer better performance (roughly 
1.2-1.4x on these peak figures), but nowhere near the gap I've observed. 
The issue must lie somewhere else.

I'll start profiling to check whether MPI is a bottleneck (it shouldn't be 
with only 18 ranks). I'll also benchmark my BLAS and compare the result with 
other measurements found online. As I understand it, PyFR is written in 
Python and therefore relies heavily on BLAS for compute performance.
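
For the BLAS check, something along these lines could serve as a first 
sanity test. It is only a rough sketch: it assumes NumPy is installed and 
linked against the BLAS in question (which may not be the library PyFR 
actually loads), and the function names and the matrix size of 2048 are 
arbitrary choices for illustration, not anything taken from PyFR.

```python
import time


def gemm_flop_count(n):
    """Conventional flop count for an n x n by n x n matrix product."""
    return 2 * n**3


def best_time(fn, repeats=5):
    """Call fn once as a warm-up, then return the fastest of `repeats` timed calls."""
    fn()
    best = float("inf")
    for _ in range(repeats):
        t0 = time.perf_counter()
        fn()
        best = min(best, time.perf_counter() - t0)
    return best


def numpy_gemm_gflops(n=2048, repeats=5):
    """Estimate double-precision GEMM throughput (GFLOP/s) of NumPy's BLAS."""
    # Imported here so the helpers above stay dependency-free.
    import numpy as np

    rng = np.random.default_rng(0)
    a = rng.random((n, n))
    b = rng.random((n, n))
    seconds = best_time(lambda: a @ b, repeats)
    return gemm_flop_count(n) / seconds / 1e9
```

Calling numpy_gemm_gflops() on one node and comparing the result with the 
CPU's quoted peak should give a rough idea of how far the BLAS is from the 
hardware limit. numpy.show_config() reports which BLAS NumPy was built 
against.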

Thanks for the help,

Solal
