Hi Peter,

Thanks for the links and pointers.

From the information provided by Microway:
- 9x Intel 6540 = 11.25 TFlops (CPU taken at median flops)
- 2x V100 = 14-16 TFlops

So theoretically the 2 GPUs should offer better performance, but not by as much as I've experienced. The issue must lie somewhere else.

I'll start profiling to check whether MPI is a bottleneck (it shouldn't be with only 18 ranks). I'll also benchmark my BLAS to see how it compares with other measurements found online. From what I understand, since PyFR is written in Python, it relies heavily on BLAS for compute performance.

Thanks for the help,
Solal
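
For reference, the theoretical comparison above can be worked through as a quick sketch. It only uses the peak figures quoted in this message (11.25 TFlops for the 9 CPUs, 14-16 TFlops for the 2 V100s); the variable names are illustrative, and the result is an upper-bound estimate from vendor numbers, not a measurement.

```python
# Peak figures quoted in the message (TFlops); theoretical, not measured.
cpu_total_tflops = 11.25                     # 9 CPUs combined
n_cpus = 9
gpu_total_low, gpu_total_high = 14.0, 16.0   # 2x V100 combined

# Per-CPU peak implied by the quoted total.
per_cpu = cpu_total_tflops / n_cpus

# Speedup of the GPU node over the CPU nodes if both ran at peak.
ratio_low = gpu_total_low / cpu_total_tflops
ratio_high = gpu_total_high / cpu_total_tflops

print(f"per CPU: {per_cpu:.2f} TFlops")
print(f"expected GPU/CPU speedup: {ratio_low:.2f}x to {ratio_high:.2f}x")
```

On these numbers the GPUs should be roughly 1.2x to 1.4x faster at best, so a much larger observed gap would point to the CPU run falling well short of its peak.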
