Some additional information. The memory is broken up into chunks of 8MB and are executed in parallel using the fork join framework. So all cores are busy iterating over the memory.
On Sunday, January 14, 2018 at 8:44:00 PM UTC+2, Peter Veentjer wrote: > > I'm working on some very simple aggregations on huge chunks of offheap > memory (500GB+) for a hackaton. This is done using a very simple stride; > every iteration the address increases with 20 bytes. So the prefetcher > should not have any problems with it. > > According to my calculations I'm currently processing 35 GB/s. However I'm > not sure if I'm close to the maximum bandwidth of this machine. Specs: > 2133 MHz, 24x HP 32GiB 4Rx4 PC4-2133P > 2x Intel(R) *Xeon*(R) CPU E5-2687W v3, 3.10GHz, 10 cores per socket > > What is the best tool to determine the maximum bandwidth of a machine > running Linux (RHEL 7) > -- You received this message because you are subscribed to the Google Groups "mechanical-sympathy" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/d/optout.
