On Mon, 10 Nov 2008, Mel Gorman wrote:
> In the situation a caller allocates multiple buffers with
> get_hugepage_region(), all the buffers will align to the start of the
> hugepage. This means the buffers are almost certainly using the same
> cachelines and operations like copying between the arrays will be mysteriously
> slow in comparison to buffers allocated from malloc(). In STREAM, allocating
> the three arrays using get_huge_pages() was approximately 40% the throughput
> of malloc().
>
> When requested, this patch uses bytes that would normally be wasted to
> offset the buffer by some random cacheline. Effort is made to select a
> random cache-line to give good average performance.
>
> Signed-off-by: Mel Gorman <[EMAIL PROTECTED]>

Acked-by: Eric B Munson <[EMAIL PROTECTED]>
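
For readers of the archive, the idea in the quoted description can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code from the patch: the helper name pick_cacheline_offset() and its parameters are hypothetical, the huge page and cache-line sizes are passed in by the caller, and rand() stands in for whatever source of randomness the patch actually uses.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/*
 * Hypothetical helper (not part of libhugetlbfs): choose a byte offset,
 * a multiple of line_size, that fits inside the space left over when
 * len is rounded up to a whole number of huge pages.
 */
static size_t pick_cacheline_offset(size_t len, size_t hpage_size,
                                    size_t line_size)
{
    assert(hpage_size > 0 && line_size > 0);

    /* Size of the mapping once len is rounded up to whole huge pages. */
    size_t mapped = ((len + hpage_size - 1) / hpage_size) * hpage_size;

    /* Bytes past the end of the buffer that would otherwise be wasted. */
    size_t wasted = mapped - len;

    /* Number of whole cache lines that fit in the wasted bytes. */
    size_t lines = wasted / line_size;

    if (lines == 0)
        return 0; /* no slack, so the buffer stays hugepage-aligned */

    /* Pick one of the lines + 1 possible starting lines (0..lines). */
    return ((size_t)rand() % (lines + 1)) * line_size;
}
```

A caller would add the returned offset to the start of the hugepage-backed mapping before handing the pointer out, and would need to subtract it again before freeing the region. Since the offset never exceeds the wasted bytes, the buffer still fits inside the original mapping.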
_______________________________________________
Libhugetlbfs-devel mailing list
Libhugetlbfs-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/libhugetlbfs-devel