On Tuesday, 26 January 2016 at 22:36:31 UTC, H. S. Teoh wrote:
So the moral of the story is: avoid large numbers of small allocations. If you have to do it, consider consolidating your allocations into a series of allocations of large(ish) buffers instead, and taking slices of the buffers.

Thanks for sharing this, H. S. Teoh. I tried replacing the individual allocations with a Region from std.experimental.allocator (with FreeList and Quantizer layered on top), then deallocating everything in one go once I'm done with the data. It seems to be a little faster, but I haven't had time to measure it properly.
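
Roughly this shape, in case anyone wants to try the same thing. An untested sketch from memory, so the building-block constructor details may be off, and the sizes are made up:

    import std.experimental.allocator : makeArray;
    import std.experimental.allocator.building_blocks.free_list : FreeList;
    import std.experimental.allocator.building_blocks.quantizer : Quantizer;
    import std.experimental.allocator.building_blocks.region : Region;
    import std.experimental.allocator.mallocator : Mallocator;

    void main()
    {
        // Region: bump-pointer arena; deallocations are no-ops and the
        // whole arena is released at once, which is essentially the
        // "consolidate and slice" idea from above.
        // FreeList: recycles blocks up to 128 bytes rather than bumping
        // the Region for every tiny request.
        // Quantizer: rounds request sizes up to 64-byte multiples so
        // near-identical sizes can share free-list entries.
        alias Alloc = Quantizer!(
            FreeList!(Region!Mallocator, 0, 128),
            (size_t n) => (n + 63) & ~size_t(63));

        Alloc alloc;
        alloc.parent.parent = Region!Mallocator(64 * 1024 * 1024); // 64 MiB arena

        auto field = alloc.makeArray!char(40); // one of many small allocations
        // ... parse into field, keep slices of it, repeat ...
    }   // arena freed here in one go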

Just came across this C++ project, which seems to have astonishing performance: 7 minutes to read a terabyte, and 2.5 to 4.5 GB/sec reading a file cold. That's pretty impressive. (Obviously they read in parallel, but I haven't yet read the source to see what the other tricks might be.)
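
Even without reading their source, the parallel-chunk part is straightforward to sketch in D with std.parallelism. Something like the following (hypothetical and untested; a real CSV reader would also have to re-align each chunk to a record boundary and actually parse fields, and for terabyte files you'd stream each chunk in fixed-size blocks rather than one buffer per task):

    import core.atomic : atomicOp;
    import std.algorithm.searching : count;
    import std.parallelism : parallel, totalCPUs;
    import std.range : iota;
    import std.stdio : File, writeln;

    // usage: ./prog file.csv
    void main(string[] args)
    {
        immutable path = args[1];
        immutable fileSize = File(path).size;
        immutable nChunks = totalCPUs;
        immutable chunkLen = (fileSize + nChunks - 1) / nChunks;

        shared size_t newlines;
        foreach (i; parallel(iota(nChunks)))
        {
            auto f = File(path);                      // one handle per task
            f.seek(cast(long)(i * chunkLen));
            auto buf = new ubyte[cast(size_t) chunkLen];
            auto got = f.rawRead(buf);                // short read at EOF is fine
            atomicOp!"+="(newlines, got.count('\n')); // stand-in for real parsing
        }
        writeln(newlines, " lines");
    }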

It would be nice to be able to match that in D, though practically speaking it's probably easiest just to wrap it:

http://www.wise.io/tech/paratext

https://github.com/wiseio/paratext
