On 05/02/2015 14:44, Till Oliver Knoll wrote:
Am 05.02.2015 um 14:25 schrieb Till Oliver Knoll
<[email protected] <mailto:[email protected]>>:
...
Does it make sense to guarantee/enforce "sequential (exclusive)
access to the harddisk" on application level, or would I re-invent
functionality already present in the underlying OS/disk driver (and
maybe even sacrifice performance)?
I eventually found a link which seems to confirm that it would be best
to only have sequential read/write access with physically spinning
drives, that is, have some kind of "IO Manager" in the application:
http://www.tomshardware.co.uk/forum/251768-32-impact-concurrent-speed
Of course the tricky part then is that the Writer thread must not
block the Reader thread for too long, or the Work Queue would run
empty (and the worker threads would sit there idle).
Likewise the Writer thread must get enough chances to write, so
that the Result Queue does not grow too large ("memory constraints").
Probably some kind of prioritization scheme that takes the queue
counts into consideration is the answer - but that is not part of my
question; I am really just interested in whether concurrent read/write
access should be avoided in the first place these days (or not).
For SSDs it still might be okay (or even better?) to use concurrent
read/write access?
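The "IO Manager" idea above - serializing all disk access at the application
level - can be sketched by funneling every read and write request through a
single dedicated thread. This is a minimal illustrative Python sketch (the
class and method names are mine, not from any library); it only guarantees
that no two I/O operations run concurrently, and leaves the prioritization
question open:

```python
import queue
import threading

class IOManager:
    """Serializes all I/O requests through one worker thread, so reads
    and writes never hit the disk concurrently."""

    def __init__(self):
        self._requests = queue.Queue()
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def _run(self):
        while True:
            job = self._requests.get()
            if job is None:            # shutdown sentinel
                break
            func, args, done = job
            done.put(func(*args))      # run the I/O op, hand back the result

    def submit(self, func, *args):
        """Enqueue an I/O operation; returns a queue that will hold its result."""
        done = queue.Queue(maxsize=1)
        self._requests.put((func, args, done))
        return done

    def shutdown(self):
        self._requests.put(None)
        self._thread.join()
```

A caller (Reader or Writer thread) would wrap each disk operation in a
callable and submit it; because a single thread drains the request queue,
the disk only ever sees one operation at a time.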
The usual answer is "it depends"..
It depends on how much data you access per read/write. It also
depends on the underlying filesystem, the size of the files, and how
many files you are dealing with.
It also depends on your disk array - whether you have one disk or
several - and on the capacity of the disks, which affects the number of
read/write heads each disk has. The NCQ* implementation and the amount
of cache RAM in a disk also make a difference.
Then it depends on whether you need transactional writes: does each
write need to sync immediately?
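The "sync immediately" case maps to an explicit fsync after the write. A
minimal Python sketch (the helper name is mine) of a write that does not
return until the OS has been asked to push the data to the physical medium:

```python
import os

def durable_write(path, data):
    """Write data and force it toward the physical medium before returning."""
    with open(path, "wb") as f:
        f.write(data)
        f.flush()             # push Python's userspace buffer to the OS
        os.fsync(f.fileno())  # ask the OS to flush its cache to the disk
```

Each fsync costs a round trip to the device, which is exactly why
transactional writes change the performance picture so much on spinning
media.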
If you are on Linux, you already get a lot of optimization out of the
box; it is typically much better than any other OS in this respect. But
even within Linux the filesystem used makes a difference - for example,
some filesystems are good with lots of small files. Sometimes file
deletion is the bottleneck.
In the end, with spinning drives, the underlying physics of the
spinning media and the moving read/write heads dominates.
SSDs are typically ~100 times faster at seek operations. Even there, the
controllers cause significant differences depending on the read/write
patterns.
So before optimizing I'd benchmark; especially on Linux the filesystem
layer typically does a decent job already.
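Benchmarking the candidate access patterns is straightforward to sketch. The
following Python harness (function names are mine, purely illustrative) times
two write strategies against a scratch file; note that OS caching can mask
differences, so on real hardware you would want larger sizes and repeated runs:

```python
import os
import tempfile
import time

def bench(write_fn):
    """Time one write strategy against a scratch file; returns seconds."""
    fd, path = tempfile.mkstemp()
    os.close(fd)
    try:
        start = time.perf_counter()
        write_fn(path)
        return time.perf_counter() - start
    finally:
        os.remove(path)

def many_small_writes(path):
    with open(path, "wb") as f:
        for _ in range(1000):
            f.write(b"x" * 1024)          # 1000 writes of 1 KiB each

def one_large_write(path):
    with open(path, "wb") as f:
        f.write(b"x" * (1024 * 1000))     # a single 1000 KiB write
```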
But if you want maximum IO performance, the rule of thumb is to group
your reads and writes, and read/write as much data as possible at once.
Even SSDs typically favor this. In highly parallel supercomputer
settings different rules may apply.
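Grouping writes can be sketched as a small batching layer: accumulate the
small results in memory and hand the disk one large sequential write once a
threshold is reached. A minimal Python illustration (the class name and the
1 MiB default are my own choices, not a recommendation):

```python
class BatchingWriter:
    """Accumulates small writes and flushes them as one large write."""

    def __init__(self, f, batch_size=1 << 20):   # flush at ~1 MiB by default
        self._f = f
        self._buf = bytearray()
        self._batch = batch_size

    def write(self, data):
        self._buf += data
        if len(self._buf) >= self._batch:
            self.flush()

    def flush(self):
        if self._buf:
            self._f.write(bytes(self._buf))      # one big sequential write
            self._buf.clear()
```

In the Writer-thread scenario above, something like this would let the
Result Queue be drained in memory-sized chunks rather than one tiny write
per result.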
Just my 2 cents,
Harri
*http://en.wikipedia.org/wiki/Native_Command_Queuing
_______________________________________________
Interest mailing list
[email protected]
http://lists.qt-project.org/mailman/listinfo/interest