Parallelism in query execution

Andy Seaborne Fri, 29 Jan 2016 03:50:55 -0800

Rob,

In dotNetRDF, there is parallel execution, isn't there?

I have been thinking (toying with) the idea of parallel execution and Iwondered what unit of work is for the parallelism in dotNetRDF.

What little thinking I've done suggests that tapping into theparallelism in java streams is not the right way to do it (which is ashame as that's less work). It needs more control and probably largerunits of work. There is a danger that small/fast queries slow down dueto too much thinking.

It needs more control as well to limit how much of the machine it willtake over because, in Fuseki, it might lead to starvation of otherrequests. As some usage is " many clients, many small requests",parallelism can impact the the system negatively as well as positively.At some point, the limitation will be the connection of CPUs to RAMrather than cycles.


    Andy

Historical note: RDQL had true parallelism once-upon-a-time. An RDQLquery is a BGP+Filter and not more. The filter ran on a separate threadto the BGP solver. Timed gain ... just 10%. This was on a earlygeneration 2 CPU, 2 processor machine so the cost of threading was huge.Most users then did not have a multi-anything machine. It lead to lotsof problems with thread management when Java wasn't what it is today.

Parallelism in query execution

Reply via email to