On Wed, Jun 26, 2013 at 7:01 AM, k...@rice.edu <k...@rice.edu> wrote:

> On Wed, Jun 26, 2013 at 03:47:43PM +0200, Markus Wanner wrote:
> > On 06/25/2013 11:52 PM, Kevin Grittner wrote:
> > > At least until we have parallel
> > > query execution.  At *that* point this all changes.
> >
> > Can you elaborate on that, please? I currently have a hard time
> > imagining how partitions can help performance in that case, either. At
> > least compared to modern RAID and read-ahead capabilities.
> >
> > After all, RAID can be thought of as hash partitioning with a very weird
> > hash function. Or maybe rather range partitioning on an internal key.
> >
> > Put another way: ideally, the system should take care of optimally
> > distributing data across its physical storage itself. If you need to do
> > partitioning manually for performance reasons, that's actually a
> > deficiency of it, not a feature.
>

+1, except I'm looking at it from a CPU perspective not a disk perspective.

I would hope not to need to partition my data at all in order to enable
parallel execution.  I certainly would hope not to redo that partitioning
just because I got new hardware with a different number of CPUs.


> Hi Markus,
>
> I think he is referring to the fact that with parallel query execution,
> multiple partitions can be processed simultaneously instead of serially
> as they are now with the resulting speed increase.
>


Hopefully parallel execution can divide the query into multiple "chunks" on
its own, without me needing to micromanage it.

Cheers,

Jeff

Reply via email to