Re: [PERFORM] Read/Write block sizes

Ron Thu, 25 Aug 2005 16:50:24 -0700

At 04:49 PM 8/25/2005, Chris Browne wrote:

[EMAIL PROTECTED] (Ron) writes:
> At 03:45 PM 8/25/2005, Josh Berkus wrote:
>> > Ask me sometime about my replacement for GNU sort.  It uses the
>> > same sorting algorithm, but it's an order of magnitude faster due
>> > to better I/O strategy.  Someday, in my infinite spare time, I
>> > hope to demonstrate that kind of improvement with a patch to pg.
>>
>>Since we desperately need some improvements in sort performance, I
>>do hope you follow up on this.
>
> I'll generalize that.  IMO we desperately need any and all
> improvements in IO performance.  Even more so than we need
> improvements in sorting or sorting IO performance.


That's frankly a step backwards.  Feel free to "specialise" that instead.


We can agree to disagree, I'm cool with that.

I'm well aware that a Systems Approach to SW Architecture is not always popular in the OpenSource world. Nonetheless, my POV is that if we want to be taken seriously and beat "the big boys", we have to do everything smarter and faster, as well as cheaper, than they do. You are not likely to be able to do that consistently without using some of the "icky" stuff one is required to study as part of formal training in the Comp Sci and SW Engineering fields.

A patch that improves some specific aspect of performance is a thousand times better than any sort of "desperate desire for any and
all improvements in I/O performance."

Minor twisting of my words: substituting "desire" for "need". The need is provable. Just put "the big 5" (SQL Server, Oracle, DB2, MySQL, and PostgreSQL) into some realistic benches to see that.

Major twisting of my words: the apparent implication by you that I don't appreciate improvements in the IO behavior of specific things like sorting as much as I'd appreciate more "general" IO performance improvements. Performance optimization is best done as an iterative improvement process that starts with measuring where the need is greatest, then improving that greatest need by the most you can, then repeating the whole cycle. _Every_ improvement in such a process is a specific improvement, even if the improvement is a decision to re-architect the entire product to solve the current biggest issue. Improving sorting IO is cool. OTOH, if pg's biggest IO problems are elsewhere, then the amount of overall benefit we will get from improving sorting IO is going to be minimized until we improve the bigger problem(s). Amdahl's Law.
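
To put illustrative numbers on the Amdahl's Law point, here is a minimal sketch in Python. The fractions and speedups are made up for the example and are not measurements of pg; the function name overall_speedup is likewise just a label chosen here.

```python
def overall_speedup(fraction, local_speedup):
    """Amdahl's Law: overall speedup when `fraction` of total runtime
    is made `local_speedup` times faster and the rest is unchanged."""
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be between 0 and 1")
    if local_speedup <= 0:
        raise ValueError("local_speedup must be positive")
    return 1.0 / ((1.0 - fraction) + fraction / local_speedup)


# Hypothetical workload: sorting IO is 10% of runtime, some other
# IO bottleneck is 60% of runtime. These shares are assumptions.
sort_case = overall_speedup(0.10, 10.0)   # 10x faster sorting IO
other_case = overall_speedup(0.60, 2.0)   # 2x faster on the bigger bottleneck

print(f"10x on a 10% component: {sort_case:.2f}x overall")
print(f" 2x on a 60% component: {other_case:.2f}x overall")
```

Under these assumed shares, a tenfold win on the small component yields roughly a 1.10x overall gain, while a mere doubling on the large component yields roughly 1.43x. That is the reason to measure first and attack the largest share of runtime before the smaller ones.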

The "specialized patch" is also pointedly better in that a *confidently submitted* patch is likely to be way better than any sort of "desperate clutching at whatever may come to hand."

Another distortion of my statement and POV. I never suggested nor implied any sort of "desperate clutching...". We have _measurable_ IO issues that need to be addressed in order for pg to be a better competitor in the marketplace. Just as we do with sorting performance.

Far too often, I see people trying to address performance problems via the "desperate clutching at whatever seems near to hand," and that generally turns out very badly as a particular result of the whole "desperate clutching" part.
If you can get a sort improvement submitted, that's a concrete improvement...

As I said, I'm all in favor of concrete, measurable improvement. I do not think I ever stated I was in favor of anything else.

You evidently are mildly ranting because you've seen some examples of poor SW Engineering Discipline/Practice by people with perhaps inadequate skills for the issues they were trying to address. We all have. "90% of everything is Jreck (eg of too low a quality)."

OTOH, I do not think I've given you any reason to think I lack such Clue, nor do I think my post was advocating such thrashing.

My post was intended to say that we need an Overall Systems Approach to pg optimization rather than just applying what compiler writers call "peephole optimizations" to pg. No more, no less.


I apologize if I somehow misled you,
Ron Peacetree



