Re: [PERFORM] New to PostgreSQL, performance considerations

Ron Fri, 15 Dec 2006 07:05:19 -0800

At 09:23 AM 12/15/2006, Merlin Moncure wrote:

On 12/15/06, Ron <[EMAIL PROTECTED]> wrote:
It seems unusual that code generation options which give access to
more registers would ever result in slower code...
The slower is probably due to the unroll loops switch which canactually hurt code due to the larger footprint (less cache coherency).

I have seen that effect as well occasionally in the last few decades;-) OTOH, suspicion is not _proof_; and I've seen other"optimizations" turn out to be "pessimizations" over the years as well.

The extra registers are not all that important because of pipeliningand other hardware tricks.

No. Whoever told you this or gave you such an impression wasmistaken. There are many instances of x86 compatible code that get30-40% speedups just because they get access to 16 rather than 8 GPRswhen recompiled for x84-64.

Pretty much all the old assembly strategies such as forcing localvariables to registers are basically obsolete...especially withregards to integer math.

Again, not true. OTOH, humans are unlikely at this point to be ableto duplicate the accuracy of the compiler's register coloringalgorithms. Especially on x86 compatibles. (The flip side is that_expert_ humans can often put the quirky register set and instructionpipelines of x86 compatibles to more effective use for a specificchunk of code than even the best compilers can.)

As I said before, modern CPUs are essentially RISC engines with aCISC preprocessing engine laid in top.

I'm sure you meant modern =x86 compatible= CPUs are essentially RISCengines with a CISC engine on top. Just as "all the world's not aVAX", "all CPUs are not x86 compatibles". Forgetting this hasoccasionally cost folks I know...

Things are much more complicated than they were in the old dayswhere you could count instructions for the assembly optimization process.


Those were the =very= old days in computer time...

I suspect that there is little or no differnece between the-march=686 and the various specifc archicectures.

There should be. The FSF compiler folks (and the rest of theindustry compiler folks for that matter) are far from stupid. Theyare not just adding compiler switches because they randomly feel like it.

Evidence suggests that the most recent CPUs are in need of =more=arch specific TLC compared to their ancestors, and that this trend isnot only going to continue, it is going to accelerate.

Did anybody think to look at the binaries and look for the amount ofdifferences? I bet you code compiled for march=opteron will justfine on a pentium 2 if compiled
for 32 bit.

Sucker bet given that the whole point of a 32b x86 compatible is tobe able to run code on any I32 ISA. CPU.OTOH, I bet that code optimized for best performance on a P2 is notgetting best performance on a P4. Or vice versa. ;-)


The big arch specific differences in Kx's are in 64b mode.  Not 32b

Ron Peacetree.


---------------------------(end of broadcast)---------------------------
TIP 3: Have you checked our extensive FAQ?

              http://www.postgresql.org/docs/faq

Re: [PERFORM] New to PostgreSQL, performance considerations

Reply via email to