Re: [PERFORM] Hardware/OS recommendations for large databases

Ron Fri, 18 Nov 2005 12:34:31 -0800

Breaking the ~120MBps pg IO ceiling by any meansis an important result. Particularly when youget a ~2x improvement. I'm curious how far wecan get using simple approaches like this.


At 10:13 AM 11/18/2005, Luke Lonergan wrote:

Dave,
On 11/18/05 5:00 AM, "Dave Cramer" <[EMAIL PROTECTED]> wrote:
>
> Now there's an interesting line drawn in the sand. I presume you have
> numbers to back this up ?
>
> This should draw some interesting posts.

Part 2: The answer

System A:
This system is running RedHat 3 Update 4, with a Fedora 2.6.10 Linux kernel.
On a single table with 15 columns (the BizgresIVP) at a size double memory (2.12GB), Postgres8.0.3 with Bizgres enhancements takes 32 secondsto scan the table: thats 66 MB/s. Not theefficiency Id hope from the onboard SATAcontroller that Id like, I would have expectedto get 85% of the 100MB/s raw read performance.

Have you tried the large read ahead trick withthis system? It would be interesting to see howmuch it would help. It might even be worth it todo the experiment at all of [default, 2x default,4x default, 8x default, etc] read ahead untileither a) you run out of resources to support thedesired read ahead, or b) performance levelsoff. I can imagine the results being very enlightening.

System B:
This system is running an XFS filesystem, andhas been tuned to use very large (16MB)readahead. Its running the Centos 4.1 distro,which uses a Linux 2.6.9 kernel.
Same test as above, but with 17GB of data takes69.7 seconds to scan (!) Thats 244.2MB/s,which is obviously double my earlier point of110-120MB/s. This system is running with a 16MBLinux readahead setting, lets try it with thedefault (I think) setting of 256KB AHA! Now we get 171.4 seconds or 99.3MB/s.

The above experiment would seem useful here as well.

Summary:
<cough, cough> OK you can get more I/Obandwidth out of the current I/O path forsequential scan if you tune the filesystem forlarge readahead. This is a cheap alternative tooverhauling the executor to use asynch I/O.
Still, there is a CPU limit here this is notI/O bound, it is CPU limited as evidenced by thesensitivity to readahead settings. If thefilesystem could do 1GB/s, you wouldnt go any faster than 244MB/s.
- Luke

I respect your honesty in reporting results thatwere different then your expectations orpreviously taken stance. Alan Stange's commentre: the use of direct IO along with your commentsre: async IO and mem copies plus the results ofthese experiments could very well point usdirectly at how to most easily solve pg's CPU boundness during IO.


[HACKERS] are you watching this?

Ron



---------------------------(end of broadcast)---------------------------
TIP 3: Have you checked our extensive FAQ?

              http://www.postgresql.org/docs/faq

Re: [PERFORM] Hardware/OS recommendations for large databases

Reply via email to