Re: [HACKERS] Use pread and pwrite instead of lseek + write and read

Oskari Saarenmaa Wed, 14 Sep 2016 23:57:13 -0700

17.08.2016, 22:11, Tom Lane wrote:

Robert Haas <robertmh...@gmail.com> writes:

I don't understand why you think this would create non-trivial
portability issues.


The patch as submitted breaks entirely on platforms without pread/pwrite.
Yes, we can add a configure test and some shim functions to fix that,
but the argument that it makes the code shorter will get a lot weaker
once we do.

I posted an updated patch which just calls lseek + read/write; the code's still a lot shorter.

I agree that adding such functions is pretty trivial, but there are
reasons to think there are other hazards that are less trivial:

First, a self-contained shim function will necessarily do an lseek every
time, which means performance will get *worse* not better on non-pread
platforms.  And yes, the existing logic to avoid lseeks fires often enough
to be worthwhile, particularly in seqscans.

This will only regress on platforms without pread. The only relevant such platform appears to be Windows, which has equivalent APIs.

FWIW, I ran the same pgbench benchmarks on my Linux system where I always used lseek() + read/write instead of pread and pwrite - they ran slightly faster than the previous code which saved seek positions, but I suppose a workload with lots of seqscans could be slower.

Unfortunately I didn't save the actual numbers anywhere, but I can rerun the benchmarks if you're interested. The numbers were pretty stable across multiple runs.
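For reference, here is a minimal sketch of the kind of self-contained shim being discussed, not the code from the patch. The names shim_pread, shim_pwrite and shim_roundtrip_check are made up for illustration. It shows why such a fallback has to seek on every call: it has no record of where the file position currently is.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>
#include <sys/types.h>
#include <unistd.h>

/*
 * Hypothetical fallback for a platform without pread().  It cannot know
 * the current file position, so it has to lseek() on every call.
 */
ssize_t
shim_pread(int fd, void *buf, size_t nbytes, off_t offset)
{
    if (lseek(fd, offset, SEEK_SET) < 0)
        return -1;
    return read(fd, buf, nbytes);
}

/* Same idea for writes (hypothetical name). */
ssize_t
shim_pwrite(int fd, const void *buf, size_t nbytes, off_t offset)
{
    if (lseek(fd, offset, SEEK_SET) < 0)
        return -1;
    return write(fd, buf, nbytes);
}

/* Write four bytes at an offset and read them back; 0 on success. */
int
shim_roundtrip_check(void)
{
    char    path[] = "/tmp/shim_preadXXXXXX";
    char    buf[4] = {0};
    int     fd = mkstemp(path);
    ssize_t nw, nr;

    assert(fd >= 0);
    unlink(path);               /* file goes away when fd is closed */
    nw = shim_pwrite(fd, "page", 4, 8192);
    nr = shim_pread(fd, buf, 4, 8192);
    close(fd);
    return (nw == 4 && nr == 4 && memcmp(buf, "page", 4) == 0) ? 0 : -1;
}
```

Unlike the real pread/pwrite, these shims move the descriptor's file position and are not atomic with respect to other users of the same descriptor.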

Second, I wonder whether this will break any kernel's readahead detection.
I wouldn't be too surprised if successive reads (not preads) without
intervening lseeks are needed to trigger readahead on at least some
platforms.  So there's a potential, both on platforms with pread and those
without, for this to completely destroy seqscan performance, with
penalties very far exceeding what we might save by avoiding some kernel
calls.

At least Linux and FreeBSD don't seem to care how and why you read pages; they'll do readahead regardless of the way you read files and extend the readahead once you access previously readahead pages. They disable readahead only if fadvise(POSIX_FADV_RANDOM) has been used.

I'd expect any kernel that implements mmap to also implement readahead based on page usage rather than the seek position. Do you know of a kernel that would actually use the seek position for readahead?

I'd be more excited about this if the claimed improvement were more than
1.5%, but you know as well as I do that that's barely above the noise
floor for most performance measurements.  I'm left wondering why bother,
and why take any risk of de-optimizing on some platforms.

I think it makes sense to try to optimize for the platforms that people actually use for performance critical workloads, especially if it also allows us to simplify the code and remove more lines than we add. It's nice if the software still works on legacy platforms, but I don't think we should be concerned about a hypothetical performance impact on platforms no one uses in production anymore.


/ Oskari


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers
