On 19/03/2025 14:25, Frédéric Yhuel wrote:


On 3/14/25 09:43, Frédéric Yhuel wrote:
One thing I've noticed is that posix_fadvise(,,POSIX_FADV_WILLNEED) isn't always non-blocking on Linux. As Ted Ts'o explains in this old thread[1], it blocks when the request queue fills up.

When posix_fadvise() blocks, it doesn't seem to be completely off-cpu (but mostly, at least on my machine), and I assume that this is the reason for the sentence "A value higher than needed to keep the disks busy will only result in extra CPU overhead" in the documentation for effective_io_concurrency? My guess is that Linux uses a spinlock somewhere, and that this explains the cpu overhead.

Also, on Linux, it seems we can control the size of the request queue with the /sys/block/XXX/queue/nr_requests setting. On my machine, it is set to 64 by default. When I set it up to 128, postgres spends less time on posix_fadvise(), and correspondingly more on pread().



If you're still on this topic, some additions:

* io_uring_prep_fadvise() syscall is maybe interesting to look at, I don't know what has been done here within Andres work nor if it is applicable (I didn't look at this code yet), but "git grep" didn't raise anything.
   This is supposed to be "async posix_fadvise"

* nr_request, I have 1023 (nvme) with Linux 6.12.22-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.12.22-1 (2025-04-10) x86_64.

Note that the scheduler also change the behavior of the queue, with mq-deadline there is 25% of the queue reserved for sync requests. See https://github.com/torvalds/linux/commit/07757588e5076748308dd95ee2e3cd0b82ebb8c4

* https://wiki.postgresql.org/wiki/Readahead - if you want to read/check/enhance its content.

I hope it helps.

---
Cédric Villemain +33 6 20 30 22 52
https://www.Data-Bene.io
PostgreSQL Support, Expertise, Training, R&D



Reply via email to