On Tue, Aug 9, 2022 at 10:57 AM Thomas Munro <thomas.mu...@gmail.com> wrote: > > OK, so there aren't many places in 0001 that error out where we > previously did not.
Well, that's not true I believe. The 0001 patch introduces get_dirent_type() with elevel ERROR which means that lstat failures are now reported as ERRORs with ereport(ERROR, as opposed to continuing on the HEAD. - if (lstat(path, &statbuf) == 0 && !S_ISREG(statbuf.st_mode)) + if (get_dirent_type(path, mapping_de, false, ERROR) != PGFILETYPE_REG) continue; - if (lstat(path, &statbuf) == 0 && !S_ISREG(statbuf.st_mode)) + if (get_dirent_type(path, snap_de, false, ERROR) != PGFILETYPE_REG) { so on. The main idea of using get_dirent_type() instead of lstat or stat is to benefit from avoiding system calls on platforms where the directory entry type is stored in dirent structure, but not to cause errors for lstat or stat system calls failures where we previously didn't. I think 0001 must be just replacing lstat or stat with get_dirent_type() with elevel ERROR if lstat or stat failure is treated as ERROR previously, otherwise with elevel LOG. I modified it as attached. Once this patch gets committed, then we can continue the discussion as to whether we make elog-LOG to elog-ERROR or vice-versa. I'm tempted to use get_dirent_type() in RemoveXlogFile() but that requires us to pass struct dirent * from RemoveOldXlogFiles() (as attached 0002 for now), If done, this avoids one lstat() system call per WAL file. IMO, that's okay to pass an extra function parameter to RemoveXlogFile() as it is a static function and there can be many (thousands of) WAL files at times, so saving system calls (lstat() * number of WAL files) is definitely an advantage. Thoughts? -- Bharath Rupireddy RDS Open Source Databases: https://aws.amazon.com/rds/postgresql/
v14-0001-Make-more-use-of-get_dirent_type.patch
Description: Binary data
v14-0002-Replace-lstat-with-get_dirent_type-in-xlog.c.patch
Description: Binary data