Hi, hackers! Currently I'm working on page-level incremental backups using the WAL-G codebase [0], and I have two questions that I cannot resolve myself.
An incremental backup is a set of changes that should be applied over a preexisting backup. I use the page LSN to decide whether a page should be backed up or not.

Question 1. FSM and VM.
As you can see here [1], FSM and VM files are exempt from incremental tracking and are backed up as whole files. I did it this way because sanity checks [2] of page headers flagged a lot of "invalid" pages in FSM and VM files. However, some pages in those files do seem to have valid headers with sane LSNs. Can I use LSNs as a history marker on FSM and VM pages? On a 1 TB backup I get about 150 MB of FSM+VM, which is kind of a lot.

Question 2. File dates.
Is it safe to use the file modification time to detect that a file has changed since the previous backup? If a file's modification time is earlier than the start of the previous backup, I just add it to the "skip list" [3].
My assumption is that every time a file is changed in the filesystem, its modification time is updated to a higher value. Is this assumption valid for most commonly used platforms and filesystems? Or can I check for this "capability" of the FS?

Thank you for your attention. I'll be glad to receive any information or pointers on this matter.

Best regards, Andrey Borodin, Yandex.

[0] https://github.com/wal-g/wal-g/pull/29
[1] https://github.com/wal-g/wal-g/pull/29/files#diff-d77406e827f5f947d4d4a1e6d76c1f4eR114
[2] https://github.com/wal-g/wal-g/pull/29/files#diff-d77406e827f5f947d4d4a1e6d76c1f4eR50
[3] https://github.com/wal-g/wal-g/pull/29/files#diff-f5c8f0067297f98eb5acc6e2c6b1b234R87
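
P.S. To make the two checks in questions 1 and 2 concrete, here is a minimal sketch in Go. It is not the code from the pull request: the function names (pageLSN, pageChangedSince, fileMaybeChanged) are hypothetical, and the comparison rule (include a page when its LSN is at or past the start LSN of the previous backup) is an assumption made for illustration. It also assumes a little-endian host, since pd_lsn is stored in host byte order as a high and a low 32-bit half at the start of the page header.

```go
package main

import (
	"encoding/binary"
	"fmt"
	"time"
)

// pageLSN reads pd_lsn from the first 8 bytes of a page header.
// pd_lsn is two 32-bit halves (high half first) in host byte order;
// little-endian is assumed here. The caller must pass at least 8 bytes.
func pageLSN(page []byte) uint64 {
	hi := binary.LittleEndian.Uint32(page[0:4])
	lo := binary.LittleEndian.Uint32(page[4:8])
	return uint64(hi)<<32 | uint64(lo)
}

// pageChangedSince reports whether a page belongs in the increment.
// Assumption: a page is included when its LSN is at or past the
// start LSN of the previous backup.
func pageChangedSince(page []byte, prevBackupStartLSN uint64) bool {
	return pageLSN(page) >= prevBackupStartLSN
}

// fileMaybeChanged is the file-level pre-filter from question 2.
// A file whose mtime is strictly before the start of the previous
// backup goes to the skip list; this is only safe if the filesystem
// reliably advances mtime on every write.
func fileMaybeChanged(modTime, prevBackupStart time.Time) bool {
	return !modTime.Before(prevBackupStart)
}

func main() {
	// Build a fake 8 kB page whose header carries LSN 1/2000.
	page := make([]byte, 8192)
	binary.LittleEndian.PutUint32(page[0:4], 0x1)
	binary.LittleEndian.PutUint32(page[4:8], 0x2000)

	lsn := pageLSN(page)
	fmt.Printf("page LSN: %X/%X\n", uint32(lsn>>32), uint32(lsn))
	fmt.Println("in increment:", pageChangedSince(page, 0x100001000))

	prevStart := time.Date(2017, 8, 1, 0, 0, 0, 0, time.UTC)
	oldFile := prevStart.Add(-time.Hour)
	fmt.Println("file needs scan:", fileMaybeChanged(oldFile, prevStart))
}
```

Note that a page with a zeroed header reports LSN 0 and would be skipped by this comparison, so a real implementation probably has to treat pages that fail the header sanity check as changed.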