> [...] This specific filesystem is used to store a borg repository.
> All files are ~500 MiB in size. Consequently, I expect ~20-40k of
> these files. [...]
This should not be a problem for the S3QL database size. (I suspect the
uncompressed size of the DB would be < 100 MiB.)

> I have not (yet) profiled the file access patterns exactly, but I know
> that all new writes are strictly sequential and files are never
> rewritten, but accessing a borg repository causes many small random
> reads with no discernible pattern.

I do not know the specifics of the content-defined chunking borg uses,
but if it is similar to restic's implementation
(https://godoc.org/github.com/restic/chunker), then chunks will be
between 512 KiB and 8 MiB. Let's say that compression can reduce that by
a factor of two, so the chunks borg needs to access are between ~256 KiB
and ~4 MiB. A maximum S3QL block size of 5 MiB instead of the default of
10 MiB might therefore be a better fit (see the example invocation at
the end of this message). Since your file system has relatively few
inodes (~40k inodes, ~40k names, 4M blocks), this should be fine for the
S3QL database.

Besides the maximum block size, what would really improve random
read/write performance is a dedicated SSD for the S3QL cache (also
sketched below).

> [...] Has anybody tried to improve s3ql's caching mechanism to allow
> partial download of blocks?

Not that I know of. Any such implementation would have to be rock solid
with regard to data integrity (which, AFAIK, has priority over
performance for S3QL) and would have to survive an OS crash at any point
in time. And since blocks are (optionally) compressed and encrypted, it
is not easy to discern the required byte range to request from the
object storage…
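
To make the block-size suggestion concrete: if I remember the option
correctly, mkfs.s3ql takes the maximum object size in KiB (10240 being
the default), so it would look roughly like this (the storage URL is
just a placeholder):

    mkfs.s3ql --max-obj-size 5120 s3://my-bucket/borg-fs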
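
Similarly, the cache would be pointed at the SSD at mount time;
--cachedir and --cachesize (in KiB) are the options as I recall them,
and the path and size here are only examples (~20 GiB of cache):

    mount.s3ql --cachedir /mnt/ssd/s3ql-cache --cachesize 20971520 \
        s3://my-bucket/borg-fs /mnt/borg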
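
To illustrate that last point with a rough sketch (zlib and AES-CTR as
stand-ins, via the Python "cryptography" package; S3QL's actual storage
format differs, and the helper names here are made up): once a block is
compressed, there is no fixed mapping from a plaintext offset to an
offset in the stored object, so serving even a small read means fetching
and unpacking the whole object:

    # Rough sketch, NOT S3QL's real format: zlib + AES-CTR stand-ins.
    import os, zlib
    from cryptography.hazmat.primitives.ciphers import Cipher, algorithms, modes

    key, nonce = os.urandom(32), os.urandom(16)

    def store_block(plaintext: bytes) -> bytes:
        # What ends up in the object store: compressed, then encrypted.
        enc = Cipher(algorithms.AES(key), modes.CTR(nonce)).encryptor()
        return enc.update(zlib.compress(plaintext)) + enc.finalize()

    def read_range(obj: bytes, offset: int, length: int) -> bytes:
        # CTR mode by itself would allow seeking into the ciphertext,
        # but the zlib stream underneath gives no way to map a plaintext
        # offset to a ciphertext offset, so we must fetch and unpack the
        # WHOLE object to serve even a 4 KiB read.
        dec = Cipher(algorithms.AES(key), modes.CTR(nonce)).decryptor()
        plaintext = zlib.decompress(dec.update(obj) + dec.finalize())
        return plaintext[offset:offset + length]

    block = os.urandom(1024) * 4096            # one ~4 MiB stored chunk
    obj = store_block(block)
    assert read_range(obj, 123456, 4096) == block[123456:123456 + 4096]

Supporting true partial downloads would presumably mean storing some
kind of per-block index of compression restart points, which is exactly
the sort of extra metadata and integrity complexity I mean above.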
