Hi,

A very simple reproducer bash script using the sqlite3 CLI is appended at the end.

I am using WAL mode in a setting with sequential writes and many concurrent reads. Due to WAL mode the readers don't get blocked, which is great and since writes are sequential, they never get blocked either. However, I am seeing the WAL grow on every write without ever restarting at the beginning of the file if there is a constant influx of new reads (with limited lifetime). This causes the WAL file to grow to many GB within minutes, even if the database state fits into a few MB or even kB after closing all connections.

The output of "PRAGMA wal_checkpoint(PASSIVE);" usually looks like this: "0|123|123", which I interpret as the checkpointer being caught up on the current state. I believe the reason that new writes are appended at the end, instead of restarting the WAL, is that while reads are short-lived, there is always at least one going on, so the log of the last write has to be kept, which in turn prevents a reset of the WAL.

An example read (r) write (w) pattern could look like this (b: begin, e: end):
r1_b; w1; r2b; r1e; w2; r2b; w3; r3b; r2e ...

A solution could be to start a second WAL when the first one exceeds some size threshold, which would allow resetting the first one after all readers finish that started before the wal_checkpoint finished, even if there are new writes in the second WAL. Then the roles/order of the two WALs flipped, allowing the second WAL to be reset regardless of read/write frequency. I believe that would limit the total WAL size to about 2 times of the size of writes happening within the timespan of a single read.

This solution has been suggested previously on this list by Mark Hamburg, but the threads lack a simple reproducer and detailed problem description.

Best regards,
Florian

Test script:

rm -f test.sqlite
./sqlite3 test.sqlite <<< "
PRAGMA journal_mode=WAL;
CREATE TABLE t (value INTEGER);
REPLACE into t (rowid, value) VALUES (1,0);
"

for i in {1..1000}
do

./sqlite3 test.sqlite <<< "
BEGIN;
SELECT value FROM t WHERE rowid=1;
.system sleep 0.2
SELECT value FROM t WHERE rowid=1;
COMMIT;
" &

sleep 0.1
./sqlite3 test.sqlite <<< "
BEGIN;
REPLACE into t (rowid, value) VALUES (1,$i);
.print inc
COMMIT;
"
wc -c test.sqlite-wal
done

_______________________________________________
sqlite-users mailing list
sqlite-users@mailinglists.sqlite.org
http://mailinglists.sqlite.org/cgi-bin/mailman/listinfo/sqlite-users

Reply via email to