Hi There,

1) We are trying to use sqlite3 in Glusterfs as a Change Time Recording Data 
Store
i.e using sqlite3 db to record any modification attempts that happens on the 
glusterfs
per file inode, So that it can be used to indicate the hotness of the file in 
the cluster.

2) We have developed a prototype that inserts/updates inode records in sqlite3 
db(one instance of the 
db file per glusterfs storage unit or as we call it a brick) to record the 
modification 
that are happening on the inode. These insert/updates are in the IO path of the 
file i.e any data or 
metadata change time will be record inline, w.r.t the IO, in the DB.

3) We only save the modification time on the to the db, so that 
we can easily query for "What files have change during a specific period of 
time" or 
"What files have not change during a specific period of time". The query is 
done by a scanner 
that runner on each storage unit or brick on a regular interval and chooses HOT 
or COLD files as suggested 
by the DB for data maintenance operations.

4) Therefore, we are looking at a datastore that can give us a very quick 
write(almost zero latency, 
as the recording is done inline w.r.t file IO) and that as good data querying 
facilities(Slight latency in the read is fine but the fresh of that record data 
should be spot on).

5) Please find our DB setting,
Journal mode : WAL
SYNC MODE : NORMAL 
Cache Size : 1000 - 4096 Pages (Default Page Size)
AutoCheck-Pointing: 1000 - 1000000 Pages 

We wanted to known the following
1) How could we improve the performance on the write side so that we have 
minimal latency?
2) Will ther be any write performance hit when the number of records in the DB 
increase?

Thanks in advance

~Joe
_______________________________________________
sqlite-users mailing list
sqlite-users@sqlite.org
http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users

Reply via email to