Hello everyone. As you know, by default Mnemosyne collects logs of all
flashcard reviews, and has done so for years.

This seems like it could be useful data for some projects (see for
example my earlier emails about time-of-day and -week on performance),
but it's difficult to get access to the corpus because it has become
far too large to email or casually host; Peter provided a torrent ~4
years ago but it died long ago. One can email him for a copy but that
takes up his time and in any case, the logs still have to processed
into a SQL database (I tried recently but after 3 months of
processing, it still hadn't finished because of the IO bottleneck of
my hard drive). I've taken it upon myself to take a recent dump of
logs from Peter, process them into a SQL database (~1 day with my new
SSD), and upload them to my Amazon S3 account where anyone can
download them.

The link is 
https://s3.amazonaws.com/gwern-mnemosyne/2014-01-27-mnemosynelogs-all.db.xz
(due to the size, I suggest a download manager like `wget
--continue`).

This is a 2.8GB file compressed with xz
(https://en.wikipedia.org/wiki/Xz) which `unxz`/unpacks to an 18GB
SQLite 3.x database with the MD5 hash 03569c5416dd6923613389be6d0cc9e1
It can be queried with commands like `$ sqlite3 -batch ./logs.db
"SELECT timestamp,object_id,grade FROM log WHERE event==9;"` or via
SQL interfaces like 'sqldf' for R.

I commit to keeping the file up for 3 months before removing it, since
S3 bandwidth is not free; if you'd like to see it stay longer, I
accept Bitcoin donations at 1HbHpdhazqzfPtbcw9NA2H9R1GWNekm1L

-- 
gwern
http://www.gwern.net/Spaced%20repetition

-- 
You received this message because you are subscribed to the Google Groups 
"mnemosyne-proj-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/mnemosyne-proj-users/CAMwO0gz2azU-N-oCiLrCJ1fYvpU-37Aj%2BMWkRGBWHuBeZXxzTQ%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.

Reply via email to