Great, thanks for making this available!

Peter

> -----Original Message-----
> From: [email protected] [mailto:mnemosyne-[email protected]] On Behalf Of Gwern Branwen
> Sent: 07 February 2014 18:47
> To: Mnemosyne mailing list
> Subject: [mnemosyne-proj-users] Mnemosyne data set available
>
> Hello everyone. As you know, by default Mnemosyne collects logs of all flashcard reviews, and has done so for years.
>
> This seems like it could be useful data for some projects (see for example my earlier emails about time-of-day and -week on performance), but it's difficult to get access to the corpus because it has become far too large to email or casually host; Peter provided a torrent ~4 years ago but it died long ago. One can email him for a copy, but that takes up his time, and in any case the logs still have to be processed into a SQL database (I tried recently, but after 3 months of processing it still hadn't finished because of the IO bottleneck of my hard drive). I've taken it upon myself to take a recent dump of logs from Peter, process them into a SQL database (~1 day with my new SSD), and upload them to my Amazon S3 account where anyone can download them.
>
> The link is https://s3.amazonaws.com/gwern-mnemosyne/2014-01-27-mnemosynelogs-all.db.xz
> (due to the size, I suggest a download manager like `wget --continue`).
>
> This is a 2.8GB file compressed with xz (https://en.wikipedia.org/wiki/Xz) which `unxz`/unpacks to an 18GB SQLite 3.x database with the MD5 hash 03569c5416dd6923613389be6d0cc9e1. It can be queried with commands like `$ sqlite3 -batch ./logs.db "SELECT timestamp,object_id,grade FROM log WHERE event==9;"` or via SQL interfaces like 'sqldf' for R.
>
> I commit to keeping the file up for 3 months before removing it, since S3 bandwidth is not free; if you'd like to see it stay longer, I accept Bitcoin donations at 1HbHpdhazqzfPtbcw9NA2H9R1GWNekm1L
>
> --
> gwern
> http://www.gwern.net/Spaced%20repetition
>
> --
> You received this message because you are subscribed to the Google Groups "mnemosyne-proj-users" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to [email protected].
> To post to this group, send email to mnemosyne-proj-[email protected].
> To view this discussion on the web visit https://groups.google.com/d/msgid/mnemosyne-proj-users/CAMwO0gz2azU-N-oCiLrCJ1fYvpU-37Aj%2BMWkRGBWHuBeZXxzTQ%40mail.gmail.com.
> For more options, visit https://groups.google.com/groups/opt_out.
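
For anyone who would rather script the check and the query than type them at a shell, here is a minimal Python sketch of the two steps described in the announcement: hashing the unpacked database and running the example query. The function names `md5_of_file` and `fetch_reviews`, the `limit` parameter, and the file name `logs.db` are my own assumptions for illustration; only the `log` table, its `timestamp`, `object_id`, `grade` and `event` columns, the event code 9, and the MD5 value come from the message above. I am also assuming the quoted hash refers to the unpacked database rather than the .xz file.

```python
import hashlib
import sqlite3

# MD5 quoted in the announcement (assumed to describe the unpacked database).
EXPECTED_MD5 = "03569c5416dd6923613389be6d0cc9e1"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks.

    Chunked reading keeps memory use flat even for an 18GB database.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fetch_reviews(db_path, event=9, limit=10):
    """Return up to `limit` (timestamp, object_id, grade) rows for one event code.

    Mirrors the example query from the announcement, with a LIMIT added
    (my addition) so a first look does not pull the whole table.
    """
    connection = sqlite3.connect(db_path)
    try:
        cursor = connection.execute(
            "SELECT timestamp, object_id, grade FROM log WHERE event = ? LIMIT ?",
            (event, limit),
        )
        return cursor.fetchall()
    finally:
        connection.close()
```

Typical use would be something like `md5_of_file("logs.db") == EXPECTED_MD5` followed by `fetch_reviews("logs.db")`. Hashing 18GB takes a while, so it is probably worth doing once after unpacking rather than on every run.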
