<quote who="Federico Leva (Nemo)" date="Thu, May 29, 2014 at 08:40:16AM +0200">
> Piotr Konieczny, 29/05/2014 05:56:
> >Wikia (the largest wiki farm?) appears to be drastically
> >under-researched...
>
> Part of the reason may be that they don't offer regular data dumps.
> But WikiTeam has remedied and recovered dumps for most of their top 14k
> wikis (as well as all images):
> https://archive.org/details/wikia_dump_20140125
> https://archive.org/search.php?query=wikia_dump

Wikia published comprehensive dumps for all of their wikis until sometime in 2010. This is how Kittur and Kraut could write the paper they did.

Without question, the current dumps put together by WikiTeam are an awesome resource for folks wanting to do Wikia research. That said, they are a strange sample and it's not clear how they are representative of other Wikia wikis. This makes it hard to use the sample to confidently answer a question like Piotr's.

Basically, logged-in users have to "request" every dump individually and by hand. Once a dump is requested, it will be created and put in S3 and then seems to be kept around for at least several months.

I've found some shockingly big and important wikis without dumps and 14k is a tiny proportion of all wikis! :-(

If I can help or provide resources to help get a new comprehensive set of Wikia dumps, let me know.

Regards,
Mako

--
Benjamin Mako Hill
http://mako.cc/

Creativity can be a social contribution, but only in so far
as society is free to use the results. --GNU Manifesto
_______________________________________________
Wiki-research-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
