<quote who="Federico Leva (Nemo)" date="Thu, May 29, 2014 at 08:40:16AM +0200">
> Piotr Konieczny, 29/05/2014 05:56:
> >Wikia (the largest wiki farm?) appears to be drastically
> >under-researched...
> 
> Part of the reason may be that they don't offer regular data dumps.
> But WikiTeam has remedied and recovered dumps for most of their top 14k
> wikis (as well as all images):
> https://archive.org/details/wikia_dump_20140125
> https://archive.org/search.php?query=wikia_dump

Wikia published comprehensive dumps for all of their wikis until
sometime in 2010. This is how Kittur and Kraut could write the paper
they did.

Without question, the current dumps put together by WikiTeam are an
awesome resource for folks wanting to do Wikia research. That said,
they are a strange sample and it's not clear how they are
representative of other Wikia wikis. This makes it hard to use the
sample to confidently answer a question like Piotr's.

Basically, logged-in users have to "request" every dump individually
and by hand. Once a dump is requested, it will be created and put in
S3 and then seems to be kept around for at least several months. I've
found some shockingly big and important wikis without dumps and 14k is
a tiny proportion of all wikis! :-(

If I can help or provide resources to help get a new comprehensive
set of Wikia dumps, let me know.

Regards,
Mako


-- 
Benjamin Mako Hill
http://mako.cc/

Creativity can be a social contribution, but only in so far
as society is free to use the results. --GNU Manifesto

Attachment: signature.asc
Description: Digital signature

_______________________________________________
Wiki-research-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l

Reply via email to