Hoi,
Deleted files AFTER the creation of the tarballs were created will always
be part of the tarball. If you create logic that works on these archives,
it is likely that they will also work on the live data at Commons...
MY QUESTION... Yes, it is good to have a backup somewhere. However, what
Hi Frederico,
This is great news! I have two questions though:
1. What happens to files deleted after your crawler retrieved them? I
suppose they will still be available in the archives.
2. Is the archive team willing to host 3rd party, specialized
downloads, such as all the pictures from WLM
WikiTeam has just finished archiving all Wikimedia Commons files up to
2012 (and some more) on the Internet Archive:
https://archive.org/details/wikimediacommons
So far it's about 24 TB of archives and there are also a hundred
torrents you can help seed, ranging from few hundred MB to over a