Re: [Wikitech-l] Tarballs of all 2004-2012 Commons files now available at archive.org

2013-10-16 Thread Gerard Meijssen
Hoi, Deleted files AFTER the creation of the tarballs were created will always be part of the tarball. If you create logic that works on these archives, it is likely that they will also work on the live data at Commons... MY QUESTION... Yes, it is good to have a backup somewhere. However, what

Re: [Wikitech-l] Tarballs of all 2004-2012 Commons files now available at archive.org

2013-10-15 Thread Strainu
Hi Frederico, This is great news! I have two questions though: 1. What happens to files deleted after your crawler retrieved them? I suppose they will still be available in the archives. 2. Is the archive team willing to host 3rd party, specialized downloads, such as all the pictures from WLM

[Wikitech-l] Tarballs of all 2004-2012 Commons files now available at archive.org

2013-10-13 Thread Federico Leva (Nemo)
WikiTeam has just finished archiving all Wikimedia Commons files up to 2012 (and some more) on the Internet Archive: https://archive.org/details/wikimediacommons So far it's about 24 TB of archives and there are also a hundred torrents you can help seed, ranging from few hundred MB to over a