Hi all,

Commons.wikimedia.org is growing and provides a fairly complete set of media files, including many interesting historical documents. Contributors rely on the availability and persistence of commons.wikimedia.org, but currently the full export is only available on download.wikimedia.org (ok, not today ;-).
I was wondering if it would be possible to allow web robots to access http://upload.wikimedia.org/wikipedia/commons/ to gather and mirror the media files. Since this is plain HTTP, the mirroring could benefit from standard HTTP caching mechanisms (instead of one large dump containing all the media files, which is harder to cache and update). Maybe this would allow a more distributed backup approach to ensure the resilience of commons.wikimedia.org?

Thanks a lot for your work,

adulau

--
Alexandre Dulaunoy (adulau)
http://www.foo.be/
http://www.foo.be/cgi-bin/wiki.pl/Diary
"Knowledge can create problems, it is not through ignorance
that we can solve them" - Isaac Asimov
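P.S. To illustrate the caching point: a mirror could revalidate each file with a conditional GET instead of re-fetching it, so unchanged files cost only a 304 response. Below is a minimal sketch of that logic in Python; the helper names and the cached-metadata shape are hypothetical, just to show the idea, not a finished mirroring tool.

```python
# Sketch: conditional-GET revalidation for an HTTP mirror (hypothetical helpers).
# A mirror stores each file's ETag / Last-Modified value and sends them back
# as If-None-Match / If-Modified-Since; a 304 means the local copy is current.

def build_conditional_headers(cached_meta):
    """Build revalidation headers from previously cached response metadata."""
    headers = {}
    if cached_meta.get("etag"):
        headers["If-None-Match"] = cached_meta["etag"]
    if cached_meta.get("last_modified"):
        headers["If-Modified-Since"] = cached_meta["last_modified"]
    return headers

def needs_download(status_code):
    """304 Not Modified means the cached copy can be kept as-is."""
    return status_code != 304
```

With something like this, a periodic crawl of upload.wikimedia.org would only transfer files that actually changed, which is exactly what a full-dump download cannot do.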
_______________________________________________
Wikitech-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikitech-l
