[Bug-wget] Difficulty downloading a site from archive.org

2011-08-13 Thread phil curb
I've been looking at downloading a site that's on archive.org. I don't have the site in front of me now, but here are two example pages showing the kind of structure I'm working with. Notice that archive.org spreads the website across various directories.

Re: [Bug-wget] Difficulty downloading a site from archive.org

2011-08-13 Thread Micah Cowan
On 08/12/2011 11:56 AM, phil curb wrote:
> I've been looking at downloading a site that's on archive.org
Archive.org's TOS on their website expressly forbids the use of downloading agents, and names wget explicitly. All URLs on archive.org always point at the _original_ (either modern, or

Re: [Bug-wget] Difficulty downloading a site from archive.org

2011-08-13 Thread Tony Lewis
Micah Cowan wrote:
> (I'd be interested in knowing whether folks actually have legal obligations to respect TOS to an unrestricted-access site like that... I imagine it might even vary by location)
What terms of service? I didn't see any terms of service (perhaps because I didn't look for