On Sat, Jan 09, 2010 at 01:31:04PM +1300, Ralph Versteegen wrote:
> 2010/1/9  <[email protected]>:
> > james
> > 2010-01-08 15:53:40 -0800 (Fri, 08 Jan 2010)
> > 196
> > Ack! Wiki mirror script was also mirroring my entire album collection, 
> > which explains why the script was
> > taking hours to finish and took up six gigabytes more disk space than it 
> > was supposed to!
> 
> That's hilarious :)

Actually, it wasn't as bad as I thought. Not the full 6 gb, but it was 
indeed getting some stuff it shouldn't.

> But why was it mirroring thehamsterwheel.net? Don't you have to
> explicitly specify other domain names to crawl them?

httrack can be configured to either whitelist or blacklist sites. I am 
doing a bit of both, which might be the wrong approach.

---
James Paige
_______________________________________________
Ohrrpgce mailing list
[email protected]
http://lists.motherhamster.org/listinfo.cgi/ohrrpgce-motherhamster.org

Reply via email to