Thanks Doğacan. I went ahead and did this anyway after checking that they weren't being used, and all was well. But does Nutch usually take up this much space in temp files?
I'm running the crawl on a server that never gets restarted, so I can't have all the drive space used up like this. I can write a cron job to regularly remove the files, but this seems a bit haphazard.

Thanks again for your reply,
Lyndon
