If you just want to save a copy of the site locally, you'd probably be better off using wget.
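
As a rough sketch of what that could look like (the URL is a placeholder, and the `mirror_site` helper and `DRY_RUN` variable are made up for illustration, not part of wget or Nutch), the options below ask wget to recurse through the site, fetch page requisites such as images and stylesheets, and rewrite links so the copy can be browsed offline:

```shell
#!/bin/sh
# mirror_site is a hypothetical helper name; only the wget options are real.
#   --mirror           recursive download with timestamping
#   --page-requisites  also fetch images, CSS, etc. needed to render pages
#   --convert-links    rewrite links to point at the local copies
#   --no-parent        stay below the starting directory
mirror_site() {
    # Replace the positional parameters with the full wget command line.
    set -- wget --mirror --page-requisites --convert-links --no-parent "$1"
    if [ "${DRY_RUN:-0}" = "1" ]; then
        # Only print the command, so it can be checked before running.
        echo "$@"
    else
        "$@"
    fi
}
```

Calling `mirror_site http://www.example.com/` would then start the download into the current directory; with `DRY_RUN=1` set, it only prints the command.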
Jake.

-----Original Message-----
From: Stefan Groschupf [mailto:[EMAIL PROTECTED]
Sent: Sunday, January 29, 2006 7:21 PM
To: [email protected]
Subject: Re: download/mirror

Nutch can also 'cache' page content if you like, but I don't think it does the same for images.

On 30.01.2006 at 01:15, Michael Dodson wrote:

> Do the pages stored locally contain text-only data? Is there a way
> for nutch to store images as well?
>
>
> On Jan 29, 2006, at 10:09 PM, Stefan Groschupf wrote:
>
>> Sure.
>> Crawl pages and the result is stored locally in an index.
>>
>> On 28.01.2006 at 14:19, Michael Dodson wrote:
>>
>>> Can Nutch be used to download websites as well as index those
>>> sites so searching and retrieving can be done offline?
>>>
>>
>> ---------------------------------------------------------------
>> company: http://www.media-style.com
>> forum: http://www.text-mining.org
>> blog: http://www.find23.net
>>
>>
>
>

---------------------------------------------------------------
company: http://www.media-style.com
forum: http://www.text-mining.org
blog: http://www.find23.net

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
