If you just want to save a copy of the site locally, you'd
probably be better off using wget.  
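
As a rough sketch of what I mean (the `mirror_site` helper name and the
example.com URL are placeholders of mine, not anything from Nutch),
something like this should pull down the pages together with the images
they reference:

```shell
# Hypothetical helper: mirror one site into the current directory.
# $1 is the starting URL (placeholder; substitute your own site).
mirror_site() {
    # --mirror           recursive download with timestamping
    # --page-requisites  also fetch images, CSS, etc. needed to render pages
    # --convert-links    rewrite links so the local copy is browsable offline
    # --no-parent        never ascend above the starting directory
    wget --mirror --page-requisites --convert-links --no-parent "$1"
}
```

You would then run `mirror_site http://www.example.com/` and browse the
resulting directory offline. Note that this only copies the site; it does
not build a search index the way a Nutch crawl does.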

Jake.

-----Original Message-----
From: Stefan Groschupf [mailto:[EMAIL PROTECTED] 
Sent: Sunday, January 29, 2006 7:21 PM
To: [email protected]
Subject: Re: download/mirror

Nutch can also cache page content if you like, but I don't think it
does the same for images.

On 30.01.2006 at 01:15, Michael Dodson wrote:

> Do the pages stored locally contain text-only data?  Is there a way
> for Nutch to store images as well?
>
>
> On Jan 29, 2006, at 10:09 PM, Stefan Groschupf wrote:
>
>> Sure.
>> Crawl the pages and the result is stored locally in an index.
>>
>> On 28.01.2006 at 14:19, Michael Dodson wrote:
>>
>>> Can Nutch be used to download websites as well as index those  
>>> sites so searching and retrieving can be done offline?
>>>
>>
>> ---------------------------------------------------------------
>> company:        http://www.media-style.com
>> forum:        http://www.text-mining.org
>> blog:            http://www.find23.net
>>
>>
>
>

Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
