Hi Tizy,

After you crawl the images, take a look at ./bin/nutch dump to get the images out of the crawl segments. ./bin/nutch commoncrawldumper will also dump the crawl data in the Common Crawl format.
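As a rough sketch of the dump step, something like the following might work. The paths (crawl/segments, dump/images), the MIME type list, and the helper name build_dump_cmd are placeholders I made up for illustration. I am also assuming a Nutch 1.x build whose dump tool accepts -segment, -outputDir and -mimetype; running ./bin/nutch dump with no arguments should print the actual usage for your version.

```shell
#!/bin/sh
# Sketch only: every path and MIME type below is an assumed placeholder.
NUTCH_HOME="${NUTCH_HOME:-.}"                 # assumed: root of the Nutch install
SEGMENT_DIR="${SEGMENT_DIR:-crawl/segments}"  # assumed: where the crawl wrote its segments
OUTPUT_DIR="${OUTPUT_DIR:-dump/images}"       # assumed: where the images should land

# Hypothetical helper: print the dump command for a segment dir ($1) and an
# output dir ($2) so it can be reviewed before anything is executed.
build_dump_cmd() {
  echo "$NUTCH_HOME/bin/nutch dump -segment $1 -outputDir $2 -mimetype image/jpeg image/png image/gif"
}

# Dry run: show the command instead of running it.
build_dump_cmd "$SEGMENT_DIR" "$OUTPUT_DIR"

# To run it for real, create the output directory and execute the printed command:
#   mkdir -p "$OUTPUT_DIR" && sh -c "$(build_dump_cmd "$SEGMENT_DIR" "$OUTPUT_DIR")"
```

Printing the command first keeps the sketch safe to try on a machine where the segment paths have not been checked yet.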
Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: [email protected]
WWW: http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

-----Original Message-----
From: Tizy Ninan <[email protected]>
Reply-To: "[email protected]" <[email protected]>
Date: Monday, March 23, 2015 at 11:12 PM
To: "[email protected]" <[email protected]>, "[email protected]" <[email protected]>
Subject: Crawl images and store locally

>Hi,
>
>Does Nutch support crawling images from webpages? If so, what are the
>steps to retrieve the images and store them locally?
>
>Thanks and Regards,
>Tizy

