Briggs wrote:
I am currently trying to figure out how to deploy Nutch and Hadoop
separately.  I want to configure Hadoop outside of Nutch and have
Nutch use that service, rather than configuring Hadoop within Nutch.
I would think all that Nutch should need to know is the URLs to
connect to Hadoop, but I can't figure out how to get this to work.

Is this possible?  If so, is there some sort of document, or archive
of another list post for this?

Sorry for the ignorance.

If you have a clean Hadoop installation up and running (e.g. made from one of the official Hadoop builds), it should be enough to put the nutch*.job file in ${hadoop.dir} and copy bin/nutch (possibly with some minor modifications; my memory is a little vague on this).
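
As a rough sketch of those two copy steps, here is a small Python helper. It is only an illustration under stated assumptions: the function name stage_nutch is hypothetical, the job file is assumed to sit at the top of the Nutch directory, and bin/nutch is assumed to belong in the Hadoop installation's bin/ directory (the destination is not spelled out above).

```python
import shutil
from pathlib import Path
from typing import List


def stage_nutch(nutch_dir: str, hadoop_dir: str) -> List[Path]:
    """Copy the Nutch job file(s) and the bin/nutch script into a Hadoop install.

    Hypothetical helper: names and directory layout are assumptions.
    """
    nutch = Path(nutch_dir)
    hadoop = Path(hadoop_dir)

    # Assumption: nutch*.job lives at the top level of the Nutch directory.
    job_files = sorted(nutch.glob("nutch*.job"))
    if not job_files:
        raise FileNotFoundError(f"no nutch*.job found in {nutch}")

    copied = []
    for job in job_files:
        # Put the job file directly into the Hadoop directory.
        copied.append(Path(shutil.copy2(job, hadoop / job.name)))

    # Assumption: the launcher script goes into <hadoop_dir>/bin.
    target_bin = hadoop / "bin"
    target_bin.mkdir(parents=True, exist_ok=True)
    copied.append(Path(shutil.copy2(nutch / "bin" / "nutch", target_bin / "nutch")))
    return copied
```

Calling stage_nutch("/path/to/nutch", "/path/to/hadoop") returns the list of files it wrote. It does not attempt the "minor modifications" to bin/nutch mentioned above; those would still need to be made by hand.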


--
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com
