Re: Separating nutch and hadoop configurations.

Andrzej Bialecki Wed, 11 Jul 2007 10:57:45 -0700

Briggs wrote:

I am currently trying to figure out how to deploy Nutch and Hadoop
separately.  I want to configure Hadoop outside of Nutch and have
Nutch use that service, rather than configuring hadoop within nutch.
I would think all that Nutch should need to know is the urls to
connect to Hadoop, but can't figure out how to get this to work.


Is this possible?  If so, is there some sort of document, or archive
of another list post for this?

Sorry for the ignorance.

If you have a clean hadoop installation up and running (made e.g. fromone of the official Hadoop builds), it should be enough to put thenutch*.job file in ${hadoop.dir}, and copy bin/nutch (possibly with someminor modifications - my memory is a little vague on this ...).



--
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com

Re: Separating nutch and hadoop configurations.

Reply via email to