Re: Deploy nutch on existing Hadoop cluster

Lewis John Mcgibbney Thu, 21 Feb 2013 09:53:45 -0800

Welcome to the world of post 1.3 Nutch ;)

On Thursday, February 21, 2013, Amit Sela <[email protected]> wrote:
> I basically just built with ant and copied the contents of deploy (job
file
> + nutch and crawl scripts) to "nutch" folder in my hadoop-user directory
on
> the master.
>
> I changed the crawl script to work only in distributed mode and it seems
to
> work... though I am getting a lot of Child Error exceptions in one of the
> nodes (not the master)
> while another node seems to work fine (total 1 master + 2 slaves).
>
> Could it be so simple ? am I missing something ?
>
>
> Thanks
>
>
> On Thu, Feb 21, 2013 at 6:21 PM, Julien Nioche <
> [email protected]> wrote:
>
>> https://wiki.apache.org/nutch/NutchHadoopTutorial
>>
>> basically follow the steps in
>> http://hadoop.apache.org/docs/stable/cluster_setup.html then install
Nutch
>> on the master node of your cluster, 'cd runtime/deploy/bin' and use the
>> nutch scripts as usual. You can then use the standard Mapreduce webapp to
>> monitor the progress of your crawl
>>
>> Julien
>>
>> On 21 February 2013 10:00, Amit Sela <[email protected]> wrote:
>>
>> > Anyone have a good tutorial about deploying nutch (1.6) on a
pre-existing
>> > Hadoop cluster ?
>> >
>> > Thanks.
>> >
>>
>>
>>
>> --
>> *
>> *Open Source Solutions for Text Engineering
>>
>> http://digitalpebble.blogspot.com/
>> http://www.digitalpebble.com
>> http://twitter.com/digitalpebble
>>
>


-- 
*Lewis*

Re: Deploy nutch on existing Hadoop cluster

Reply via email to