Andrzej Bialecki wrote:

> Michael Wechner wrote:
>
>> Hi
>>
>> Please apologize if I might ask something obvious, but what is 
>> actually the purpose
>> of the nutch-*.job file?
>
>
> It contains all classes and plugins needed to run a Nutch job on a 
> Hadoop cluster. Hadoop cluster doesn't have to be used for Nutch, 
> indeed there are many other interesting applications for it - so the 
> core Hadoop is independent of any Nutch classes.


ok. I guess this http://wiki.apache.org/nutch/NutchHadoopTutorial

>
> So, as the job is submitted to the cluster, there must be a way to 
> transmit all necessary implementation classes so that tasks on 
> individual nodes could execute the Nutch code. This is the purpose of 
> the job file - it is then expanded on each node,


the submission and expansion of the job file is done automatically? I 
mean one deploys the job file manually on the master and then it's being
spread automatically on the slaves?

Thanks for the info an clarifications

Michi

> and all classes and plugins are loaded by a task's classloader.
>


-- 
Michael Wechner
Wyona      -   Open Source Content Management   -    Apache Lenya
http://www.wyona.com                      http://lenya.apache.org
[EMAIL PROTECTED]                        [EMAIL PROTECTED]
+41 44 272 91 61


-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to