Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The following page has been changed by PaulBaclace:
http://wiki.apache.org/nutch/OverviewDeploymentConfigs

------------------------------------------------------------------------------
    5. set environment variables in ~/.ssh/environment, since ssh does not source your .bash_profile. These include JAVA_HOME, NUTCH_LOG_DIR, NUTCH_SLAVES and NUTCH_MASTER.
    6. make sure that your NUTCH_LOG_DIR and the directories named in ndfs.data.dir exist on all slaves. This can be done most easily with bin/slaves.sh.

- === Deployment Startup Sequences ===
+ === Deployment and Startup Sequences ===

  A. Cluster deployment with too many machines to customize (probably more than 4; 1000 machines should be possible):

@@ -63, +63 @@

  B. Cluster of a few machines:

-   1. ''Add more details here''
+ While the cluster deployment and startup sequence can apply to 2 or more machines, the idea behind this case is to address experimental configurations for federations of machines that might have firewalls in between them; 1. ''Add more details here...''

  C. One developer debugging on one machine:

    1. ''Add more details here''
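
Steps 5 and 6 quoted in the diff context can be sketched roughly as follows. This is only an illustration: every path and host name is a placeholder assumption, not a value from the wiki page, and the example writes to a local file (ssh-environment.example) instead of touching a real ~/.ssh/environment. Note that sshd reads ~/.ssh/environment only when PermitUserEnvironment is enabled in sshd_config, and that the file holds plain NAME=value lines with no shell expansion.

```shell
#!/bin/sh
# Sketch for step 5: write the variables that ssh sessions should see.
# ENV_FILE is a hypothetical name; copy the result to ~/.ssh/environment
# on each machine once the values match your installation.
ENV_FILE="${ENV_FILE:-./ssh-environment.example}"

# All values below are placeholders (assumptions), not taken from the wiki.
cat > "$ENV_FILE" <<'EOF'
JAVA_HOME=/usr/lib/jvm/java
NUTCH_LOG_DIR=/var/log/nutch
NUTCH_SLAVES=/home/nutch/conf/slaves
NUTCH_MASTER=master.example.com:/home/nutch
EOF

# Sketch for step 6: create the log and data directories on every slave.
# Assumption: bin/slaves.sh runs its arguments as a command on each host
# listed in the NUTCH_SLAVES file; /data/ndfs stands in for whatever
# ndfs.data.dir names in your configuration.
if [ -x bin/slaves.sh ]; then
  bin/slaves.sh mkdir -p /var/log/nutch /data/ndfs
fi
```

Run it from the Nutch installation directory if the slaves step should take effect; elsewhere it only writes the example file.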
