Hi all Can someone suggest me how to restrict number of jobs Nutch lauches in hadoop when starts segment merger.
When I run generate, fetch, updatedb tasks Nutch starts about 6-10 Mapreduce jobs (cluster of 2 datanodes) - actual value varies from task to task but when the script start merging segments it lauches about 20 jobs and servers get overloaded and crash. Nutch settings are primary default one. How can I control the number of jobs? best Regards Alexander -- Best Regards Alexander Aristov
