Hello, i've succesfully set up cluster of 3 machines under hadoop. However i have a problem. While fetching hadoop generates 6 jobs, however the number of pages in each of those jobs is not spread equally i get 5 jobs with ~ 3 500 pages and one with ~ 50 000. That's not a good thing as 5 jobs finish very quickly and afterwards only one machine is working while others are waiting. Could this be a problem with my configuration, i've set number of map jobs to 30, number of reduce jobs to 6 and fetcher threads to 150, however during fetch i still get only 6 map jobs. Any help would be appreciated, thanks.
-- Karol Rybak Programista / Programmer Sekcja aplikacji / Applications section Wyższa Szkoła Informatyki i Zarządzania / University of Internet Technology and Management +48(17)8661277
