Andrzej Bialecki wrote: > Dennis Kubes wrote: >> I agree there may be subtle bugs. >> >> I can do say a full dmoz crawl (~5M pages) with nutch trunk and hadoop >> 12.1 on a small cluster of 5 machines if this would help? We have >> already >> > > Certainly, that would be most welcome.
I will start that up today. > > >> done some crawls > 100K urls with 11.2 without problems. I say let's >> test >> it and if there aren't any significant issues then let's go with 12.1 if >> the hadoop team thinks it will be more stable. >> > > 0.12.1 is not out the door yet. I can create a patch that uses the > latest Hadoop trunk binaries, so that we could test it. I can just pull it down from source. Let me know if that isn't what we want'. > > >> One question though, are there any concerns about upgrading clusters as >> opposed to new fetches? >> > > Theoretically, there shouldn't be, but this is an uncharted area ... > until someone tries it we won't know for sure. :-/ > ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ Nutch-developers mailing list Nutch-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nutch-developers