Andrzej Bialecki wrote:
> Dennis Kubes wrote:
>> I agree there may be subtle bugs.
>>
>> I can do, say, a full dmoz crawl (~5M pages) with Nutch trunk and Hadoop
>> 0.12.1 on a small cluster of 5 machines, if that would help. We have
>> already
>>   
> 
> Certainly, that would be most welcome.

I will start that up today.

> 
> 
>> done some crawls of more than 100K URLs with 0.11.2 without problems.
>> I say let's test it, and if there aren't any significant issues then
>> let's go with 0.12.1 if the Hadoop team thinks it will be more stable.
>>   
> 
> 0.12.1 is not out the door yet. I can create a patch that uses the 
> latest Hadoop trunk binaries, so that we could test it.

I can just pull it down from source. Let me know if that isn't what we
want.
> 
> 
>> One question though, are there any concerns about upgrading clusters as
>> opposed to new fetches?
>>   
> 
> Theoretically, there shouldn't be, but this is an uncharted area ... 
> until someone tries it we won't know for sure. :-/
> 
