Dennis Kubes wrote:
> I agree there may be subtle bugs.
>
> I can do, say, a full dmoz crawl (~5M pages) with nutch trunk and hadoop
> 12.1 on a small cluster of 5 machines if this would help?  We have already
>   

Certainly, that would be most welcome.


> done some crawls > 100K urls with 11.2 without problems.  I say let's test
> it and if there aren't any significant issues then let's go with 12.1 if
> the hadoop team thinks it will be more stable.
>   

0.12.1 is not out the door yet. I can create a patch that uses the
latest Hadoop trunk binaries, so that we can test it.


> One question though, are there any concerns about upgrading clusters as
> opposed to new fetches?
>   

Theoretically, there shouldn't be, but this is an uncharted area ... 
until someone tries it we won't know for sure. :-/

-- 
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



_______________________________________________
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers
