Alexis,

Your post on the dev list (http://www.mail-archive.com/[email protected]/msg01385.html), "Fetch command returns immediately", describes the exact problem I am seeing. Your log file looks very similar to mine. I will apply the fix you mention and see what happens.

Thanks!

-----Original Message-----
From: Alexis [mailto:[email protected]]
Sent: Friday, December 17, 2010 1:19 AM
To: [email protected]
Subject: Re: Does Nutch 2.0 in good enough shape to test?

Hi,

I've spent some time working on this as well. I've just put together a blog entry addressing the issues I ran into. See http://techvineyard.blogspot.com/2010/12/build-nutch-20.html

In a nutchsell, I changed three pieces in Gora and Nutch code:
- flush the datastore regularly in the Hadoop RecordWriter (in GoraOutputFormat)
- wait for Hadoop job completion in the Fetcher job
- ensure that the content length limit is not being exceeded in protocol-http plugin (only for MySQL datastore)

>> So what am I missing?
>
> I don't know, we need more information. BTW, dev@ list may be more
> appropriate for this discussion.
>

I agree this should not be in nutch-user list. Post a comment on my blog entry or reply to my thread (http://www.mail-archive.com/[email protected]/msg01385.html) in the dev-list!

Alexis.

