Re: java.io.EOFException in latest nightly in mergesegs from hadoop.io.DataOutputBuffer

2007-01-21 Thread Sami Siren
Brian Whitman wrote: On Jan 19, 2007, at 4:29 AM, Andrzej Bialecki wrote: Could you guys come up with exact data that causes this bug (primarily I'm interested in a seed list, because then I can see that you simply use the crawl tool, and finally try to run mergesegs). Thanks! I am also

Re: java.io.EOFException in latest nightly in mergesegs from hadoop.io.DataOutputBuffer

2007-01-21 Thread Sami Siren
However I cannot find from the change logs of hadoop that what the change is that is causing nutch these problems. It's HADOOP-331, so i guess at least the changes/additions in map() is required. -- Sami Siren

How to Become a Nutch Developer

2007-01-21 Thread Dennis Kubes
All, I am working on a How to Become a Nutch Developer document for the wiki and I need some input. I need an overview of how the process for JIRA works? If I am a developer new to Nutch and just starting to look at the JIRA and I want to start working on some piece of functionality or to

Re: How to Become a Nutch Developer

2007-01-21 Thread Andrzej Bialecki
Dennis Kubes wrote: All, I am working on a How to Become a Nutch Developer document for the wiki and I need some input. I need an overview of how the process for JIRA works? If I am a developer new to Nutch and just starting to look at the JIRA and I want to start working on some piece of

Re: How to Become a Nutch Developer

2007-01-21 Thread Chris Mattmann
Hi Dennis, On 1/21/07 11:47 AM, Dennis Kubes [EMAIL PROTECTED] wrote: All, I am working on a How to Become a Nutch Developer document for the wiki and I need some input. I need an overview of how the process for JIRA works? If I am a developer new to Nutch and just starting to look at

Reviving Nutch 0.7

2007-01-21 Thread Otis Gospodnetic
Hi, I've been meaning to write this message for a while, and Andrzej's StrategicGoals made me compose it, finally. Nutch 0.8 and beyond is very cool, very powerful, and once Hadoop stabilizes, it will be even more valuable than it is today. However, I think there is still a need for