Looks like I made it through a cycle on the map reduce branch. I put my steps here
http://spack.net/nutch/SimpleMapReduceTutorial.html A few things questions. 1. Sounds like some of you have some glue programs that help run the whole process. Are these going to end up in subversion sometime? I am guessing there is much duplicated effort. 2. Not sure how to test that my index actually worked. Starting catalina in my index directory didn't work this time. 3. What do you all think of setting up some test directories to crawl, in say http://lucene.apache.org/nutch/test/ Thinking it would be kind of cool to have junit run through a whole process on external pages. 4. Any way that http://spack.net/nutch/SimpleMapReduceTutorial.html http://spack.net/nutch/GettingNutchRunningOnUbuntu.html can get on the wiki? I am using apache-ish style and would change to whatever, but as fun as these are to write, I would like to see them used. Feedback would be appreciated. Enjoy! Earl __________________________________ Yahoo! Mail - PC Magazine Editors' Choice 2005 http://mail.yahoo.com
