That works. I created the JIRA issue and attached your patch. It passes all build tests and works on my 150K run across my 5-machine dev cluster. Should we go ahead and commit this?

Dennis

Andrzej Bialecki wrote:
> Dennis Kubes wrote:
>> Ok, I ran some bigger test crawls > 150K with the 0.9RC. Everything
>> worked fine (inject, generate, fetch, updatedb, readdb, linkdb,
>> mergesegs, mergdb, merge, index) except delete duplicates on which I
>> am getting this error when running against segment indexes on the DFS.
>>
>> Because of the way I am automating some of my crawls (sorting names by
>> alpha and only running part of the list), only one segment part-xxxxx
>> had results and then others had 0 results. I don't know if that would
>> cause this and I don't think this bug is critical for the 0.9 release
>> but I wanted to bring it up.
>
> Please try the patch included at the end.
>
>>
>> My guess would be that this is a small bug within the lucene libraries
>> when the directories have 0 results. What is everyone's opinion on
>> this in terms of the release? My vote would be to move forward with
>> the release.
>
> I think we should move forward.

_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers
