That works.  I created the JIRA issue and attached your patch.  It passes 
all build tests and works on my 150K run across my 5-machine dev cluster. 
Should we go ahead and commit this?

Dennis

Andrzej Bialecki wrote:
> Dennis Kubes wrote:
>> Ok, I ran some bigger test crawls (> 150K) with the 0.9 RC.  Everything 
>> worked fine (inject, generate, fetch, updatedb, readdb, linkdb, 
>> mergesegs, mergedb, merge, index) except delete duplicates, on which I 
>> am getting this error when running against segment indexes on the DFS.
>>
>> Because of the way I am automating some of my crawls (sorting names 
>> alphabetically and only running part of the list), only one segment 
>> part-xxxxx had results and the others had 0 results.  I don't know if 
>> that would cause this, and I don't think this bug is critical for the 
>> 0.9 release, but I wanted to bring it up.
> 
> Please try the patch included at the end.
> 
> 
>>
>> My guess would be that this is a small bug within the Lucene libraries 
>> when the directories have 0 results.  What is everyone's opinion on 
>> this in terms of the release?  My vote would be to move forward with 
>> the release.
> 
> I think we should move forward.
> 
> 

_______________________________________________
Nutch-developers mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-developers
