Brian Whitman wrote:
> 
> On Jan 19, 2007, at 4:29 AM, Andrzej Bialecki wrote:
>>
> 
>> Could you guys come up with exact data that causes this bug (primarily
>> I'm interested in a seed list, because then I can see that you simply
>> use the crawl tool, and finally try to run mergesegs). Thanks!

I am also experiencing NPE in SegmentReader and Indexer, not 100% sure
yet what exactly causes these problems that happens when hadoop
"spills", I got rid of it with a little patching:

- added/changed Mapper.map to return ObjectWritable instead of plain object.

- patched SequenceFile slightly because of NPE in
SequenceFile.Sorter.MergeQueue

However I cannot find from the change logs of hadoop that what the
change is that is causing nutch these problems.

--
 Sami Siren

-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to