[EMAIL PROTECTED] wrote:
Aha, thank you. The progress percentage is set early on. Is that a good thing for both Nutch and Hadoop in general? If that progress is set to early on in the process, what happens when you have a task that takes a *really* long time? I suppose it's just a minor annoyance, since one can always look at the completed/not completed bit to see what the real task status is.
Well, I guess so far it's been just a minor annoyance ... I think it's possible to fix it, but you would need to implement your own version of RecordReader instead of the SequenceFileRecordReader (which is instantiated in SequenceFileInputFormat), that's where the progress of the map task is set.
-- Best regards, Andrzej Bialecki <>< ___. ___ ___ ___ _ _ __________________________________ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
