[EMAIL PROTECTED] wrote:
Aha, thank you.  The progress percentage is set early on.  Is that a
good thing for both Nutch and Hadoop in general?  If that progress is
set to early on in the process, what happens when you have a task
that takes a *really* long time?  I suppose it's just a minor
annoyance, since one can always look at the completed/not completed
bit to see what the real task status is.


Well, I guess so far it's been just a minor annoyance ... I think it's possible to fix it, but you would need to implement your own version of RecordReader instead of the SequenceFileRecordReader (which is instantiated in SequenceFileInputFormat), that's where the progress of the map task is set.


--
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com

Reply via email to