I believe the percentage complete is set in hadoop, in the
TaskInProgress.recomputeProgressMethod() and then lines 570-595 in
JobInProgress.updateTaskStatus.
Dennis
Otis Gospodnetic wrote:
Hi,
I noticed that during fetching map tasks get to 100% complete (in the GUI), but
are not marked as completed (also in the GUI), and are in fact really not
complete - the logs show there is fetching still going on (though almost
exclusively timeouts at the end of the fetch run, as expected), while the GUI
shows 100% for all tasks (but 99.99% on the overall job view page).
How come this is happening? I can't locate the spot in Nutch where the
percentage completed is sent during fetching. Or, actually, this could
actually be all in Hadoop land and we can't change the percentages shown.
Thanks,
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch