Eugeny N Dzhurinsky wrote:
Hello there!
Could somebody please explain is it possible to get some statistics for the
certain job? For instance, get some numbers of how many data tuples were
processed yet, and how many tuples needs to be processed to complete the job?
http://lucene.apache.org/hadoop/mapred_tutorial.html#Job+Control
Specifically:
http://lucene.apache.org/hadoop/api/org/apache/hadoop/mapred/JobClient.html#submitJob(org.apache.hadoop.mapred.JobConf)
and
http://lucene.apache.org/hadoop/api/org/apache/hadoop/mapred/RunningJob.html
Arun
This presumes the job knows the required statistics and can deliver it when
queried by Hadoop/something else.