[ 
https://issues.apache.org/jira/browse/HADOOP-5850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod K V updated HADOOP-5850:
------------------------------

    Attachment: HADOOP-5850-20090519.txt

Attaching patch to fix this issue.
 - With this patch, jobs with 0 maps or no input still run JobSetUp, any number 
of reduces (which do nothing), and the JobCleanUp task.
 - Removed the 0-splits check in JobInProgress.initTasks() and added checks so 
that the cleanup task doesn't launch before the setup task when the number of 
splits is zero.
 - Renamed TestEmptyJobWithDFS to TestEmptyJob, removed the HDFS dependency to 
speed up the test, and added checks that verify the number of map and reduce 
tasks run for an empty job.
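
To make the intended ordering concrete, here is a minimal sketch of the task 
sequence described above. It is a toy model only and not the code in the patch: 
EmptyJobSketch, plan() and the task names are hypothetical, and nothing here 
calls JobInProgress or any other Hadoop class.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical model, not Hadoop's JobInProgress: it only illustrates the
// task order described in this comment.
final class EmptyJobSketch {

    /** Returns the ordered list of task names a job with these counts would run. */
    static List<String> plan(int numMaps, int numReduces) {
        if (numMaps < 0 || numReduces < 0) {
            throw new IllegalArgumentException("task counts must be non-negative");
        }
        List<String> tasks = new ArrayList<>();
        // Setup always comes first, even when there are no splits.
        tasks.add("setup");
        for (int i = 0; i < numMaps; i++) {
            tasks.add("map-" + i);
        }
        // Reduces are still scheduled for an empty job; they have no input.
        for (int i = 0; i < numReduces; i++) {
            tasks.add("reduce-" + i);
        }
        // Cleanup is always last, so it can never precede setup.
        tasks.add("cleanup");
        return tasks;
    }

    public static void main(String[] args) {
        // A job with 0 maps and 2 reduces.
        System.out.println(plan(0, 2)); // prints [setup, reduce-0, reduce-1, cleanup]
    }
}
```

In this model a job with 0 maps and 0 reduces still yields setup followed by 
cleanup, which is the case the removed 0-splits check used to skip.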

> map/reduce doesn't run jobs with 0 maps
> ---------------------------------------
>
>                 Key: HADOOP-5850
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5850
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: mapred
>    Affects Versions: 0.20.0
>            Reporter: Owen O'Malley
>            Assignee: Vinod K V
>            Priority: Critical
>             Fix For: 0.20.1
>
>         Attachments: HADOOP-5850-20090519.txt
>
>
> Currently, the framework ignores jobs that have 0 maps. This is incorrect. 
> Many pipelines need the job to run (if nothing else, to create the output 
> directory!) so that subsequent jobs don't fail. Effectively, there will be no 
> map tasks and the reduce tasks should immediately set up the Reducer and 
> RecordWriter and then call close on both since there are no inputs to the 
> reduce. I believe it should just work if we remove the check...

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
