[ 
https://issues.apache.org/jira/browse/HADOOP-4830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod K V updated HADOOP-4830:
------------------------------

    Attachment: HADOOP-4830-20090106-2-svn.txt

Attaching a new patch. Incorporated the above review comments. Notes about 
particular points follow:

bq. We are iterating over the task list to get the number of running tasks in 
ControlledMapReduceJob.getRunningTasksCount(). We check if the task is running 
using TaskInProgress.isRunning(). This method of computation seems like it 
would not be different from JobInProgress.runningMaps() or 
JobInProgress.runningReduces(). Can you please check if there is a difference ?
As pointed out, there is no real difference. The original intention was to 
ensure that so and so number of tasks are really running on TaskTrackers. But 
on retrospection, I came to conclude that the number of tasks 
scheduled(JIP.runningTasks()) should suffice. This is because the number of 
tasks scheduled is same as the number of tasks running in the test 
environment(no lost trackers). Made changes to use JIP.runningTasks() regarding 
the same.

bq. This may be that we need to fix something in ivy.xml of the capacity 
scheduler. Copying the ivy.xml from streaming ran the test successfully.
Made changes to ivy.xml in capacity scheduler src to include the needed jars. 
But, as Hemanth also concurs during a discussion, it may become cumbersome in 
future to add every new jar that capacity scheduler might not need directly but 
still needed to be included because of the dependency on underlying 
projects/modules say mapred/hdfs. Will file a new issue to see if this can be 
addressed generally.

While running tests, found that some tests were timing out. The actual reason 
turned out to be HADOOP-4977. The test TestQueueCapacities might fail sometimes 
because of that and till that gets fixed.

ant test-patch results:
{code}
     [exec] +1 overall.  
     [exec] 
     [exec]     +1 @author.  The patch does not contain any @author tags.
     [exec] 
     [exec]     +1 tests included.  The patch appears to include 15 new or 
modified tests.
     [exec] 
     [exec]     +1 javadoc.  The javadoc tool did not generate any warning 
messages.
     [exec] 
     [exec]     +1 javac.  The applied patch does not increase the total number 
of javac compiler warnings.
     [exec] 
     [exec]     +1 findbugs.  The patch does not introduce any new Findbugs 
warnings.
     [exec] 
     [exec]     +1 Eclipse classpath. The patch retains Eclipse classpath 
integrity.
{code}

> Have end to end tests based on MiniMRCluster to verify that queue capacities 
> are honoured.
> ------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-4830
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4830
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: contrib/capacity-sched
>            Reporter: Vinod K V
>            Assignee: Vinod K V
>         Attachments: HADOOP-4830-20081222-svn.2, 
> HADOOP-4830-20081229-svn.txt, HADOOP-4830-20090106-2-svn.txt
>
>
> At present, we only have unit tests that make use of FakeTaskManager and that 
> only test the proper functionality of capacity scheduler in isolation. Many 
> issues unearthed recently proved that this is not enough and that it is 
> required to have end-to-end tests so that real JT is brought into the picture 
> and with that the interaction of the scheduler with JT. This issue along with 
> few other related jiras should automate and replace the end-to-end tests that 
> are now manually done by QA, using MiniMRCluster.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to