[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12864857#action_12864857
 ] 

Konstantin Boudnik commented on MAPREDUCE-1713:
-----------------------------------------------

There was a cross-posting of these comment, so repeating them to make sure they 
are not missed:
 - use {{$(TO_DO_GLOBAL_TMP_DIR)/localconf}}
instead of
+  {{<value>$HADOOP_HOME/conf/localconf</value>}}
because $HADOOP_HOME mighe have root only writable permissions in some setups.

- The value of this variable is _wrong_.
{noformat}
+  public final static String CONF_HADOOP_LOCAL_DIR =
+    "test.system.hadoop.local.confdir";
{noformat}
It is suppose to be 
{{test.system.hdrc.hadoop.local.confdir}} according to the naming convention 
and the name in the system-test.xml file

 - also this code seems very enefficient:
{noformat}
+  public void waitForClusterToStart() throws 
+      Exception {
+    int counter;
+    for (List<AbstractDaemonClient> set : daemons.values()) {
+      for (AbstractDaemonClient daemon : set) {
+        counter = 0;
+        while (counter < 30) {
{noformat}
on a cluster of like 10 nodes it takes about 5 minutes to restart a cluster 
(RPC tries to reconnect to a remove server for about 50 seconds or so). It is 
too much. Instead of doing this sequentially you need to start a number of 
threads (one per a daemon) which will be waiting for them to come up. All wait 
can be done in about a minute or so instead of 1 min x # nodes.


> Utilities for system tests specific.
> ------------------------------------
>
>                 Key: MAPREDUCE-1713
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1713
>             Project: Hadoop Map/Reduce
>          Issue Type: Task
>          Components: test
>            Reporter: Vinay Kumar Thota
>            Assignee: Vinay Kumar Thota
>         Attachments: 1713-ydist-security.patch, systemtestutils_MR1713.patch, 
> utilsforsystemtest_1713.patch
>
>
> 1.  A method for restarting  the daemon with new configuration.
>       public static  void restartCluster(Hashtable<String,Long> props, String 
> confFile) throws Exception;
> 2.  A method for resetting the daemon with default configuration.
>       public void resetCluster() throws Exception;
> 3.  A method for waiting until daemon to stop.
>       public  void waitForClusterToStop() throws Exception;
> 4.  A method for waiting until daemon to start.
>       public  void waitForClusterToStart() throws Exception;
> 5.  A method for checking the job whether it has started or not.
>       public boolean isJobStarted(JobID id) throws IOException;
> 6.  A method for checking the task whether it has started or not.
>       public boolean isTaskStarted(TaskInfo taskInfo) throws IOException;

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to