Few comments
# For topology output, you can change 'output' to 'write'
# '-recursive' can be changed to
       <tr>
         <td><code>-recursive</code></td>
          <td>Recursively traverse input paths for job history logs.</td>
          <td>This options should be used to inform the TraceBuilder to 
recursively scan the input paths and process all the files under under it. Note 
that, by default, only the history logs that are directly under the input 
folder are considered for generating the trace.
          </td>
        </tr>

# Testcase
/**
   * Test {...@link TraceBuilder#processInputArgument()}.
   * @throws IOException
   */
  @Test
  public void testTraceBuilderInputArgumentProcessing() throws IOException {
     List<String> inputPaths = createInput();

    // test with recursive turned OFF
    testTraceBuilderDeaultInputArgumentProcessing(inputPaths); // these can be 
private unit test cases

    // test with recursive turned ON
    testTraceBuilderRecursiveInputArgumentProcessing(inputPaths);
  }

  Also "testRecursive()" should be more detailed.

   W.r.t to the discussion we had, the data/directory creation step can be

# TraceBuilder: "List<Path> inputPaths = new LinkedList<Path>();" can be done 
only after "if (inStatuses == null || inStatuses.length == 0) {" succeeds.

I will review the main code changes to TraceBuilder at your desk
Amar

On 9/23/10 11:21 AM, "Ravi Gummadi (JIRA)" <[email protected]> wrote:



     [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ravi Gummadi updated MAPREDUCE-1978:
------------------------------------

    Attachment: 1978.patch

Attaching patch that adds the option "-recursive" to TraceBuilder.

With -recursive option, TraceBuilder generates trace by scanning all the job 
history logs recursively under the given path.

Please review the patch and provide your comments.

> [Rumen] TraceBuilder should provide recursive input folder scanning
> -------------------------------------------------------------------
>
>                 Key: MAPREDUCE-1978
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1978
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: tools/rumen
>            Reporter: Amar Kamat
>            Assignee: Ravi Gummadi
>         Attachments: 1978.patch
>
>
> Currently, {{TraceBuilder}} assumes that the input is either jobhistory files 
> or a folders containing jobhistory files directly underneath the specified 
> folder. There could be a use cases where the input folder could contain 
> sub-folders containing jobhistory files. Rumen should support such input 
> folders.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Reply via email to