[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amar Kamat updated MAPREDUCE-1912:
----------------------------------

    Attachment: mapreduce-1912-v1.1.patch

Attaching a patch that adds a Driver class to tools package and makes it the 
main-class of the tools.jar. Here is the output of running tools jar :
{code}
node amar$ java -jar build/hadoop-mapred-tools-0.22.0-SNAPSHOT.jar 
An example program must be given as the first argument.
Valid program names are:
  Distch: A tool to perform distributed chmod
  Distcp: A tool to copy data from one hdfs installation to another
  HadoopArchives: An archive creation utility.
  LogAnalyzer: An utility tool for archiving and analyzing hadoop logs.
  Rumen: A tool to extract MapReduce logs and analyze it.
{code}

Note that the lib-jars needed to run the tools are already added to the tool 
jar's (manifest) classpath. 

Added Rumen.java to the rumen tool which provides a single entry point into 
rumen. Here is the output after using the _Rumen_ option :
{code}
node amar$ java -jar build/hadoop-mapred-tools-0.22.0-SNAPSHOT.jar Rumen
An example program must be given as the first argument.
Valid program names are:
  -fold: A Rumen tool fold/scale the trace
  -generate-trace: A Rumen tool to generate traces from MapReduce logs.
{code}
Changed TraceBuilder and Folder to print the usage message when invoked with 
invalid arguments. Example 
{code}
node amar$ java -jar build/hadoop-mapred-tools-0.22.0-SNAPSHOT.jar Rumen -fold
Invalid paramters!!

Usage :
  Folder [generic-options] [-output-duration]  [-input-cycle] [-concentration] 
[-debug] [-allow-missorting] [-seed] [-skew-buffer-length] [-temp-directory] 
<input> <output>

Generic options supported are
-conf <configuration file>     specify an application configuration file
-D <property=value>            use value for given property
-fs <local|namenode:port>      specify a namenode
-jt <local|jobtracker:port>    specify a job tracker
-files <comma separated list of files>    specify comma separated files to be 
copied to the map reduce cluster
-libjars <comma separated list of jars>    specify comma separated jar files to 
include in the classpath.
-archives <comma separated list of archives>    specify comma separated 
archives to be unarchived on the compute machines.

The general command line syntax is
bin/hadoop command [genericOptions] [commandOptions]
{code}

tets-patch and ant-test passed on my box.


> [Rumen] Add a driver for Rumen tool 
> ------------------------------------
>
>                 Key: MAPREDUCE-1912
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1912
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: tools/rumen
>    Affects Versions: 0.22.0
>            Reporter: Amar Kamat
>            Assignee: Amar Kamat
>             Fix For: 0.22.0
>
>         Attachments: mapreduce-1912-v1.1.patch
>
>
> Rumen, as a tool, has 2 entry points :
> - Trace builder
> - Folder
> It would be nice to have a single driver program and have 'trace-builder' and 
> 'folder' as its options. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to