HeartSaVioR opened a new pull request #2624:
URL: https://github.com/apache/hadoop/pull/2624


   This PR proposes to add latency information in 
FileOutputCommitter.mergePaths, so that we can trace how much latency specific 
directory takes to merge.
   
   This information would provide some value on investigation when the commit 
in FileOutputCommitter takes huge time than expected. This class logged the 
call with from/to params in debug level which looks insufficient to trace the 
latency of specific directory due to recursive call.
   
   No test added as there's nothing to test actually. Manual test done via 
adding below in log4j.properties
   
   ```
   log4j.logger.org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter=DEBUG
   ```
   
   and ran tests in TestFileOutputCommitter.
   
   ```
   2021-01-18 16:14:03,475 DEBUG [main] output.FileOutputCommitter 
(FileOutputCommitter.java:mergePaths(461)) - Merging data from 
DeprecatedRawLocalFileStatus{path=file:/Users/jlim/WorkArea/JavaProjects/hadoop/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/target/test-dir/org.apache.hadoop.mapreduce.lib.output.TestFileOutputCommitter/_temporary/0/task_200707121733_0001_m_000000;
 isDirectory=true; modification_time=1610954043000; access_time=1610954043000; 
owner=; group=; permission=rwxrwxrwx; isSymlink=false; hasAcl=false; 
isEncrypted=false; isErasureCoded=false} to 
file:/Users/jlim/WorkArea/JavaProjects/hadoop/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/target/test-dir/org.apache.hadoop.mapreduce.lib.output.TestFileOutputCommitter
   ...
   2021-01-18 16:14:03,476 DEBUG [main] output.FileOutputCommitter 
(FileOutputCommitter.java:mergePaths(502)) - Merged data from 
file:/.../hadoop/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/target/test-dir/org.apache.hadoop.mapreduce.lib.output.TestFileOutputCommitter/_temporary/0/task_200707121733_0001_m_000000
 to 
file:/.../hadoop/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/target/test-dir/org.apache.hadoop.mapreduce.lib.output.TestFileOutputCommitter
 in 1 ms
   ```


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to