[
https://issues.apache.org/jira/browse/HDFS-15811?focusedWorklogId=545677&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-545677
]
ASF GitHub Bot logged work on HDFS-15811:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 01/Feb/21 22:46
Start Date: 01/Feb/21 22:46
Worklog Time Spent: 10m
Work Description: zehaoc2 opened a new pull request #2670:
URL: https://github.com/apache/hadoop/pull/2670
## NOTICE
Please create an issue in ASF JIRA before opening a pull request,
and you need to set the title of the pull request which starts with
the corresponding JIRA issue number. (e.g. HADOOP-XXXXX. Fix a typo in YYY.)
For more details, please see
https://cwiki.apache.org/confluence/display/HADOOP/How+To+Contribute
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 545677)
Remaining Estimate: 0h
Time Spent: 10m
> completeFile should log final file size
> ---------------------------------------
>
> Key: HDFS-15811
> URL: https://issues.apache.org/jira/browse/HDFS-15811
> Project: Hadoop HDFS
> Issue Type: Improvement
> Reporter: Zehao Chen
> Assignee: Zehao Chen
> Priority: Minor
> Time Spent: 10m
> Remaining Estimate: 0h
>
> Jobs, particularly hive queries by non-headless users, can create an
> excessive number of files (many hundreds of thousands). A single user's query
> can generate a sustained burst of 60-80% of all creates for tens of minutes
> or more and impact overall cluster performance. Adding the file size to the
> logline allows us to identify excessive tiny or large files.
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]