tomscut opened a new pull request #3538:
URL: https://github.com/apache/hadoop/pull/3538


   JIRA: [HDFS-16266](https://issues.apache.org/jira/browse/HDFS-16266)
   
   In our production environment, we occasionally encounter a problem where a 
user submits an abnormal computation task, causing a sudden flood of requests, 
which causes the queueTime and processingTime of the Namenode to rise very 
high, causing a large backlog of tasks.
   
   We usually locate and kill specific Spark, Flink, or MapReduce tasks based 
on metrics and audit logs. Currently, IP and UGI are recorded in audit logs, 
but there is no port information, so it is difficult to locate specific 
processes sometimes. Therefore, I propose that we add the port information to 
the audit log, so that we can easily track the upstream process.
   
   Currently, some projects contain port information in audit logs, such as 
Hbase and Alluxio. I think it is also necessary to add port information for 
HDFS audit logs.
   
   Before:
   
![before-hdfs-audit-log](https://user-images.githubusercontent.com/55134131/136699770-e5c07e90-0046-43ba-8c1c-a0e94a02657d.jpg)
   
   
   After:
   
![hdfs-audit-log](https://user-images.githubusercontent.com/55134131/136699624-13bb3375-398b-473f-9f8a-212bdd5ec765.jpg)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to