[
https://issues.apache.org/jira/browse/HIVE-21740?focusedWorklogId=251656&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-251656
]
ASF GitHub Bot logged work on HIVE-21740:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 31/May/19 09:33
Start Date: 31/May/19 09:33
Worklog Time Spent: 10m
Work Description: pvary commented on pull request #633: HIVE-21740:
Collect LLAP execution latency metrics
URL: https://github.com/apache/hive/pull/633#discussion_r289320836
##########
File path:
llap-tez/src/java/org/apache/hadoop/hive/llap/tezplugins/LlapTaskSchedulerService.java
##########
@@ -3154,4 +3160,22 @@ public void taskInfoUpdated(TezTaskAttemptID attemptId,
boolean isGuaranteed) {
+ attemptId + ", " + newState);
sendUpdateMessageAsync(ti, newState);
}
+
+ private void updateMetrics(TaskAttemptImpl taskAttempt) {
+ // Only do it for successful map tasks
+ if (!TaskAttemptState.SUCCEEDED.equals(taskAttempt.getState()) || !isMapTask(taskAttempt)) {
+ return;
+ }
+ // Check if this task was already assigned to a node
+ NodeInfo nodeInfo = knownTasks.get(taskAttempt).assignedNode;
+ if (nodeInfo == null) {
+ return;
+ }
+
+ metrics.addTaskLatency(nodeInfo.shortStringBase, taskAttempt.getFinishTime() - taskAttempt.getLaunchTime());
+ }
+
+ private boolean isMapTask(TaskAttemptImpl taskAttempt) {
Review comment:
I am not yet entirely familiar with this part of the code.
I thought that the vertex could tell us more, but getVertex is a package-private
method of TaskAttemptImpl, and getting the Vertex from the DAG would need something
like this (found in getTransitiveVertexOutputs):
DagInfo info = getContext().getCurrentDagInfo();
if (!(info instanceof DAG)) {
LOG.warn("DAG info is not a DAG");
return;
}
DAG dag = (DAG) info;
Vertex vertex = dag.getVertex(taskAttempt.getVertexID());
------------------
I found casting DagInfo to DAG shadier than relying on counters, but feel
free to disagree.
I am also open to any suggestions on where I should dig around more to find a
better solution!
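
For illustration only, the guarded lookup quoted above could be wrapped so that callers get an empty result instead of an early return with a warning. This is a minimal sketch under stated assumptions: DagInfo, Dag, Vertex, SimpleDag and findVertex below are hypothetical stand-ins written for this example, not the real Tez or Hive types, and the string vertex id replaces whatever id type the real code uses.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

// Hypothetical stand-ins for illustration only; these are NOT the real Tez types.
final class VertexLookupSketch {
    interface DagInfo { String getName(); }
    interface Vertex { String getName(); }
    // Assumed relationship: a Dag is a DagInfo that can also resolve vertices by id.
    interface Dag extends DagInfo { Vertex getVertex(String vertexId); }

    // Returns the vertex when info is a Dag that knows the id; otherwise an empty Optional.
    static Optional<Vertex> findVertex(DagInfo info, String vertexId) {
        if (!(info instanceof Dag)) {
            // Same guard as in the quoted snippet, without the logging.
            return Optional.empty();
        }
        return Optional.ofNullable(((Dag) info).getVertex(vertexId));
    }

    // Minimal in-memory Dag used only to exercise findVertex.
    static final class SimpleDag implements Dag {
        private final String name;
        private final Map<String, Vertex> vertices = new HashMap<>();

        SimpleDag(String name) { this.name = name; }

        void addVertex(String id, String vertexName) {
            vertices.put(id, () -> vertexName);
        }

        @Override public String getName() { return name; }
        @Override public Vertex getVertex(String vertexId) { return vertices.get(vertexId); }
    }
}
```

The cast is still there, so this does not remove the concern about relying on the concrete type behind DagInfo; it only keeps the check in one place.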
----------------------------------------------------------------
Issue Time Tracking
-------------------
Worklog Id: (was: 251656)
Time Spent: 5.5h (was: 5h 20m)
> Collect LLAP execution latency metrics
> --------------------------------------
>
> Key: HIVE-21740
> URL: https://issues.apache.org/jira/browse/HIVE-21740
> Project: Hive
> Issue Type: New Feature
> Reporter: Peter Vary
> Assignee: Peter Vary
> Priority: Major
> Labels: pull-request-available
> Attachments: HIVE-21740.2.patch, HIVE-21740.3.patch,
> HIVE-21740.4.patch, HIVE-21740.5.patch, HIVE-21740.patch
>
> Time Spent: 5.5h
> Remaining Estimate: 0h
>
> Collect metrics for LLAP task execution times