[ 
https://issues.apache.org/jira/browse/IMPALA-13624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17906660#comment-17906660
 ] 

Surya Hebbar commented on IMPALA-13624:
---------------------------------------

The current list representation is verbose, it contains the timestamps for all 
instances and all events without aggregation in the following manner.
{code}
AGGREGATION_NODE (id=47) [12 instances]:(Total: 1s728ms, non-child: 2.018ms, % 
non-child: 0.12%)
  Node Lifecycle Event Timeline[0]: 6s149ms
     - Open Started: 4s093ms (4s093ms)
     - Open Finished: 6s149ms (2s055ms)
     - First Batch Requested: 6s149ms (12.889us)
     - First Batch Returned: 6s149ms (2.147us)
     - Last Batch Returned: 6s149ms (1.221us)
     - Closed: 6s149ms (32.706us)
  Node Lifecycle Event Timeline[1]: 6s149ms
     - Open Started: 4s093ms (4s093ms)
     - Open Finished: 6s149ms (2s056ms)
     - First Batch Requested: 6s149ms (13.074us)
     - First Batch Returned: 6s149ms (2.056us)
     - Last Batch Returned: 6s149ms (148.000ns)
     - Closed: 6s149ms (35.318us)
  Node Lifecycle Event Timeline[2]: 6s149ms
     - Open Started: 4s093ms (4s093ms)
     - Open Finished: 6s149ms (2s055ms)
     - First Batch Requested: 6s149ms (13.658us)
     - First Batch Returned: 6s149ms (4.752us)
     - Last Batch Returned: 6s149ms (118.000ns)
     - Closed: 6s149ms (27.656us)
  Node Lifecycle Event Timeline[3]:
     ......
  ......
{code}
With this format, it becomes difficult to analyze a particular phase across 
instances.

Hence, a format that enhances aggregation of timestamps across instances 
provides better insights into overall performance.

The suggested aggregate representation is the following.
{code:java}
AGGREGATION_NODE (id=47) [13 instances] : (Total: 2s525ms, non-child: 0.000ns, 
% non-child: 0.00%)
  Node Lifecycle Event Timeline Summary : 4s094ms
   - Open Started: 2s850ms (2850ms)
       Div.    Min     Max    Avg   # Inst. Count
       20%    2000ms  2133ms 2050ms      3
       40%    2200ms  2300ms 2250ms      4
       60%    2400ms  2500ms 2450ms      2
       80%    2700ms  2800ms 2750ms      2
       100%   2850ms  2900ms 2875ms      2
   - Open Finished: 4s094ms (2s269ms)
       Div.    Min     Max    Avg   # Inst. Count
       20%    3000ms  3133ms 3050ms      3
      ......
   - First Batch Requested: 4s094ms (10.517us)
      ......
   - First Batch Returned: 4s094ms (1.695us)
      ......
   - Last Batch Returned: 4s094ms (104.000ns)
      ......
   - Closed: 4s094ms (20.656us)
      ......
{code}

> Implement textual representation of aggregate event sequences
> -------------------------------------------------------------
>
>                 Key: IMPALA-13624
>                 URL: https://issues.apache.org/jira/browse/IMPALA-13624
>             Project: IMPALA
>          Issue Type: New Feature
>            Reporter: Surya Hebbar
>            Assignee: Surya Hebbar
>            Priority: Major
>         Attachments: list_representation.txt
>
>
> Currently, the node lifecycle event timeline among fragment instances is 
> expressed in list format, without any form of summarization.
> Now, with the inclusion of aggregate metrics in the experimental profile in 
> JSON format, it is possible to display them in a comprehensive format, which 
> would help in skew analysis or other such cases.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to