Surya Hebbar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/21683 )

Change subject: IMPALA-13304: Include aggregate instance-level metrics within 
experimental profile(V2)
......................................................................


Patch Set 9:

(1 comment)

I agree with your opinion. It would be great to enhance EventSequence's 
constructor and the collection mechanism along with a cleanup and performance 
boost provided by switching to a completed version of profile V2. I was trying 
to mention something along those lines in the 3rd approach.

But, also please take a look at the current patchset.

In most profiles I had seen, missing events occurred in cases of medium to 
large number of nodes, and there always used to be at least one instance with a 
complete set of events. But still the problem of alignment and order occurred 
when a failed instance was first being used to generate the 'labels' map. This 
resulted in requiring a complete change in the order of all timestamps.

After understanding the current collection mechanism, I have now addressed a 
big part of this problem by maintaining a reference to the reordering of labels 
themselves. So, now instead of all timestamps, only instances with missing 
timestamps need reordering.

In case...

1. Some instances have missing events and there is at least a single instance 
with a complete event sequence, only those irregular instances are ordered and 
aligned efficiently.

2. There is no instance at all with a complete order, the initial ordering and 
alignment of labels and timestamps are kept, while skipping the inclusion of 
events at the end.

3. All instances are already in order, alignment and ordering is skipped with 
nearly zero overhead.

This resolves almost all cases of missing timestamps with very less computation.

I will still create a JIRA with the above example mentioning the unresolved 
corner cases and the possible improvement.

Please let me know, if I am overlooking any cases with this approach.

http://gerrit.cloudera.org:8080/#/c/21683/8/be/src/util/runtime-profile.cc
File be/src/util/runtime-profile.cc:

http://gerrit.cloudera.org:8080/#/c/21683/8/be/src/util/runtime-profile.cc@97
PS8, Line 97: const std::vector<std::string> 
AggregatedRuntimeProfile::AGGREGATED_INFO_STRINGS_REQUIRED
> line too long (107 > 90)
Done



--
To view, visit http://gerrit.cloudera.org:8080/21683
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I49e18a7a7e1288e3e674e15b6fc86aad60a08214
Gerrit-Change-Number: 21683
Gerrit-PatchSet: 9
Gerrit-Owner: Surya Hebbar <sheb...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Kurt Deschler <kdesc...@cloudera.com>
Gerrit-Reviewer: Riza Suminto <riza.sumi...@cloudera.com>
Gerrit-Reviewer: Surya Hebbar <sheb...@cloudera.com>
Gerrit-Comment-Date: Tue, 19 Nov 2024 20:12:08 +0000
Gerrit-HasComments: Yes

Reply via email to