Hello Bharath Vissapragada, Sahil Takiar, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

    http://gerrit.cloudera.org:8080/13786

to look at the new patch set (#2).

Change subject: IMPALA-7322: Add storage wait time to profile
......................................................................

IMPALA-7322: Add storage wait time to profile

Add metrics to record storage wait time for operations with
metadata load in catalog for hdfs, kudu and hbase tables.
Pass storage wait time from catalog to fe through thrift
and log total storage load time in query profile.
Storage-load-time is the amount of time spent loading metadata
from the underlying storage layer (e.g. S3, HDFS, Kudu, HBase),
which does not include the amount of time spending loading data
from HMS.

Testing: Ran queries that can trigger all of, none of or
some of the related tables loading.
Check query profile for each query.
Check catalog metrics for each table.
Add unit tests to test_observability.py
Ran all core tests.

Sample output:
Profile for Catalog V1: (storage-load-time is the added property):
After ran a hbase query (Metadata load finished is divided into several lines
because of limitation of commit message): Query Compilation: 4s401ms
  - Metadata load started: 661.084us (661.084us)
  - Metadata load finished. loaded-tables=1/1
      load-requests=1 catalog-updates=3
      storage-load-time=233ms: 3s819ms (3s819ms)
  - Analysis finished: 3s820ms (763.979us)
  - Value transfer graph computed: 3s820ms (63.193us)
Profile for Catalog V2:
    Frontend:
       - CatalogFetch.ColumnStats.Misses: 1
       - CatalogFetch.ColumnStats.Requests: 1
       - CatalogFetch.ColumnStats.Time: 16ms
       - CatalogFetch.Config.Misses: 1
       - CatalogFetch.Config.Requests: 1
       - CatalogFetch.Config.Time: 24ms
       - CatalogFetch.DatabaseList.Hits: 1
       - CatalogFetch.DatabaseList.Requests: 1
       - CatalogFetch.DatabaseList.Time: 0
       - CatalogFetch.Metadata.storage-load-time: 218ms
       - CatalogFetch.PartitionLists.Misses: 1
       - CatalogFetch.PartitionLists.Requests: 1
Catalog metrics(this sample is from a hdfs table):
    storage-metadata-load-duration:
       Count: 1
       Mean rate: 0.0085
       1 min. rate: 0.032
       5 min. rate: 0.1386
       15 min. rate: 0.177
       Min (msec): 111
       Max (msec): 111
       Mean (msec): 111.1802
       Median (msec): 111.1802
       75th-% (msec): 111.1802
       95th-% (msec): 111.1802
       99th-% (msec): 111.1802

Change-Id: I7447f8c8e7e50eb71d18643859d2e3de865368d2
---
M common/thrift/CatalogObjects.thrift
M common/thrift/CatalogService.thrift
M fe/src/main/java/org/apache/impala/analysis/StmtMetadataLoader.java
M fe/src/main/java/org/apache/impala/catalog/HBaseTable.java
M fe/src/main/java/org/apache/impala/catalog/HdfsTable.java
M fe/src/main/java/org/apache/impala/catalog/KuduTable.java
M fe/src/main/java/org/apache/impala/catalog/Table.java
M fe/src/main/java/org/apache/impala/catalog/local/CatalogdMetaProvider.java
M tests/query_test/test_observability.py
9 files changed, 160 insertions(+), 31 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/86/13786/2
--
To view, visit http://gerrit.cloudera.org:8080/13786
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I7447f8c8e7e50eb71d18643859d2e3de865368d2
Gerrit-Change-Number: 13786
Gerrit-PatchSet: 2
Gerrit-Owner: Yongzhi Chen <yc...@cloudera.com>
Gerrit-Reviewer: Bharath Vissapragada <bhara...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Sahil Takiar <stak...@cloudera.com>
Gerrit-Reviewer: Yongzhi Chen <yc...@cloudera.com>

Reply via email to