[ 
https://issues.apache.org/jira/browse/IMPALA-7731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16752396#comment-16752396
 ] 

ASF subversion and git services commented on IMPALA-7731:
---------------------------------------------------------

Commit 8c673ed6a00d1ed5c542a2311608993651bae850 in impala's branch 
refs/heads/master from Lars Volker
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=8c673ed ]

IMPALA-7731: Add Read/Exchange counters to profile

Selective scans (and by extension selective fragment instances)
take larger performance hits when reading data remotely. They can
be identified by a low ratio of bytes transmitted to bytes read
from HDFS.

This change adds several counters to the profile to make it easier to
identify queries based on their scan instance selectivity.

* TotalBytesSent - The total number of bytes sent by a query in
  exchange nodes. Does not include remote reads, data written to disk,
  or data sent to the client.

* TotalScanBytesSent - The total number of bytes sent by fragment
  instances that had a scan node in their plan.

* TotalInnerBytesSent - The total number of bytes sent by fragment
  instances that did not have a scan node in their plan, i.e. that
  received their input data from other instances through exchange nodes.

* ExchangeScanRatio - The ratio between TotalScanBytesSent and
  TotalBytesRead, i.e. the selectivity over all fragment instances that
  had a scan node in their plan. This counter is also added to each
  fragment instance.

* InnerNodeSelectivityRatio - The ratio between bytes sent by instances
  with a scan node in their plan and instances without a scan node in
  their plan. This indicates how well the inner nodes of the execution
  plan reduced the data volume.
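
To make the relationship between the counters concrete, here is a
minimal sketch of how they could be derived from per-instance byte
counts. It is not the code from the commit: InstanceStats, safe_ratio
and query_counters are hypothetical names, returning 0.0 for an empty
denominator is an assumption, and the direction of
InnerNodeSelectivityRatio (inner bytes over scan bytes) is an assumed
reading of the description above.

```python
from dataclasses import dataclass


@dataclass
class InstanceStats:
    """Per-fragment-instance byte counts (hypothetical structure)."""
    has_scan_node: bool
    bytes_read: int  # bytes read from storage by the instance's scan nodes
    bytes_sent: int  # bytes sent by the instance through exchange nodes


def safe_ratio(numerator, denominator):
    # Assumption: report 0.0 instead of failing when nothing was read or sent.
    return numerator / denominator if denominator > 0 else 0.0


def query_counters(instances):
    """Aggregate the query-level counters from a list of InstanceStats."""
    total_bytes_read = sum(i.bytes_read for i in instances)
    scan_sent = sum(i.bytes_sent for i in instances if i.has_scan_node)
    inner_sent = sum(i.bytes_sent for i in instances if not i.has_scan_node)
    return {
        "TotalBytesSent": scan_sent + inner_sent,
        "TotalScanBytesSent": scan_sent,
        "TotalInnerBytesSent": inner_sent,
        "ExchangeScanRatio": safe_ratio(scan_sent, total_bytes_read),
        # Assumed direction: bytes sent by inner instances divided by
        # bytes sent by scan instances.
        "InnerNodeSelectivityRatio": safe_ratio(inner_sent, scan_sent),
    }


# Made-up numbers: two scan instances and one inner instance.
example = [
    InstanceStats(has_scan_node=True, bytes_read=1000, bytes_sent=100),
    InstanceStats(has_scan_node=True, bytes_read=3000, bytes_sent=100),
    InstanceStats(has_scan_node=False, bytes_read=0, bytes_sent=50),
]
counters = query_counters(example)
```

With these made-up numbers the scan instances send 200 of the 4000
bytes they read, so ExchangeScanRatio is low and the query would count
as selective in the sense described above.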

Change-Id: Ife7ec78fe42558429c1cbe6e5eba79842bffd648
Reviewed-on: http://gerrit.cloudera.org:8080/12229
Reviewed-by: Lars Volker <[email protected]>
Tested-by: Impala Public Jenkins <[email protected]>


> Add ratio between scanned and transmitted bytes to fragment instances
> ---------------------------------------------------------------------
>
>                 Key: IMPALA-7731
>                 URL: https://issues.apache.org/jira/browse/IMPALA-7731
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Backend
>    Affects Versions: Impala 3.1.0
>            Reporter: Lars Volker
>            Assignee: Lars Volker
>            Priority: Major
>              Labels: observability, supportability
>         Attachments: Selective Scan Slowdowns.png
>
>
> Selective scans (and by extension selective fragment instances) take higher 
> performance hits when reading data remotely. They can be identified by a low 
> ratio between data being transmitted vs data being read from HDFS. To make it 
> easier to spot those instances we should add this ratio to each instance and 
> to the root of the execution profile.
>  !Selective Scan Slowdowns.png! 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
