[ https://issues.apache.org/jira/browse/IMPALA-5473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michael Ho updated IMPALA-5473:
-------------------------------
Issue Type: Task (was: Sub-task)
Parent: (was: IMPALA-5865)
> Make diagnosing network issues easier
> -------------------------------------
>
> Key: IMPALA-5473
> URL: https://issues.apache.org/jira/browse/IMPALA-5473
> Project: IMPALA
> Issue Type: Task
> Affects Versions: Impala 2.10.0
> Reporter: Henry Robinson
> Assignee: Michael Ho
> Priority: Major
>
> With the current metrics in the runtime profile, it's hard to debug queries
> that get low throughput from their exchanges.
> The following cases have different causes but similar symptoms (e.g. a high
> {{InactiveTimer}} in the exchange profile):
> 1. Downstream sender does not produce rows quickly (perhaps because *its*
> child instances do not produce rows quickly).
> 2. Downstream sender cannot _send_ rows quickly, perhaps because of network
> congestion.
> 3. Downstream sender does not start producing rows until some time after the
> upstream has started (captured by {{FirstBatchArrivalWaitTime}}).
> 4. Downstream sender does not close stream until some time after all rows are
> sent.
> We should improve these metrics so that the runtime profile clearly shows
> who is slow, and why. Distinguishing cases 1 and 2 is particularly
> important.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)