[ 
https://issues.apache.org/jira/browse/IMPALA-10964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

carolinchen updated IMPALA-10964:
---------------------------------
    Fix Version/s:     (was: Impala 4.0.1)

> Add query option that limits skew query in runtime
> --------------------------------------------------
>
>                 Key: IMPALA-10964
>                 URL: https://issues.apache.org/jira/browse/IMPALA-10964
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Backend
>    Affects Versions: Impala 4.0.0
>            Reporter: carolinchen
>            Assignee: carolinchen
>            Priority: Major
>
> Reject queries that  skew value is too big when executing the query. 
> Query skew refers to the situation in which some nodes are significantly 
> behind other nodes in the process of concurrent execution of SQL.
> There are two style skews:
> 1. Row skew, which may be caused by unreasonable sql or uneven task 
> distributions.
> 2. Time skew, which may be caused by different capability by execnode.
> Query skew will cause two effects:
> 1. For the skew node may execute slowly,  which will slow down the query 
> progress .
> 2. For the skew node may exhaust lots system resources( I/O, memory, rpc), 
> which will
> affect other queries in the same host/ query pool.
> When the skew value reach unreasonale range,  will affect the cluster status 
> and other running queries. This is a mechanism to protect the cluster from 
> potentially harmful queries(eg: mem_limit).
> In our environment, the SKEW_LIMIT query option is added to limit skewed 
> query.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to