[ 
https://issues.apache.org/jira/browse/IMPALA-6604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Armstrong updated IMPALA-6604:
----------------------------------
    Component/s: Infrastructure

> rethink stress test binary search cutoff point
> ----------------------------------------------
>
>                 Key: IMPALA-6604
>                 URL: https://issues.apache.org/jira/browse/IMPALA-6604
>             Project: IMPALA
>          Issue Type: Task
>          Components: Infrastructure
>    Affects Versions: Impala 2.11.0
>            Reporter: Michael Brown
>            Priority: Major
>
> While working on IMPALA-5721 I see that during a binary search, a lot of 
> queries that are meant to be tested with spilling are not set at any 
> different mem_limit than they were for their non-spillable counterparts. This 
> is due to both the default data scales of the stress test (TPCH 100, TPCDS 
> 300) and the fact that {{MEM_LIMIT_EQ_THRESHOLD_MB = 50}} in 
> {{concurrent_select.py}}. For queries that only use roughly 200 MB, a 50MB 
> threshold is a rather large percentage. This means the binary search is 
> "giving up" sooner than it could.
> There are some options:
> - drop MEM_LIMIT_EQ_THRESHOLD_MB completely and only rely on 
> MEM_LIMIT_EQ_THRESHOLD_PC, which is 0.975. For a 200 MB query, this would 
> effectively be a threshold of 5MB. This would lead to a more precise binary 
> search, but one that would take longer to complete.
> - keep MEM_LIMIT_EQ_THRESHOLD_MB but only put it in play for either larger 
> queries or larger data sets.
> - do not test at these data scales at all and instead move toward testing at 
> data scales of, say, TPCH 1000



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to