Qifan Chen created IMPALA-11163:
-----------------------------------

             Summary: To scan small dimensional tables, the number of nodes 
selected by FE can be less
                 Key: IMPALA-11163
                 URL: https://issues.apache.org/jira/browse/IMPALA-11163
             Project: IMPALA
          Issue Type: Improvement
            Reporter: Qifan Chen


In Impala, the FE (frontend) determines the number of exec nodes to use for a 
scan based on the number of local and remote nodes hosting the table's data 
blocks. For example, for a dimensional table, assume #local nodes = 3 and 
#remote nodes = 17. Then the number of exec nodes for the scan is 3 + 17 = 20, 
and the final value is min(20, #exec nodes in cluster).
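
The rule above can be written out as a short sketch. This is only an 
illustration of the arithmetic described in this issue, not the FE's actual 
code; the function name scan_exec_nodes and the cluster sizes of 50 and 10 are 
made up for the example.

```python
def scan_exec_nodes(num_local: int, num_remote: int, cluster_size: int) -> int:
    """Exec nodes chosen for a scan under the rule described in this issue.

    Hypothetical helper: the name and signature are illustrative only and
    do not correspond to any function in the Impala code base.
    """
    if min(num_local, num_remote, cluster_size) < 0:
        raise ValueError("node counts must be non-negative")
    # Every node hosting a data block (local or remote) is a candidate,
    # capped by the number of exec nodes available in the cluster.
    return min(num_local + num_remote, cluster_size)


# The example from the issue: 3 local + 17 remote nodes.
print(scan_exec_nodes(3, 17, cluster_size=50))  # 20
# With an assumed smaller cluster, the cap applies.
print(scan_exec_nodes(3, 17, cluster_size=10))  # 10
```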

In the case of a partitioned join(f, d), where f is the fact table and d is the 
dimensional table, the number of network connections opened from the join to 
the scan of table d can be reduced (say, to 2 instead of 20) by scanning d on 
fewer nodes. The system can therefore handle more queries.
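
To make the saving concrete, here is a rough sketch under a simplifying 
assumption that is not stated in the issue: each join node opens one 
connection to every node scanning d. The function name exchange_connections 
and the count of 20 join nodes are hypothetical, and the model is not a 
description of Impala's actual exchange implementation.

```python
def exchange_connections(num_scan_nodes: int, num_join_nodes: int) -> int:
    """Total connections between join nodes and the nodes scanning d.

    Assumed model (not taken from Impala): every join node opens one
    connection to every scan node, so the total is a simple product.
    """
    if num_scan_nodes < 0 or num_join_nodes < 0:
        raise ValueError("node counts must be non-negative")
    return num_scan_nodes * num_join_nodes


JOIN_NODES = 20  # hypothetical number of nodes running the join

# Scanning d on 20 nodes versus 2 nodes, as in the example above.
print(exchange_connections(20, JOIN_NODES))  # 400
print(exchange_connections(2, JOIN_NODES))   # 40
```

Under this assumed model, each join node opens 2 connections instead of 20, 
and the total for the query drops by the same factor of 10.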



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
