Sreeparna Bhabani created DRILL-7720:
----------------------------------------
Summary: Issue observed in performance of UNION ALL between
Parquet and DB query
Key: DRILL-7720
URL: https://issues.apache.org/jira/browse/DRILL-7720
Project: Apache Drill
Issue Type: Bug
Components: Functions - Drill
Affects Versions: 1.17.0
Reporter: Sreeparna Bhabani
{color:#26282a}Consider the below scenarios. The first 2 scenarios are giving
expected results in terms of performance. But we are not getting expected
performance for 3rd scenario which is UNION ALL with 2 different types of
datasets (Parquet + DB).{color}
{color:#26282a} {color}
*{color:#26282a}Scenario 1- Parquet UNION ALL Parquet{color}*
{color:#26282a}Individual execution time of 1st query - 5 secs{color}
{color:#26282a}Individual execution time of 2nd query - 5 secs{color}
{color:#26282a}UNION ALL of both queries execution time - 10 secs{color}
{color:#26282a} {color}
*{color:#26282a}Scenario 2 - DB query UNION ALL DB{color}*{color:#26282a}
*query*{color}
{color:#26282a}Individual execution time of 1st query - 5 secs{color}
{color:#26282a}Individual execution time of 2nd query - 5 secs{color}
{color:#26282a}UNION ALL of both queries execution time - 10 secs{color}
{color:#26282a} {color}
*{color:#26282a}Scenario 3 - Parquet UNION ALL DB query{color}*
{color:#26282a}Individual execution time of 1st query - 5 secs{color}
{color:#26282a}Individual execution time of 2nd query - 1 sec{color}
{color:#26282a}UNION ALL execution time - 20 secs{color}
{color:#26282a}Ideally the execution time should not be more than 6 secs.{color}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)