Fang-Yu Rao created IMPALA-14328:
------------------------------------

             Summary: Produce column lineage graph when Calcite is the planner
                 Key: IMPALA-14328
                 URL: https://issues.apache.org/jira/browse/IMPALA-14328
             Project: IMPALA
          Issue Type: Sub-task
          Components: Frontend
            Reporter: Fang-Yu Rao
            Assignee: Fang-Yu Rao


We found that Impala could not produce a structurally same column lineage graph 
when Calcite is the planner. For instance, consider the following query in 
[lineage.test|https://github.com/apache/impala/blob/master/testdata/workloads/functional-query/queries/QueryTest/lineage.test].
{code:java}
select * from (
  select tinyint_col + int_col x from functional.alltypes
  union all
  select sum(bigint_col) y from (select bigint_col from functional.alltypes) 
v1) v2
{code}

We expect Impala to produce a graph with 4 vertices and 1 edge. However, we 
only get one vertice when Calcite is the planner.
{code}
{
    "edges": [
        {
            "edgeType": "PROJECTION",
            "sources": [], 
            "targets": [
                0
            ]
        }
    ],  
    "endTime": 1755630445,
    "hash": "3968bd65781e9e856eaca799f4501513",
    "queryId": "fb443702ac817ecc:c432854600000000",
    "queryText": "select * from (   select tinyint_col + int_col x from 
functional.alltypes   union all   select sum(bigint_col) y from (select 
bigint_col from functional.alltypes) 
    "timestamp": 1755630437,
    "user": "fangyurao",
    "vertices": [
        {
            "id": 0,
            "vertexId": "X",
            "vertexType": "COLUMN"
        }
    ]   
}
{code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org

Reply via email to