Table aliases are ambiguous
---------------------------

                 Key: HIVE-1395
                 URL: https://issues.apache.org/jira/browse/HIVE-1395
             Project: Hadoop Hive
          Issue Type: Bug
            Reporter: Adam Kramer


Consider this query:

SELECT a.num FROM (
  SELECT a.num AS num, b.num AS num2
  FROM foo a LEFT OUTER JOIN bar b ON a.num=b.num
) a
WHERE a.num2 IS NULL;

...in this case, the table alias 'a' is ambiguous. It could be the outer table 
(i.e., the subquery result), or it could be the inner table (foo).

In the above case, Hive silently parses the outer reference to a as the inner 
reference. The result, then, is akin to:
SELECT foo.num FROM foo WHERE bar.num IS NULL. This is bad.

The bigger problem, however, is that Hive even lets people use the same table 
alias at multiple points in the query. We should simply throw an exception 
during the parse stage if there is any ambiguity in which table is which, just 
like we do if the column names are ambiguous.

Or, if for some reason we need people to be able to use 'a' to refer to 
multiple tables or subqueries, it would be excellent if the exact parsing 
structure were made clear and added to the wiki. In that case, I will file a 
separate bug JIRA to complain about how it should be different. :)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to