[ 
https://issues.apache.org/jira/browse/HIVE-384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12696173#action_12696173
 ] 

Namit Jain commented on HIVE-384:
---------------------------------

The patch has a few bug fixes:

1. The total number of jobs were counted wrongly, which made it very difficult 
to debug (specially big union queries).
2. The job name for a query was incorrect, , which made it very difficult to 
debug (specially big union queries).

These two are fixed in Driver.java

3. Union plans were very complex and inefficient - since the unions were not 
getting merged - the simple fix to to merge 
them in the SemanticAnalyzer - no need to check anything, union schema should 
be the same.

4. No special case for ReduceSink followed by Union -- the fix is in 
GenMRRedSink3


Added a bunch of new tests




> problem in union if the first subquery is a map-only job
> --------------------------------------------------------
>
>                 Key: HIVE-384
>                 URL: https://issues.apache.org/jira/browse/HIVE-384
>             Project: Hadoop Hive
>          Issue Type: Bug
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>         Attachments: hive.384.1.patch, hive.384.2.patch, hive.384.3.patch, 
> hive.384.4.patch
>
>
> Union needs special handling.
> explain 
>   select unionsrc.key, count(1) FROM (select s2.key as key, s2.value as value 
> from src1 s2
>                                         UNION  ALL  
>                                       select 'tst1' as key, cast(count(1) as 
> string) as value from src s1) 
>   unionsrc group by unionsrc.key;
> results in a null pointer exception

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to