Karen Coppage created HIVE-22902: ------------------------------------ Summary: Incorrect Spark/SparkOnYarn result for auto_sortmerge_join_16.q Key: HIVE-22902 URL: https://issues.apache.org/jira/browse/HIVE-22902 Project: Hive Issue Type: Bug Reporter: Karen Coppage
In files {code:java} auto_sortmerge_join_16.q.out_spark [TestMiniSparkOnYarnCliDriver] auto_sortmerge_join_16.q.out [TestSparkCliDriver] {code} at the first run of: {code:java} select a.key , a.value , b.value , 'day1' as day, 1 as pri from ( select key, value from bucket_big_n17 where day='day1' ) a left outer join ( select key, value from bucket_small_n17 where pri between 1 and 2 ) b on (a.key = b.key) {code} the output is (beginning line 444): {code:java} 0 val_0 val_0 day1 1 0 val_0 val_0 day1 1 0 val_0 val_0 day1 1 0 val_0 val_0 day1 1 0 val_0 val_0 day1 1 0 val_0 val_0 day1 1 103 val_103 val_103 day1 1 103 val_103 val_103 day1 1 103 val_103 val_103 day1 1 103 val_103 val_103 day1 1 169 val_169 NULL day1 1 172 val_172 val_172 day1 1 172 val_172 val_172 day1 1 172 val_172 val_172 day1 1 172 val_172 val_172 day1 1 374 val_374 NULL day1 1 {code} Result should not include NULLs. It should match llap/auto_sortmerge_join_16.q.out, beginning line 461: {code:java} 0 val_0 val_0 day1 1 0 val_0 val_0 day1 1 0 val_0 val_0 day1 1 0 val_0 val_0 day1 1 0 val_0 val_0 day1 1 0 val_0 val_0 day1 1 103 val_103 val_103 day1 1 103 val_103 val_103 day1 1 103 val_103 val_103 day1 1 103 val_103 val_103 day1 1 169 val_169 val_169 day1 1 169 val_169 val_169 day1 1 169 val_169 val_169 day1 1 169 val_169 val_169 day1 1 169 val_169 val_169 day1 1 169 val_169 val_169 day1 1 169 val_169 val_169 day1 1 169 val_169 val_169 day1 1 172 val_172 val_172 day1 1 172 val_172 val_172 day1 1 172 val_172 val_172 day1 1 172 val_172 val_172 day1 1 374 val_374 val_374 day1 1 374 val_374 val_374 day1 1 {code} Looks like this was changed in HIVE-20915. -- This message was sent by Atlassian Jira (v8.3.4#803005)