pig-user  

Union support

Arthur Zwiegincew
Mon, 29 Sep 2008 18:30:08 -0700

I've come across a very basic problem: unions simply do not work in Hadoop
mode.

data files:

$ cat ~/tmp/data
1 1
2 1
3 10

$ cat ~/tmp/data-2
4 20
5 20

Pig script:
data = load '/Users/arthur/tmp/data' as (x, y);
data2 = load '/Users/arthur/tmp/data-2' as (x, y);
both = union data, data2;
dump both;

result (only the rows from data-2 appear; the three rows from data are missing):
(4, 20)
(5, 20)
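
For comparison, the output I would expect is all five rows, in no particular
order. The following is a minimal Python sketch of that expectation, not of how
Pig executes the script: it assumes UNION behaves as plain bag concatenation,
and the helper names parse_rows and union are hypothetical, introduced only for
illustration.

```python
from collections import Counter


def parse_rows(text):
    """Split whitespace-separated lines into tuples of strings, skipping blank lines."""
    return [tuple(line.split()) for line in text.splitlines() if line.strip()]


def union(*relations):
    """Model UNION as bag concatenation (assumption): keep every tuple, duplicates included."""
    merged = []
    for relation in relations:
        merged.extend(relation)
    return merged


# Contents of ~/tmp/data and ~/tmp/data-2 from the report above.
data = parse_rows("1 1\n2 1\n3 10\n")
data2 = parse_rows("4 20\n5 20\n")

both = union(data, data2)

# Row order is not something a union promises, so compare as multisets.
expected = Counter(
    [("1", "1"), ("2", "1"), ("3", "10"), ("4", "20"), ("5", "20")]
)
assert Counter(both) == expected
```

Against this model, the Hadoop-mode output above is short by exactly the three
tuples of the first relation.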


I've opened a bug <https://issues.apache.org/jira/browse/PIG-390> on this,
but there has been no response.


Am I missing anything?


Thanks,
Arthur