Hi all,

my script looks like this:

A = LOAD 'left_rel.txt' AS (var1, var2);
B = LOAD 'right_rel.txt' AS (var1, var3);
C = JOIN A BY var1 LEFT OUTER, B BY var1;
D = FILTER C BY $2 is null;
DUMP D;

But when I dump D I get the error "Unable to store alias D".
I suppose there is something going wrong with the Filter vor null-values (is 
not null also doesn't work).
What I want to do is to filter for the tuples in A which do not find a Join 
partner in B
Input files are attached.

Does anybody know what's going on and how to fix this?
By the way, I'm using Cloudera Distribution for Hadoop 3 Beta with pig 0.5.0.

Thx in advance,
Alex

a       x
a       y
b       x
b       y
c       x
a       5
a       10
b       5
b       10

Reply via email to