Hi all, my script looks like this:

A = LOAD 'left_rel.txt' AS (var1, var2);
B = LOAD 'right_rel.txt' AS (var1, var3);
C = JOIN A BY var1 LEFT OUTER, B BY var1;
D = FILTER C BY $2 is null;
DUMP D;

But when I dump D, I get the error "Unable to store alias D". I suppose something is going wrong with the filter for null values ("is not null" doesn't work either).

What I want to do is filter for the tuples in A that do not find a join partner in B. The input files are attached.

Does anybody know what's going on and how to fix this?

By the way, I'm using Cloudera Distribution for Hadoop 3 Beta with Pig 0.5.0.

Thanks in advance,
Alex

a x
a y
b x
b y
c x

a 5
a 10
b 5
b 10
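
To make the expected result concrete, here is a minimal Python sketch of the anti-join I am after. It is not Pig code and says nothing about how Pig evaluates the script. The row layout of the two attached files (one two-field row per line) and the names left_rows, right_rows, and anti_join are assumptions made for illustration only.

```python
# Illustrative sketch only: plain Python, not Pig Latin.
# Assumed contents of the attached files, one (var1, value) row per line.
left_rows = [("a", "x"), ("a", "y"), ("b", "x"), ("b", "y"), ("c", "x")]
right_rows = [("a", "5"), ("a", "10"), ("b", "5"), ("b", "10")]


def anti_join(left, right):
    """Return the rows of `left` whose first field has no match in `right`."""
    right_keys = {row[0] for row in right}
    return [row for row in left if row[0] not in right_keys]


unmatched = anti_join(left_rows, right_rows)
print(unmatched)  # [('c', 'x')]
```

With the attached data, only the row with key c has no partner in the right-hand file, so that single tuple is what I expect D to contain. In the Pig script, the same row would carry nulls for B's fields after the left outer join.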
