Hi,
Below code work in pig MapReduce mode but doesn't in Tez. In the sense
mstat should return 'matches' but returns nothing when executed in tez mode.
cd1 = LOAD '/user/falcon/data/cd1.txt' USING PigStorage('\n') AS first:
chararray;
cd2 = LOAD '/user/falcon/data/cd1.txt' USING PigStorage('\n') AS second:
chararray;
combined = JOIN cd1 BY first FULL OUTER, cd2 BY second;
mstat = FOREACH combined GENERATE (
CASE
WHEN cd1.first == cd2.second THEN 'matches'
else 'mismatch'
END
) as match_status;
dump mstat;
Suggestions please.
Thanks,
Joel