Hi,

The code below works in Pig's MapReduce mode but not in Tez mode. Specifically,
mstat should return 'matches', but it returns nothing when executed in Tez mode.

cd1 = LOAD '/user/falcon/data/cd1.txt' USING PigStorage('\n') AS first: chararray;
cd2 = LOAD '/user/falcon/data/cd1.txt' USING PigStorage('\n') AS second: chararray;


combined = JOIN cd1 BY first FULL OUTER, cd2 BY second;


mstat = FOREACH combined GENERATE (
  CASE
    WHEN cd1.first == cd2.second THEN 'matches'
    ELSE 'mismatch'
  END
) AS match_status;

dump mstat;



Suggestions please.

Thanks,
Joel
