LOAD once, use multiple times

Something Something Mon, 03 Oct 2011 16:54:06 -0700

I have 3 Pig scripts that load data from the same log file but filter and
group it differently.  When I combine the 3 into one script that LOADs the
file only once, performance seems to improve, and now I am curious what
exactly LOAD does.
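
For concreteness, a combined script of this kind might look roughly like the
sketch below.  The path, schema, filter conditions, and output locations are
hypothetical placeholders chosen for illustration, not the actual scripts.

```pig
-- Hypothetical input path and schema, for illustration only.
logs = LOAD '/logs/access.log' USING PigStorage('\t')
       AS (ts:chararray, user:chararray, status:int, bytes:long);

-- Branch 1: count server errors per user.
errors     = FILTER logs BY status >= 500;
errors_grp = GROUP errors BY user;
errors_cnt = FOREACH errors_grp GENERATE group AS user, COUNT(errors) AS n;
STORE errors_cnt INTO '/out/errors_by_user';

-- Branch 2: total bytes per status code.
status_grp = GROUP logs BY status;
bytes_sum  = FOREACH status_grp GENERATE group AS status, SUM(logs.bytes) AS total;
STORE bytes_sum INTO '/out/bytes_by_status';

-- Branch 3: count large responses per user.
large     = FILTER logs BY bytes > 1000000L;
large_grp = GROUP large BY user;
large_cnt = FOREACH large_grp GENERATE group AS user, COUNT(large) AS n;
STORE large_cnt INTO '/out/large_by_user';
```

All three branches read from the single `logs` alias, so the script contains
one LOAD statement instead of three.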


How does LOAD work internally?  Does Pig save the results of the LOAD into
some separate location in HDFS?  Could someone please explain how LOAD
relates to MapReduce?  Thanks.
