Here's an explain I'm trying to grok.
The last Load is frustrating because the file isn't descriptive at all.
I have to scroll up and find out which file it was from which mapred job.
I the file had a descriptive name (like the name of the stored relation) or
how it was computed then it would be easier to understand.
I was also thinking that it would be advantageous to KEEP the temp files
around on disk for a post mortem to see what's actually ring stored
(assuming this feature doesn't already exist) and having the temp files
NAMED would be nice.
Kevin
|---foo_joined: Local Rearrange[tuple]{int}(false) - scope-158
| | |
| | Project[int][0] - scope-159
| |
| |---bar: New For Each(false)[bag] - scope-151
| | |
| | Project[int][0] - scope-149
| |
| |---bar_filtered: Filter[bag] - scope-144
| | |
| | Equal To[boolean] - scope-148
| | |
| | |---Project[long][2] - scope-145
| | |
| | |---Cast[long] - scope-147
| | |
| | |---Constant(0) - scope-146
| |
|
|---Load(file:/tmp/temp-1876184695/tmp-2019577453:org.apache.pig.impl.io.InterStorage)
- scope-249
--
Founder/CEO Spinn3r.com
Location: *San Francisco, CA*
Skype: *burtonator*
Skype-in: *(415) 871-0687*