I think that (2) is more compatible with the Hadoop mindset. Jobs shouldn't overwrite or delete output except for temp files.
On Wed, May 12, 2010 at 9:48 AM, Jeff Eastman <[email protected]> wrote:
> 2. Leave the Drivers as-is and just remove their output deletion.
> This puts a bit more burden on the user but makes constructing job
> chains with clustering computations possible.
