On Jul 17, 2013, at 1:41 PM, Edward Capriolo wrote:

> 
> "In my opinion we should limit the amount of tez related optimizations to
> and trunk" Refactoring that cleans up code is good, but as you have pointed
> out there won't be a tez release until sometime this fall, and this branch
> will be open for an extended period of time. Thus code cleanups and other
> tez related refactoring do not need to be disruptive to trunk.

I agree with this, though I suspect people will end up arguing about the 
meaning of "code cleanup" and "disruptive".  In my discussions with Gunther he 
said he was doing code cleanup and it was not disruptive.  You obviously 
disagreed.  I've already suggested that any future patches that break lots of 
others should have their checkin preceded by a few hours' notice that the patch 
will break things so others can say something if they are about to check in 
too.  I'd also be interested to hear from Gunther how much more general cleanup 
he feels is necessary on trunk.

> 
> I have another relevant question, which I already probably know the answer
> to, but I will ask it anyway.
> 
> Because tez is a YARN application, does this mean that Tez will be the
> first hive feature that will require YARN? (It seems like the answer is yes)

Yes, it will only work in the Hadoop 2.x world.  So obviously all this work 
needs to be done in a way that still allows Hive to use the MR execution engine 
in the Hadoop 1.x world.

Alan.
