Yes, agree that topic should be called [Discuss], more people might comment
on this topic which is a big one.

> The question isn't whether there are people running Hadoop 1.x, it is
> whether those people are likely to install a new version of Hive running
on
> their old Hadoop cluster.

Yes, question is whether users want to run latest Hive version on Hadoop
1.x clusters.

> would enable the rest of the Hive community to move forward and take
> advantage of the powerful new features in Hadoop 2.x.

We can still take advantage of powerful Hadoop 2 features without removing
support for Hadoop 1, which we are doing for some time via the Shim layer
(eg, Hdfs encryption, Hdfs extended ACL's, all in this category).  The
question here seems to be code complexity with Hive of the Shim layer.

I am not arguing to keep support for Hadoop-1 indefinitely in newer
versions of Hive and keep complexity forever, but I think its fair to give
users a fair warning via one full release cycle where Hadoop1 is formally
deprecated, instead of immediately removing it next release.  Especially as
Hadoop 2 is GA only a year and a few months.  Thoughts?

Thanks
Szehon


On Tue, Apr 28, 2015 at 9:19 PM, Lefty Leverenz <leftylever...@gmail.com>
wrote:

> This thread needs [DISCUSS] in the subject.
>
> -- Lefty
>
> On Tue, Apr 28, 2015 at 10:58 PM, Owen O'Malley <omal...@apache.org>
> wrote:
>
> > On Tue, Apr 28, 2015 at 5:25 PM, Szehon Ho <sze...@cloudera.com> wrote:
> >
> > > Hadoop 2 has been GA for a little over a year, there is still a fairly
> > > significant user base that uses hadoop-1 and would not be happy with
> this
> > > change.
> >
> >
> > The question isn't whether there are people running Hadoop 1.x, it is
> > whether those people are likely to install a new version of Hive running
> on
> > their old Hadoop cluster. As a point of reference, CDH 4 shipped Hadoop
> 2.0
> > and Hive 0.10 and HDP 2.0 shipped Hadoop 2.0 and Hive 0.12.
> >
> >
> > > Perhaps we can declare it deprecated in some future release (perhaps
> > 1.3),
> > > then another release to formally remove it, as was done in HBase.
> >
> >
> > Are you interested in managing a Hadoop 1.x compatible version of Hive?
> > Maybe we should call the new release Hive 2.0 and enable you to maintain
> > the Hive 1.x branch with backwards compatibility with Hadoop 1.x. That
> > would enable the rest of the Hive community to move forward and take
> > advantage of the powerful new features in Hadoop 2.x.
> >
> > .. Owen
> >
>

Reply via email to