On Fri, Jan 26, 2018 at 11:09 AM, Vihang Karajgaonkar <vih...@cloudera.com>
wrote:

> Oh, may be I misunderstood then. Are you suggesting that hive binaries will
> include metastore and there would also be another tar file which only
> includes metastore and its dependencies? So if someone just wants to
> install metastore they can just download metastore tar ball. For hive,
> since it comes pre-packaged with its own metastore the installation process
> remains the unchanged.
>
​Exactly.
​

>
> In case someone wants to build from source code I think it should be
> sufficient to provide mvn options (like -Pdist we have for hive) to build a
> metastore distribution. I think that would make sense.
>
​In the HIVE-17983 patch when you do ‘mvn install’ the standalone metastore
builds its packages (since package is one of the phases of install).  For
Hive as a whole doing 'mvn install -Pdist' builds the packages as before.
So I think we’ve covered the bases there.

Alan.​

>
> On Fri, Jan 26, 2018 at 10:42 AM, Alan Gates <alanfga...@gmail.com> wrote:
>
> > I’m not seeing the value of moving the jars to a different path.  If
> > someone just wants to run the metastore it will be possible to download
> it
> > alone.  I’m afraid moving things around will break some script somewhere
> > that we won’t think to test.
> >
> > Alan.
> >
> > On Fri, Jan 26, 2018 at 10:10 AM, Vihang Karajgaonkar <
> vih...@cloudera.com
> > >
> > wrote:
> >
> > > I think its important to keep supporting "hive --service metastore" for
> > > backwards compatibility. I also think it would be great to have some
> > > separation in the lib/bin if possible so that it is easier for someone
> > who
> > > downloads Hive but just wants to deploy a standalone-metastore to
> > identify
> > > which jars are needed to run metastore.
> > >
> > > On Thu, Jan 25, 2018 at 8:13 AM, Alexander Kolbasov <
> ak...@cloudera.com>
> > > wrote:
> > >
> > > > Alan,
> > > >
> > > > While continuing shipping HMS with Hive makes sense (at least for a
> > > while),
> > > > what do you think about somehow separating lib/bin directories
> created
> > in
> > > > the distro so Hive and metastore have a separate set of bin/lib dirs?
> > > >
> > > > - Alex
> > > >
> > > > On Wed, Jan 24, 2018 at 12:16 PM, Alan Gates <alanfga...@gmail.com>
> > > wrote:
> > > >
> > > > > In HIVE-17983 I have been working on packaing and start/stop
> scripts
> > > for
> > > > > the standalone metastore.  One question this brings up is how Hive
> > will
> > > > be
> > > > > released now, with or without the metastore.  I can see two
> options:
> > > > >
> > > > > 1) We continue to ship the metastore with Hive.  Not only does this
> > > mean
> > > > > the metastore code is in the Hive source code release and the
> > metastore
> > > > > jars are in the Hive binary distribution, but scripts like
> > metastore.sh
> > > > are
> > > > > still included in Hive's bin directory, so that Hive admins can
> still
> > > do
> > > > > 'hive --service metastore' to start the metastore.  I see the
> > following
> > > > > advantages of this:
> > > > > a) it is completely backwards compatible;
> > > > > b) it is what users would expect (I have installed many databases
> and
> > > > never
> > > > > been asked to first install a separate package for its data catalog
> > or
> > > > any
> > > > > other essential piece);
> > > > > c) this will still be the metastore's most frequent use case for at
> > > least
> > > > > the near future.
> > > > >
> > > > > The disadvantage is it is error prone when Hive is set up to
> connect
> > > to a
> > > > > separate metastore.  An operator could easily start the metastore
> in
> > > the
> > > > > Hive package, not realizing Hive is configured to connect to a
> > > different
> > > > > one.
> > > > >
> > > > > 2) We remove the metastore from the packaging completely like we do
> > > > Hadoop
> > > > > and require the user to install it separately.  The advantages and
> > > > > disadvantages of this exactly mirror those of option 1.
> > > > >
> > > > > Based on both the 80/20 rule (most metastore users will still be
> > single
> > > > > system Hive users) and the law of least astonishment (people
> expect a
> > > > > database to have a data catalog) I vote for option 1.
> > > > >
> > > > > Anyone strongly feel we should do 2 instead?
> > > > >
> > > > > Any other options I haven't considered?
> > > > >
> > > > > Alan.
> > > > >
> > > >
> > >
> >
>

Reply via email to