the total time taken by a snapshot should be bounded by the slowest machine.
we send a notification to each RS and each RS execute the snapshot
operation for each region.
can you track down what is slow in your case?

clone has to create a reference for each file, and that is a master
operation, and these calls may all go away if we change the layout in a
proper way instead of doing what is proposed in HBASE-13991.
Most of the time should be spent on the enableTable phase of the clone.


On Fri, Jul 10, 2015 at 4:36 PM, Jean-Marc Spaggiari <
jean-m...@spaggiari.org> wrote:

> Hi Rahul,
>
> Have you identified with it takes those 30 minutes? Is the table balances
> correctly across the servers? Form the logs, are you able to identify what
> takes that much time?
>
> JM
>
> 2015-07-10 18:46 GMT-04:00 rahul gidwani <rahul.gidw...@gmail.com>:
>
> > Hi Matteo,
> >
> > We do SKIP_FLUSH.  We have 1200+ regionservers with a single table with
> 60k
> > regions and 4 column families.  It takes around 30 minutes to snapshot
> this
> > table using manifests compared to just seconds doing this with hdfs.
> > Cloning this table takes considerably longer.
> >
> > For cases where someone would want to run Map/Reduce over snapshots this
> > could be much faster as we could take an hdfs snapshot and bypass the
> > clone.
> >
> > rahul
> >
> >
> > On Thu, Jul 9, 2015 at 12:20 PM, Matteo Bertozzi <
> theo.berto...@gmail.com>
> > wrote:
> >
> > > On Thu, Jul 9, 2015 at 12:12 PM, rahul gidwani <
> rahul.gidw...@gmail.com>
> > >  wrote:
> > >
> > > > Even with manifests (Snapshot V2) for our larger tables it can take
> > hours
> > > > to Snapshot and Clone a table.
> > > >
> > >
> > > on snapshot time the only thing that can take hours, is "flush".
> > > if you don't need that (which is what you get with hdfs snapshots) you
> > can
> > > specify SKIP_FLUSH => true
> > >
> > >
> > > Matteo
> > >
> > >
> > > On Thu, Jul 9, 2015 at 12:12 PM, rahul gidwani <
> rahul.gidw...@gmail.com>
> > > wrote:
> > >
> > > > HBase snapshots are a very useful feature. but it was implemented
> back
> > > > before there was the ability to snapshot via HDFS.
> > > >
> > > > Newer versions of Hadoop support HDFS snapshots.  I was wondering if
> > the
> > > > community would be interested in something like a Snapshot V3 where
> we
> > > use
> > > > HDFS to take these snapshots.
> > > >
> > > > Even with manifests (Snapshot V2) for our larger tables it can take
> > hours
> > > > to Snapshot and Clone a table.
> > > >
> > > > Would this feature be of use to anyone?
> > > >
> > > > thanks
> > > > rahul
> > > >
> > >
> >
>

Reply via email to