Thanks Ted, Huaxiang I'll move this to a Cloudera forum and comment back here if it appears unrelated.
On Wed, Oct 12, 2016 at 7:24 PM, Huaxiang Sun <h...@cloudera.com> wrote: > By the way, I forgot the forum link: http://community.cloudera.com < > http://community.cloudera.com/> > > Thanks, > Huaxiang > > > On Oct 12, 2016, at 10:10 AM, Huaxiang Sun <h...@cloudera.com> wrote: > > > > Hi Tim, > > > > I believe that it runs into an issue which is specific to cloudera > release we fixed recently. For details, could you discuss it in cdh forum? > > Copy me(h...@cloudera.com <mailto:h...@cloudera.com>) in the forum so I > can explain more there. > > > > Thanks, > > Huaxiang > > > >> On Oct 12, 2016, at 8:13 AM, Ted Yu <yuzhih...@gmail.com <mailto: > yuzhih...@gmail.com>> wrote: > >> > >> Have you looked at HBASE-16578 ? > >> > >> Cheers > >> > >>> On Oct 12, 2016, at 8:02 AM, Tim Robertson <timrobertson...@gmail.com > <mailto:timrobertson...@gmail.com>> wrote: > >>> > >>> Hi devs, > >>> [Had a quick chat with Lars G. about this and before opening a Jira I > >>> thought I'd raise it here first] > >>> > >>> We have just experienced data loss in HBase 1.0.0-cdh5.4.10. > >>> > >>> Before I dig into this further, I'd like to just ask if anyone has seen > >>> this before? > >>> > >>> The initial state was a table (tim_test) built with MOB support and a > few > >>> 10's million rows and 10's billions of cells. > >>> > >>> I wanted to rename the table to get this into production and did so as > >>> follows: > >>> > >>> snapshot 'tim_test', 'tim_test-snapshot' > >>> clone_snapshot 'tim_test-snapshot', 'prod_b_map' > >>> > >>> At this stage the application all looked good, and so I continued with: > >>> > >>> delete_snapshot 'tim_test-snapshot' > >>> disable 'tim_test' > >>> drop ‘tim_test’ > >>> > >>> Then things went... awry and data just started dropping out in the app. > >>> Before long, all MOB data seemingly is gone. > >>> > >>> The references in the new table MOB folder appear to point to the > source > >>> table (e.g. > >>> /hbase/mobdir/data/default/prod_b_map/ba42a2e8e9b669d9fc85bdfeed2f5f > 2a/EPSG_4326/tim_test=14bf5f1737ac65c34615ed97c0b7de06- > d41d8cd98f00b204e9800998ecf8427e20161006ff8baa70d21f408caefe8ae6318dfba2). > >>> > >>> The RS logs full of ERROR like: > >>> > >>> 2016-10-12 15:19:14,640 ERROR org.apache.hadoop.hbase. > regionserver.HStore: > >>> The mob file > >>> d41d8cd98f00b204e9800998ecf8427e20161006b59865f80e604781a79e > bfa2ddd66b48 > >>> could not be found in the locations > >>> [hdfs://ha-nn/hbase/mobdir/data/default/tim_test/ > 14bf5f1737ac65c34615ed97c0b7de06/EPSG_4326 <hdfs://ha-nn/hbase/mobdir/ > data/default/tim_test/14bf5f1737ac65c34615ed97c0b7de06/EPSG_4326>, > >>> hdfs://ha-nn/hbase/archive/data/default/tim_test/ > 14bf5f1737ac65c34615ed97c0b7de06/EPSG_4326] <hdfs://ha-nn/hbase/archive/ > data/default/tim_test/14bf5f1737ac65c34615ed97c0b7de06/EPSG_4326]> > >>> > >>> What I don't know is: > >>> 1) was this running a background task to copy the MOB data when the > >>> snapshot was cloned and I just deleted the source before the copy was > >>> complete? > >>> - or > >>> 2) when running "snapshot and clone" it just references the source MOB > >>> data until a (?) change? > >>> 3) snapshot and clone just doesn't support MOB? > >>> > >>> Can anyone shed some light on this easily before I dig into it please? > >>> > >>> While this situation exists (at least in 1.0.0) might it be good to get > >>> info about data loss for MOB tables into the snapshot clone docs? > >>> > >>> Thanks, > >>> Tim > > > >