On Sep 6, 2014, at 9:32 AM, Ted Yu <[email protected]> wrote:

> Can you post your hbase-site.xml ?
>
> /apps/hbase/data/archive/data/default is where HFiles are archived (e.g.
> when a column family is deleted, HFiles for this column family are stored
> here).
> /apps/hbase/data/data/default seems to be your hbase.rootdir
>

hbase.rootdir is defined to be hdfs://foo:8020/apps/hbase/data. I think that's the default that Ambari creates.

So the HFiles in the archive subdirectory have been discarded and can be deleted safely?

> bq. a problem I'm having running map/reduce jobs against snapshots
>
> Can you describe the problem in a bit more detail ?
>

I don't understand what I'm seeing well enough to ask an intelligent question yet. I appear to be scanning duplicate rows when using initTableSnapshotMapperJob, but I'm trying to get a better understanding of how this works, since it's probably just something I'm doing wrong.

Brian

> Cheers
>
>
> On Sat, Sep 6, 2014 at 6:09 AM, Brian Jeltema <
> [email protected]> wrote:
>
>> I'm trying to track down a problem I'm having running map/reduce jobs
>> against snapshots.
>> Can someone explain the difference between files stored in:
>>
>> /apps/hbase/data/archive/data/default
>>
>> and files stored in
>>
>> /apps/hbase/data/data/default
>>
>> (Hadoop 2.4, HBase 0.98)
>>
>> Thanks
