subject:"\[jira\] \[Commented\] \(HBASE\-50\) Snapshot of table"

[
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12897609#action_12897609
]

HBase Review Board commented on HBASE-50:
-

Message from: Chongxin Li lichong...@zju.edu.cn

bq. On 2010-08-10 21:34:40, stack wrote:
bq. src/main/java/org/apache/hadoop/hbase/master/HMaster.java, line 234
bq. http://review.cloudera.org/r/467/diff/3/?file=6015#file6015line234
bq.
bq. You might want to check the returns from these methods.

Snapshot root dir might already exist, e.g. created in previous start up, then
mkdirs would return false. But this is normal.

Here are previous comments from Todd:
you can just call mkdirs, I think, and it won't fail if it already exists

- Chongxin

---
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/467/#review823
---

Snapshot of table
-

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HBASE-50) Snapshot of table


[ 
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12897619#action_12897619
 ] 

HBase Review Board commented on HBASE-50:
-

Message from: Chongxin Li lichong...@zju.edu.cn


bq.  On 2010-08-11 11:32:27, stack wrote:
bq.   src/main/java/org/apache/hadoop/hbase/master/SnapshotMonitor.java, line 
166
bq.   http://review.cloudera.org/r/467/diff/3/?file=6019#file6019line166
bq.  
bq.   Want to remove this or enable the assertion?  One or the other I'd 
say rather than this.

remove it


bq.  On 2010-08-11 11:32:27, stack wrote:
bq.   src/main/java/org/apache/hadoop/hbase/master/SnapshotTracker.java, line 1
bq.   http://review.cloudera.org/r/467/diff/3/?file=6021#file6021line1
bq.  
bq.   Its a pity this class is named so.  We're about to bring in a new 
patch that redoes the zk stuff -- breaks it up into pieces each with a singular 
purpose; e.g. tracking root location or tracking meta region server -- and 
unfortunately the pattern is to name these purposed classes *Tracker.  There'll 
be a clash of this kinda Tracker and the new zk Trackers.  Not important, just 
saying in case you have another name in mind for this class.

I'll think about it. Any suggestion?


bq.  On 2010-08-11 11:32:27, stack wrote:
bq.   src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java, line 
2288
bq.   http://review.cloudera.org/r/467/diff/3/?file=6024#file6024line2288
bq.  
bq.   And flushing is disabled at this point too, right?  Compactions? 
(Good).

yes, flushing and compaction are disabled when snapshot.


bq.  On 2010-08-11 11:32:27, stack wrote:
bq.   src/main/java/org/apache/hadoop/hbase/regionserver/Store.java, line 944
bq.   http://review.cloudera.org/r/467/diff/3/?file=6027#file6027line944
bq.  
bq.   Do we have to do this down at the Store level?  Coud we move it up 
to Region or up to the RegionServer itself?  It already has an HTable instance.

This method is only used to delete old store files after compaction, is it 
appropriate to move it to Region?


bq.  On 2010-08-11 11:32:27, stack wrote:
bq.   src/test/java/org/apache/hadoop/hbase/master/TestSnapshot.java, line 382
bq.   http://review.cloudera.org/r/467/diff/3/?file=6037#file6037line382
bq.  
bq.   What about a test of restore from snapshot?  Is there one?  I dont' 
see it?

It's already in TestAdmin


bq.  On 2010-08-11 11:32:27, stack wrote:
bq.   src/main/java/org/apache/hadoop/hbase/util/FSUtils.java, line 713
bq.   http://review.cloudera.org/r/467/diff/3/?file=6032#file6032line713
bq.  
bq.   Does this stuff belong in here in this general utility class?  
Should it be polluted with References?  Should this stuff be over in io package 
where the Reference is or static methods on Reference?

OK, I'll move it to Reference


bq.  On 2010-08-11 11:32:27, stack wrote:
bq.   src/main/java/org/apache/hadoop/hbase/regionserver/StoreFile.java, line 
267
bq.   http://review.cloudera.org/r/467/diff/3/?file=6028#file6028line267
bq.  
bq.   Why you have to pass the reference?  It wasn't needed previously?

Previously there is only one type of reference file, i.e. reference after 
split. But right now there are another type of reference file for snapshot. We 
need to know the reference type to get the referred to file. 

This is used for table restored from snapshot.


bq.  On 2010-08-11 11:32:27, stack wrote:
bq.   src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java, line 
2355
bq.   http://review.cloudera.org/r/467/diff/3/?file=6024#file6024line2355
bq.  
bq.   If snapshot fails, do we have to do cleanup?

HRegions just quit the snapshot mode if fails. The master would be notified 
with failure and do the clean up work for the whole snapshot.


- Chongxin


---
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/467/#review840
---





 Snapshot of table
 -

 Key: HBASE-50
 URL: https://issues.apache.org/jira/browse/HBASE-50
 Project: HBase
  Issue Type: New Feature
Reporter: Billy Pearson
Assignee: Li Chongxin
Priority: Minor
 Attachments: HBase Snapshot Design Report V2.pdf, HBase Snapshot 
 Design Report V3.pdf, HBase Snapshot Implementation Plan.pdf, Snapshot Class 
 Diagram.png


 Havening an option to take a snapshot of a table would be vary useful in 
 production.
 What I would like to see this option do is do a merge of all the data into 
 one or more files stored in the same folder on the dfs. This way we could 
 save data in case of a software bug in hadoop or user code. 
 The other advantage would be to be able to export a table to multi locations. 
 Say I had a read_only table that must be online. I could take a

[jira] Commented: (HBASE-50) Snapshot of table


[ 
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12897660#action_12897660
 ] 

HBase Review Board commented on HBASE-50:
-

Message from: Chongxin Li lichong...@zju.edu.cn

---
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/467/
---

(Updated 2010-08-12 02:43:42.872855)


Review request for hbase.


Summary
---

This patch includes the first three sub-tasks of HBASE-50:
1. Start and monitor the creation of snapshot via ZooKeeper
2. Create snapshot of an HBase table
3. Some existing functions of HBase are modified to support snapshot

Currently snapshots can be created as expected, but can not be restored or 
deleted yet


This addresses bug HBASE-50.
http://issues.apache.org/jira/browse/HBASE-50


Diffs (updated)
-

  src/main/java/org/apache/hadoop/hbase/HConstants.java c77ebf5 
  src/main/java/org/apache/hadoop/hbase/HRegionInfo.java ee94690 
  src/main/java/org/apache/hadoop/hbase/HTableDescriptor.java 0d57270 
  src/main/java/org/apache/hadoop/hbase/SnapshotDescriptor.java PRE-CREATION 
  src/main/java/org/apache/hadoop/hbase/SnapshotExistsException.java 
PRE-CREATION 
  src/main/java/org/apache/hadoop/hbase/TablePartiallyOpenException.java 
PRE-CREATION 
  src/main/java/org/apache/hadoop/hbase/client/HBaseAdmin.java 8b01aa0 
  src/main/java/org/apache/hadoop/hbase/io/HalfStoreFileReader.java ed12e7a 
  src/main/java/org/apache/hadoop/hbase/io/HbaseObjectWritable.java 85fde3a 
  src/main/java/org/apache/hadoop/hbase/io/Reference.java 219203c 
  src/main/java/org/apache/hadoop/hbase/io/hfile/HFile.java b2de7e4 
  src/main/java/org/apache/hadoop/hbase/ipc/HBaseRPCProtocolVersion.java 
d4bcbed 
  src/main/java/org/apache/hadoop/hbase/ipc/HMasterInterface.java bd48a4b 
  src/main/java/org/apache/hadoop/hbase/mapreduce/LoadIncrementalHFiles.java 
1183584 
  src/main/java/org/apache/hadoop/hbase/master/BaseScanner.java 69eab39 
  src/main/java/org/apache/hadoop/hbase/master/DeleteSnapshot.java PRE-CREATION 
  src/main/java/org/apache/hadoop/hbase/master/HMaster.java e4bd30d 
  src/main/java/org/apache/hadoop/hbase/master/LogsCleaner.java 9d1a8b8 
  src/main/java/org/apache/hadoop/hbase/master/RestoreSnapshot.java 
PRE-CREATION 
  src/main/java/org/apache/hadoop/hbase/master/SnapshotLogCleaner.java 
PRE-CREATION 
  src/main/java/org/apache/hadoop/hbase/master/SnapshotMonitor.java 
PRE-CREATION 
  src/main/java/org/apache/hadoop/hbase/master/SnapshotOperation.java 
PRE-CREATION 
  src/main/java/org/apache/hadoop/hbase/master/SnapshotTracker.java 
PRE-CREATION 
  src/main/java/org/apache/hadoop/hbase/master/TableDelete.java 1153e62 
  src/main/java/org/apache/hadoop/hbase/master/TableSnapshot.java PRE-CREATION 
  src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java 6dc41a4 
  src/main/java/org/apache/hadoop/hbase/regionserver/HRegionServer.java 6a54736 
  src/main/java/org/apache/hadoop/hbase/regionserver/Snapshotter.java 
PRE-CREATION 
  src/main/java/org/apache/hadoop/hbase/regionserver/Store.java ae9e190 
  src/main/java/org/apache/hadoop/hbase/regionserver/StoreFile.java 757a50c 
  src/main/java/org/apache/hadoop/hbase/regionserver/ZKSnapshotWatcher.java 
PRE-CREATION 
  src/main/java/org/apache/hadoop/hbase/regionserver/wal/HLog.java 9593286 
  
src/main/java/org/apache/hadoop/hbase/replication/master/ReplicationLogCleaner.java
 4d4b00a 
  src/main/java/org/apache/hadoop/hbase/util/FSUtils.java 5cf3481 
  src/main/java/org/apache/hadoop/hbase/zookeeper/ZooKeeperWrapper.java 3827fa5 
  src/main/resources/hbase-default.xml b73f0ff 
  src/test/java/org/apache/hadoop/hbase/HBaseTestingUtility.java 4d09fe9 
  src/test/java/org/apache/hadoop/hbase/client/TestAdmin.java c9b78b9 
  src/test/java/org/apache/hadoop/hbase/master/TestLogsCleaner.java 8b7f60f 
  src/test/java/org/apache/hadoop/hbase/master/TestSnapshot.java PRE-CREATION 
  src/test/java/org/apache/hadoop/hbase/master/TestSnapshotFailure.java 
PRE-CREATION 
  src/test/java/org/apache/hadoop/hbase/regionserver/TestCompaction.java 
34b8044 
  src/test/java/org/apache/hadoop/hbase/regionserver/TestHRegion.java 98bd3e5 
  src/test/java/org/apache/hadoop/hbase/regionserver/TestHRegionSnapshot.java 
PRE-CREATION 
  src/test/java/org/apache/hadoop/hbase/regionserver/TestStoreFile.java 38ef520 
  src/test/java/org/apache/hadoop/hbase/regionserver/TestZKSnapshotWatcher.java 
PRE-CREATION 

Diff: http://review.cloudera.org/r/467/diff


Testing
---

Unit tests and integration tests with mini cluster passed.


Thanks,

Chongxin




 Snapshot of table
 -

 Key: HBASE-50
 URL: https://issues.apache.org/jira/browse/HBASE-50
 Project: HBase
  Issue Type: New Feature
Reporter: Billy

[jira] Commented: (HBASE-50) Snapshot of table

[
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12897666#action_12897666
]

HBase Review Board commented on HBASE-50:
-

Message from: Ted Yu ted...@yahoo.com

---
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/467/#review869
---

src/main/java/org/apache/hadoop/hbase/master/SnapshotTracker.java
http://review.cloudera.org/r/467/#comment2875

How about SnapshotWatcher ?

src/main/java/org/apache/hadoop/hbase/regionserver/Store.java
http://review.cloudera.org/r/467/#comment2874

I think putting this in Region is good.

src/main/java/org/apache/hadoop/hbase/regionserver/StoreFile.java
http://review.cloudera.org/r/467/#comment2876

Can we get to hbase root directly ?

- Ted

Snapshot of table
-

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HBASE-50) Snapshot of table

[
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12897732#action_12897732
]

HBase Review Board commented on HBASE-50:
-

Message from: Chongxin Li lichong...@zju.edu.cn

bq. On 2010-08-12 02:53:06, Ted Yu wrote:
bq. src/main/java/org/apache/hadoop/hbase/master/SnapshotTracker.java, line 1
bq. http://review.cloudera.org/r/467/diff/3/?file=6021#file6021line1
bq.
bq. How about SnapshotWatcher ?

Will it sound like this class implement the Watcher interface of ZK?

bq. On 2010-08-12 02:53:06, Ted Yu wrote:
bq. src/main/java/org/apache/hadoop/hbase/regionserver/StoreFile.java, line
283
bq. http://review.cloudera.org/r/467/diff/3/?file=6028#file6028line283
bq.
bq. Can we get to hbase root directly ?

Since this method is static, we probably need another parameter for root
directory?

- Chongxin

---
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/467/#review869
---

Snapshot of table
-

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HBASE-50) Snapshot of table

[
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12897735#action_12897735
]

HBase Review Board commented on HBASE-50:
-

Message from: Chongxin Li lichong...@zju.edu.cn

bq. On 2010-08-10 21:34:40, stack wrote:
bq. src/main/java/org/apache/hadoop/hbase/HTableDescriptor.java, line 673
bq. http://review.cloudera.org/r/467/diff/3/?file=6002#file6002line673
bq.
bq. This is fine for an hbase that is a fresh install but what about
case where the data has been migrated from an older hbase version; it won't
have this column family in .META. We should make a little migration script
that adds it or on start of new version, check for it and if not present,
create it.
bq.
bq. Chongxin Li wrote:
bq. That's right. But AddColumn operation requires the table disabled to
proceed, ROOT table can not be disabled once the system is started. Then how
could we execute the migration script or check and create it on start of new
version?

This can be done with a script when HBase is shutdown. The script scans the
root region with MetaUtils and add the column family SNAPSHOT to .META. table?

- Chongxin

---
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/467/#review823
---

Snapshot of table
-

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HBASE-50) Snapshot of table

[
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12897854#action_12897854
]

HBase Review Board commented on HBASE-50:
-

Message from: Ted Yu ted...@yahoo.com

---
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/467/#review874
---

src/main/java/org/apache/hadoop/hbase/master/BaseScanner.java
http://review.cloudera.org/r/467/#comment2888

Check return value.

src/main/java/org/apache/hadoop/hbase/master/DeleteSnapshot.java
http://review.cloudera.org/r/467/#comment2887

Should return value be checked ?

src/main/java/org/apache/hadoop/hbase/master/DeleteSnapshot.java
http://review.cloudera.org/r/467/#comment2886

Is there more to be done here ?

- Ted

Snapshot of table
-

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HBASE-50) Snapshot of table

[
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12897246#action_12897246
]

HBase Review Board commented on HBASE-50:
-

Message from: Chongxin Li lichong...@zju.edu.cn

bq. On 2010-08-10 22:40:31, Ted Yu wrote:
bq. src/main/java/org/apache/hadoop/hbase/master/HMaster.java, line 962
bq. http://review.cloudera.org/r/467/diff/3/?file=6015#file6015line962
bq.
bq. Moving crashed snapshots has two benefits:
bq. 1. future call to listSnapshots() wouldn't encounter IOException.
bq. 2. it's easy for user to get statistics on failed snapshots and
analyze them
bq.
bq. Or, if you log enough information when cleaning up the failed
snapshot.
bq.

What about snapshot fails when it is being created? Currently it is cleaned up
if exception occurs in HMaster.snapshot. Should we also move it to this
directory? Then for reference information sync, should we also take the
reference files of these failed snapshots into account?

- Chongxin

---
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/467/#review830
---

Snapshot of table
-

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HBASE-50) Snapshot of table

[
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12897250#action_12897250
]

HBase Review Board commented on HBASE-50:
-

Message from: Chongxin Li lichong...@zju.edu.cn

That's right. But AddColumn operation requires the table disabled to proceed,
ROOT table can not be disabled once the system is started. Then how could we
execute the migration script or check and create it on start of new version?

bq. On 2010-08-10 21:34:40, stack wrote:
bq. src/main/java/org/apache/hadoop/hbase/client/HBaseAdmin.java, line 899
bq. http://review.cloudera.org/r/467/diff/3/?file=6005#file6005line899
bq.
bq. Can the snapshot name be empty and then we'll make one up?

a default snapshot name? or a auto-generated snapshot name, such as creation
time?

bq. On 2010-08-10 21:34:40, stack wrote:
bq. src/main/java/org/apache/hadoop/hbase/client/HBaseAdmin.java, line 951
bq. http://review.cloudera.org/r/467/diff/3/?file=6005#file6005line951
bq.
bq. For restore of the snapshot, do you use loadtable.rb or Todd's new
bulkloading scripts?

Currently, NO...
Snapshot is composed of a list of log files and a bunch of reference files for
HFiles of the table. These reference files have the same hierarchy as the
original table and the name is in the format of 1239384747630.tablename,
where the front is the file name of the referred HFile and the latter is table
name for snapshot. Thus to restore a snapshot, just copy reference files (which
are just a few bytes) to the table dir, update the META and split the logs.
When this table is enabled, the system know how to replay the commit edits and
read such a reference file. Methods getReferredToFile, open in StoreFile are
updated to deal with this kind of reference files for snapshots.

At present, snapshot can only be restored to the table whose name is the same
as the one for which the snapshot is created. That the old table with the same
name must be deleted before restore a snapshot. That's what I do in unit test
TestAdmin. Restoring snapshot to a different table name has a low priority. It
has not been implemented yet.

bq. On 2010-08-10 21:34:40, stack wrote:
bq. src/main/java/org/apache/hadoop/hbase/io/Reference.java, line 50
bq. http://review.cloudera.org/r/467/diff/3/?file=6008#file6008line50
bq.
bq. Whats this? A different kind of reference?

Yes.. This is the reference file in snapshot. It references an HFile of the
original table.

bq. On 2010-08-10 21:34:40, stack wrote:
bq. src/main/java/org/apache/hadoop/hbase/master/SnapshotLogCleaner.java,
line 115
bq. http://review.cloudera.org/r/467/diff/3/?file=6018#file6018line115
bq.
bq. This looks like a class that you could write a unit test for?

Sure, I'll add another case in TestLogsCleaner.

bq. On 2010-08-10 21:34:40, stack wrote:
bq. src/main/java/org/apache/hadoop/hbase/master/RestoreSnapshot.java, line
130
bq. http://review.cloudera.org/r/467/diff/3/?file=6017#file6017line130
bq.
bq. If table were big, this could be prohibitively expensive? A
single-threaded copy of all of a tables data? We could compliment this with
MR-base restore, something that did the copy using MR?

This method is only used in RestoreSnapshot, where reference files of snapshot
are copied to the table dir. These reference files just contains a few bytes
instead of the table's data. Snapshots share the table data with the original
table and other snapshots. Do we still need a MR job?

bq. On 2010-08-10 21:34:40, stack wrote:
bq. src/main/java/org/apache/hadoop/hbase/master/BaseScanner.java, line 212
bq. http://review.cloudera.org/r/467/diff/3/?file=6013#file6013line212
bq.
bq. Why Random negative number? Why not just leave it blank?

If a blank value is used as the key, there would be only one item at last if it
is the first few times to scan the regions. Using random negative number
indicates all these regions have not been scanned before. If it has been
scanned, there would be a last checking time for it instead.

bq. On 2010-08-10 21:34:40, stack wrote:
bq.
src/main/java/org/apache/hadoop/hbase/mapreduce/LoadIncrementalHFiles.java,
line 251
bq. http://review.cloudera.org/r/467/diff/3/?file=6012#file6012line251
bq.
bq. Is this comment right?

I just renamed the Ranges to caps, comment was not

[jira] Commented: (HBASE-50) Snapshot of table

[
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12897257#action_12897257
]

HBase Review Board commented on HBASE-50:
-

Message from: Chongxin Li lichong...@zju.edu.cn

bq. On 2010-08-10 22:20:23, Ted Yu wrote:
bq. src/main/java/org/apache/hadoop/hbase/io/Reference.java, line 156
bq. http://review.cloudera.org/r/467/diff/3/?file=6008#file6008line156
bq.
bq. I think the current code is backward compatible. Boolean value of
true is interpreted as TOP, value of false is BOTTOM.
bq. Since ENTIRE is introduced, this code is not backward compatible.
bq.
bq. See:
bq.
http://download.oracle.com/javase/1.4.2/docs/api/java/io/DataOutput.html#writeBoolean%28boolean%29

Why it is not backward compatible when ENTIRE is introduces? The value for
ENTIRE is 2, different from the old written value of boolean.

- Chongxin

---
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/467/#review829
---

Snapshot of table
-

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HBASE-50) Snapshot of table

[
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12897474#action_12897474
]

HBase Review Board commented on HBASE-50:
-

Message from: Ted Yu ted...@yahoo.com

---
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/467/#review846
---

src/main/java/org/apache/hadoop/hbase/io/Reference.java
http://review.cloudera.org/r/467/#comment2846

I meant value of 2 cannot be correctly interpreted as boolean.

src/main/java/org/apache/hadoop/hbase/master/HMaster.java
http://review.cloudera.org/r/467/#comment2847

I think we need to limit the space consumed by failed snapshots.
This issue can be addressed by a future JIRA.

- Ted

Snapshot of table
-

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HBASE-50) Snapshot of table

[
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12896963#action_12896963
]

HBase Review Board commented on HBASE-50:
-

Message from: Ted Yu ted...@yahoo.com

---
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/467/#review799
---

src/main/java/org/apache/hadoop/hbase/master/HMaster.java
http://review.cloudera.org/r/467/#comment2704

Do we need to abort TableSnapshot processing in case of exception ?

src/main/java/org/apache/hadoop/hbase/master/HMaster.java
http://review.cloudera.org/r/467/#comment2707

If you create directory for failed snapshots, you can also add
listFailedSnapshots() method.

src/main/java/org/apache/hadoop/hbase/master/HMaster.java
http://review.cloudera.org/r/467/#comment2705

It would be better to move crashed snapshots into a separate directory
under snapshot rootDir.

- Ted

Snapshot of table
-

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HBASE-50) Snapshot of table

[
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12897042#action_12897042
]

HBase Review Board commented on HBASE-50:
-

Message from: Ted Yu ted...@yahoo.com

---
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/467/#review803
---

src/main/java/org/apache/hadoop/hbase/master/BaseScanner.java
http://review.cloudera.org/r/467/#comment2713

IOException should be handled so that synchronization of reference counts
isn't interrupted.

- Ted

Snapshot of table
-

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HBASE-50) Snapshot of table

[
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12897143#action_12897143
]

HBase Review Board commented on HBASE-50:
-

Message from: Chongxin Li lichong...@zju.edu.cn

bq. On 2010-08-10 10:49:06, stack wrote:
bq. src/main/java/org/apache/hadoop/hbase/HSnapshotDescriptor.java, line 36
bq. http://review.cloudera.org/r/467/diff/3/?file=6001#file6001line36
bq.
bq. Drop the H. Call it SnapshotDescriptor

Alright

bq. On 2010-08-10 10:49:06, stack wrote:
bq. src/main/java/org/apache/hadoop/hbase/HSnapshotDescriptor.java, line 41
bq. http://review.cloudera.org/r/467/diff/3/?file=6001#file6001line41
bq.
bq. If it is in under the snapshot directory maybe just call this file
snapshotinfo? Drop the '.' prefix. The '.' prefix is usually to demark
'special' files we don't want to consider as part of normal operation. In this
case, we are under a snapshot directory, already outside of 'normal' operation.

This is named following .regioninfo

bq. On 2010-08-10 10:49:06, stack wrote:
bq. src/main/java/org/apache/hadoop/hbase/HRegionInfo.java, line 373
bq. http://review.cloudera.org/r/467/diff/3/?file=6000#file6000line373
bq.
bq. How often is this called? If it happens alot, it could add up -- be
expensive.

Not too much actually. This method is only called in BaseScanner when reference
rows in META are checked and synchronized with the reference files. And right
now there would be at most five rows to be checked in one scan of META.
There is no region info saved in each reference row. Thus reference row which
is a combination of SNAPSHOT_PREFIX and region name is parsed to obtain the
region name. That's why we need this method.

- Chongxin

---
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/467/#review800
---

Snapshot of table
-

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HBASE-50) Snapshot of table

[
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12897144#action_12897144
]

HBase Review Board commented on HBASE-50:
-

Message from: Chongxin Li lichong...@zju.edu.cn

bq. On 2010-08-10 10:04:44, Ted Yu wrote:
bq. src/main/java/org/apache/hadoop/hbase/master/HMaster.java, line 962
bq. http://review.cloudera.org/r/467/diff/3/?file=6015#file6015line962
bq.
bq. It would be better to move crashed snapshots into a separate
directory under snapshot rootDir.

If so, probably we need the above method.
But why move crashed snapshots into a separate directory? It would be pretty
hard to recover a crashed snapshot.

bq. On 2010-08-10 10:04:44, Ted Yu wrote:
bq. src/main/java/org/apache/hadoop/hbase/master/HMaster.java, line 945
bq. http://review.cloudera.org/r/467/diff/3/?file=6015#file6015line945
bq.
bq. If you create directory for failed snapshots, you can also add
listFailedSnapshots() method.

Currently there is no directory for failed snapshots. If snapshot fails, it is
cleaned up and exception is thrown to notify the user.

bq. On 2010-08-10 10:04:44, Ted Yu wrote:
bq. src/main/java/org/apache/hadoop/hbase/master/HMaster.java, line 930
bq. http://review.cloudera.org/r/467/diff/3/?file=6015#file6015line930
bq.
bq. Do we need to abort TableSnapshot processing in case of exception ?

For snapshot which is created by TableSnapshot, the table must be offline and
snapshot is totally driven by the master. Region servers have no awareness of
such a snapshot. So in case of exception, just clean up the failed snapshot.
There is no need to abort the snapshot across the cluster.

Regarding SnapshotMonitor, it only monitors the snapshots which are created
across the region servers.

- Chongxin

---
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/467/#review799
---

Snapshot of table
-

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HBASE-50) Snapshot of table

[
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12897146#action_12897146
]

HBase Review Board commented on HBASE-50:
-

Message from: Ted Yu ted...@yahoo.com

---
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/467/#review829
---

src/main/java/org/apache/hadoop/hbase/io/Reference.java
http://review.cloudera.org/r/467/#comment2793

I think the current code is backward compatible. Boolean value of true is
interpreted as TOP, value of false is BOTTOM.
Since ENTIRE is introduced, this code is not backward compatible.

See:

http://download.oracle.com/javase/1.4.2/docs/api/java/io/DataOutput.html#writeBoolean%28boolean%29

- Ted

Snapshot of table
-

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HBASE-50) Snapshot of table