[ 
https://issues.apache.org/jira/browse/HBASE-50?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Li Chongxin updated HBASE-50:
-----------------------------

    Attachment: snapshot-src.zip
                snapshot-flowchart.png

One more problem:
When I run the test with standalone ZooKeeper, everything is fine, while 
running with MiniZooKeeperCluster a exception might be thrown, although the 
output is still correct. Below is the exception stack trace and it shoud be 
related with MiniZooKeeperCluster. I don't know if it is a configuration 
problem.

===================================================================================
2010-05-06 22:05:27,062 ERROR [Thread-1] server.NIOServerCnxn$Factory$1(82): 
Thread Thread[Thread-1,5,main] died
java.nio.channels.CancelledKeyException
 at sun.nio.ch.SelectionKeyImpl.ensureValid(SelectionKeyImpl.java:55)
 at sun.nio.ch.SelectionKeyImpl.interestOps(SelectionKeyImpl.java:64)
 at 
org.apache.zookeeper.server.NIOServerCnxn$SendBufferWriter.wakeup(NIOServerCnxn.java:927)
 at 
org.apache.zookeeper.server.NIOServerCnxn$SendBufferWriter.checkFlush(NIOServerCnxn.java:909)
 at 
org.apache.zookeeper.server.NIOServerCnxn$SendBufferWriter.flush(NIOServerCnxn.java:945)
 at java.io.BufferedWriter.flush(BufferedWriter.java:236)
 at java.io.PrintWriter.flush(PrintWriter.java:276)
 at org.apache.zookeeper.server.NIOServerCnxn$2.run(NIOServerCnxn.java:1089)
=====================================================================================


> Snapshot of table
> -----------------
>
>                 Key: HBASE-50
>                 URL: https://issues.apache.org/jira/browse/HBASE-50
>             Project: Hadoop HBase
>          Issue Type: New Feature
>            Reporter: Billy Pearson
>            Assignee: Li Chongxin
>            Priority: Minor
>         Attachments: snapshot-flowchart.png, snapshot-src.zip
>
>
> Havening an option to take a snapshot of a table would be vary useful in 
> production.
> What I would like to see this option do is do a merge of all the data into 
> one or more files stored in the same folder on the dfs. This way we could 
> save data in case of a software bug in hadoop or user code. 
> The other advantage would be to be able to export a table to multi locations. 
> Say I had a read_only table that must be online. I could take a snapshot of 
> it when needed and export it to a separate data center and have it loaded 
> there and then i would have it online at multi data centers for load 
> balancing and failover.
> I understand that hadoop takes the need out of havening backup to protect 
> from failed servers, but this does not protect use from software bugs that 
> might delete or alter data in ways we did not plan. We should have a way we 
> can roll back a dataset.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to