+1 on undelete. Move the deleted files into a special place and start actually deleting them when the disk space is low. If the space gets low only on few nodes, and the load is not high, the blocks from deleted files may be re-replicated to other nodes, -- or the replication factor of deleted files may be decreased.

Snapshot seems an orthogonal functionality with a whole bunch of independent requruirements
On Aug 8, 2006, at 10:18 AM, Yoram Arnon (JIRA) wrote:

support undelete, snapshots, or other mechanism to recover lost files
---------------------------------------------------------------------

                 Key: HADOOP-432
                 URL: http://issues.apache.org/jira/browse/HADOOP-432
             Project: Hadoop
          Issue Type: Improvement
            Reporter: Yoram Arnon


currently, once you delete a file it's gone forever.
most file systems allow some form of recovery of deleted files.
a simple solution would be an 'undelete' command.
a more comprehensive solution would include snapshots, manual and automatic, with scheduling options.

--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



Reply via email to