[
https://issues.apache.org/jira/browse/HADOOP-4058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628079#action_12628079
]
dhruba borthakur commented on HADOOP-4058:
------------------------------------------
I agree with Allen to a certain extent. The design has to be such that the
"retention-policy" be part of a layer written above the HDFS file system. I
would like the design to be similar to approach adopted by the
block-rebalancing code. The block-rebalacer is not in the namenode. It uses a
few primitives in the namenode, but mos tof the logic of what-to-move,
when-to-move, etc are handled outside the namenode.
i would like the "transparent-archiving-and-restoring" code to be designed
similarly. I think the implementation would be more than a "bunch of scripts"!
It would involve some server(s) that continuously monitors the access pattern
in a cluster and take appropriate action.
I like the idea of having a volume-abstraction. But I think this idea of
auto-archiving-and-restore is orthogonal to "volumes". Can you pl explain how
they are related?
> Transparent archival and restore of files from HDFS
> ---------------------------------------------------
>
> Key: HADOOP-4058
> URL: https://issues.apache.org/jira/browse/HADOOP-4058
> Project: Hadoop Core
> Issue Type: New Feature
> Components: dfs
> Reporter: dhruba borthakur
> Assignee: dhruba borthakur
>
> There should be a facility to migrate old files away from a production
> cluster. Access to those files from applications should continue to work
> transparently, without changing application code, but maybe with reduced
> performance. The policy engine that does this could be layered on HDFS
> rather than being built into HDFS itself.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.