[
https://issues.apache.org/jira/browse/HADOOP-9700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13704093#comment-13704093
]
Binglin Chang commented on HADOOP-9700:
---------------------------------------
Currently, SnapshotDiffReport(even SnapshotDiffInfo) lacks information to
support minimal diff transfer, I think there are 3 aspects at least:
# simple file/dir renaming, I think it can be handled without the help of
InodeID
# changing dir hierarchical structure, I'm not sure current diff report format
can express this kind of change, looks like diff of changing complex dir
hierarchical structure can only be archived by fully comparing whole InodeID
sets of the two snapshots.
# file append, like Luke mentioned.
SnapshotDiffInfo is private, we can either change SnapshotDiffInfo to public or
add more information to diff report.
Any suggestions for which direction to go?
> Snapshot support for distcp
> ---------------------------
>
> Key: HADOOP-9700
> URL: https://issues.apache.org/jira/browse/HADOOP-9700
> Project: Hadoop Common
> Issue Type: New Feature
> Components: tools/distcp
> Reporter: Binglin Chang
> Assignee: Binglin Chang
> Attachments: HADOOP-9700-demo.patch
>
>
> Add snapshot incremental copy ability to distcp, so we can do iterative
> consistent backup between hadoop clusters.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira