[
https://issues.apache.org/jira/browse/HBASE-5509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13221478#comment-13221478
]
Zhihong Yu commented on HBASE-5509:
-----------------------------------
I think Karthick may tell us something about the failure scenarios they have
handled through this approach :-)
SnapshotMR.FileReporter's run() method only sleeps. What purpose does
FileReporter serve ?
{code}
+ * Map method. Copies one file from source file system to destination.
{code}
The above is inaccurate: every file returned from
SnapshotUtilities.getStoreFileList() is copied.
> MR based copier for copying HFiles (trunk version)
> --------------------------------------------------
>
> Key: HBASE-5509
> URL: https://issues.apache.org/jira/browse/HBASE-5509
> Project: HBase
> Issue Type: Sub-task
> Components: documentation, regionserver
> Reporter: Karthik Ranganathan
> Assignee: Lars Hofhansl
> Fix For: 0.94.0, 0.96.0
>
> Attachments: 5509.txt
>
>
> This copier is a modification of the distcp tool in HDFS. It does the
> following:
> 1. List out all the regions in the HBase cluster for the required table
> 2. Write the above out to a file
> 3. Each mapper
> 3.1 lists all the HFiles for a given region by querying the regionserver
> 3.2 copies all the HFiles
> 3.3 outputs success if the copy succeeded, failure otherwise. Failed
> regions are retried in another loop
> 4. Mappers are placed on nodes which have maximum locality for a given region
> to speed up copying
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira