[ https://issues.apache.org/jira/browse/ACCUMULO-571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Christopher Tubbs updated ACCUMULO-571:
---------------------------------------
Assignee: John Vines
Reporter: John Vines (was: jv)
> MergeClone/BulkImport from existing table
> -----------------------------------------
>
> Key: ACCUMULO-571
> URL: https://issues.apache.org/jira/browse/ACCUMULO-571
> Project: Accumulo
> Issue Type: New Feature
> Components: client, tserver
> Reporter: John Vines
> Assignee: John Vines
>
> This is an idea that was recently brought to my attention. The use case is a
> user who wants to essentially clone a subset of a table into an existing
> table. Cloning does not currently allow this. The current option is to copy
> the files in HDFS and then bulk import the copies, since bulk import moves
> the files. This is pretty wasteful. Under the hood, the system can handle
> cross-linking between files like that. We just need a mechanism that
> provides the ability to assign a subset of data to another region.
> Potential uses include the above-mentioned one, as well as letting users
> bring fresh data into a table that was cloned and modified. There may be
> other cases, but I haven't fully thought out this problem space.
> The biggest problem with this is that it puts the onus on the user to
> ensure that data in the in-memory maps is flushed before moving, as well as
> to handle the possibility of duplicate data.
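
To make the idea in the description concrete, here is a minimal toy model in Python of what "cross-linking" a key range into an existing table could look like. It is only a sketch under stated assumptions: `ToyStore`, `link_range`, and the `F0000.rf`-style file names are hypothetical and are not Accumulo classes or APIs. The model assumes flushed files are immutable and that a table may hold a range-restricted reference to a file owned by another table. It also shows the two caveats raised above: unflushed data is not picked up, and duplicate keys are left for the user to resolve.

```python
class ToyStore:
    """Hypothetical model: immutable files that several tables may reference."""

    def __init__(self):
        self.file_data = {}  # file name -> {key: value}; never changed once written
        self.memory = {}     # table -> unflushed {key: value}
        self.refs = {}       # table -> list of (file name, low key, high key)

    def create(self, table):
        self.memory[table] = {}
        self.refs[table] = []

    def write(self, table, key, value):
        self.memory[table][key] = value

    def flush(self, table):
        """Turn the in-memory map into a new immutable file owned by the table."""
        if not self.memory[table]:
            return
        name = f"F{len(self.file_data):04d}.rf"
        self.file_data[name] = dict(self.memory[table])
        self.refs[table].append((name, None, None))  # None means unbounded
        self.memory[table].clear()

    def _entries(self, name, low, high):
        """Yield the entries of a file that fall inside one reference's bounds."""
        for k, v in self.file_data[name].items():
            if (low is None or k >= low) and (high is None or k <= high):
                yield k, v

    def link_range(self, src, dst, low, high):
        """Reference src's flushed files covering [low, high] from dst; copy nothing."""
        linked = []
        for name, lo, hi in self.refs[src]:
            if any(low <= k <= high for k, _ in self._entries(name, lo, hi)):
                new_lo = low if lo is None else max(lo, low)
                new_hi = high if hi is None else min(hi, high)
                self.refs[dst].append((name, new_lo, new_hi))
                linked.append(name)
        return linked

    def scan(self, table):
        """Return every visible (key, value) pair, duplicates included."""
        entries = list(self.memory[table].items())
        for name, lo, hi in self.refs[table]:
            entries.extend(self._entries(name, lo, hi))
        return sorted(entries)


store = ToyStore()
store.create("src")
store.create("dst")
for key, value in [("a", 1), ("b", 2), ("c", 3)]:
    store.write("src", key, value)
store.flush("src")
store.write("src", "d", 4)   # still in memory: invisible to link_range
store.write("dst", "b", 20)  # dst already holds its own version of "b"
store.flush("dst")
linked = store.link_range("src", "dst", "b", "d")
result = store.scan("dst")
```

In this run the destination table sees "b" twice (its own value and the linked one) and never sees "d", because that entry was not flushed before linking. No file is copied: the store still holds only the two files created by the flushes.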