[jira] [Commented] (CASSANDRA-4047) Bulk hinting

Jonathan Ellis (JIRA) Tue, 26 Mar 2013 21:49:20 -0700

    [ 
https://issues.apache.org/jira/browse/CASSANDRA-4047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13614924#comment-13614924
 ]


Jonathan Ellis commented on CASSANDRA-4047:
-------------------------------------------

bq. we can insert the range that failed into a system table on a replica that 
succeeded via thrift

You're right, we pretty much need to do that anyway, since we have no 
guarantees that whatever process the Hadoop job is running in will be around 
later.  It really needs to take the approach of "injecting" the range-hint into 
a C* node.
                
> Bulk hinting
> ------------
>
>                 Key: CASSANDRA-4047
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4047
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Core
>            Reporter: Brandon Williams
>            Assignee: Carl Yeksigian
>             Fix For: 2.0
>
>
> With the introduction of the BulkOutputFormat, there may be cases where 
> someone would like to tolerate node failures and have the job complete, but 
> afterwards since we streamed they have to repair or rely on read repair.  We 
> don't currently have any way of hinting streams, but a node could take a 
> snapshot before acknowledging the stream session, then remember to send the 
> files in the snapshot to the unavailable nodes when they come back up.  This 
> isn't quite ideal since of course the node may have compacted these files, 
> however it's much simpler than any sort of key tracking at this scale.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (CASSANDRA-4047) Bulk hinting

Reply via email to