[ 
https://issues.apache.org/jira/browse/HBASE-3247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12934192#action_12934192
 ] 

Steven Noels commented on HBASE-3247:
-------------------------------------

If this is really about robust (and distributed) pulling, wouldn't the RowLog 
mechanism as implemented in Lily be a more solid approach - to the point 
that RowLog would leave in-row-process-status data in a non-user-visible column?

I'm wondering, as I'm seeing a proliferation of alternative yet overlapping 
approaches to a certain number of issues (secondary indexes, change listening) 
which in the end could confuse new users.

> Changes API: API for pulling edits from HBase
> ---------------------------------------------
>
>                 Key: HBASE-3247
>                 URL: https://issues.apache.org/jira/browse/HBASE-3247
>             Project: HBase
>          Issue Type: Task
>            Reporter: stack
>
> Talking to Shay from Elastic Search, he was asking where the Changes API is 
> in HBase.  Talking more -- there was a bit of beer involved so apologies up 
> front -- he wants to be able to bootstrap an index and thereafter ask HBase 
> for changes since time t.  We thought he could tie into the replication 
> stream, but rather he wants to be able to pull rather than have it pushed to 
> him (in case he crashes, etc. so on recovery he can start pulling again from 
> last good edit received).  He could do the bootstrap with a Scan.  
> Thereafter, requests to pull from HBase would pass a marker of some sort.  
> HBase would then give out edits that came in after this marker, in batches, 
> along with an updated marker.
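
The pull protocol described in the quoted issue (pass a marker, receive a batch 
of later edits plus an updated marker) might look roughly like the sketch below. 
This is a toy in-memory model, not an HBase API: `ChangeLog`, `pull`, and `drain` 
are hypothetical names, and the marker is assumed to be a simple offset into an 
ordered edit log.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

Edit = Tuple[str, str]  # (row key, value); a hypothetical edit shape


@dataclass
class ChangeLog:
    """Toy stand-in for a server-side edit log (assumption, not HBase)."""
    edits: List[Edit] = field(default_factory=list)

    def append(self, row: str, value: str) -> None:
        self.edits.append((row, value))

    def pull(self, marker: int, batch_size: int) -> Tuple[List[Edit], int]:
        """Return up to batch_size edits after marker, plus the updated marker."""
        batch = self.edits[marker:marker + batch_size]
        return batch, marker + len(batch)


def drain(log: ChangeLog, marker: int, batch_size: int = 2) -> Tuple[List[Edit], int]:
    """Client side: keep pulling batches until none are left."""
    received: List[Edit] = []
    while True:
        batch, marker = log.pull(marker, batch_size)
        if not batch:
            return received, marker
        received.extend(batch)
```

A client that persists the last marker it received can crash and resume by 
calling `drain(log, saved_marker)`; edits before the marker are not resent.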

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
