[ 
https://issues.apache.org/jira/browse/SOLR-9922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15800658#comment-15800658
 ] 

Cao Manh Dat edited comment on SOLR-9922 at 1/5/17 7:55 AM:
------------------------------------------------------------

Updated patch:
- Fixed some bugs; all tests pass.
- HdfsUpdateLog now stores buffered updates in HDFS.
- Buffered updates will not be replayed if the node dies while buffering. The
update log detects that an old buffered-updates tlog exists and deletes it when
either
-- ensureLog() is called, or
-- a new buffer tlog is created.
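
To illustrate the cleanup rule in the last bullet, here is a minimal sketch. It is not the code in the patch: the class StaleBufferTlogSketch, the "buffer.tlog." file-name prefix, and the demo() helper are all hypothetical, and only the two trigger points (ensureLog() and creation of a new buffer tlog) come from the description above.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

/** Toy model of stale buffer-tlog cleanup; hypothetical, not Solr's UpdateLog. */
final class StaleBufferTlogSketch {
    private static final String BUFFER_PREFIX = "buffer.tlog."; // hypothetical naming scheme
    private final Path tlogDir;
    private Path liveBuffer;   // buffer tlog currently being written, if any
    private long nextId = 0;

    StaleBufferTlogSketch(Path tlogDir) { this.tlogDir = tlogDir; }

    /** Deletes every buffer tlog except the live one; old buffers are never replayed. */
    private void deleteOldBufferTlogs() throws IOException {
        List<Path> stale = new ArrayList<>();
        try (DirectoryStream<Path> files =
                 Files.newDirectoryStream(tlogDir, BUFFER_PREFIX + "*")) {
            for (Path p : files) {
                if (!p.equals(liveBuffer)) stale.add(p);
            }
        }
        for (Path p : stale) Files.delete(p);
    }

    /** First trigger: called before the current tlog is used. */
    void ensureLog() throws IOException {
        deleteOldBufferTlogs();
        // A real update log would open or create the current tlog here.
    }

    /** Second trigger: starting a new buffer makes any earlier buffer stale. */
    Path newBufferTlog() throws IOException {
        liveBuffer = null;
        deleteOldBufferTlogs();
        liveBuffer = tlogDir.resolve(BUFFER_PREFIX + nextId++);
        return Files.createFile(liveBuffer);
    }

    /** Simulates a restart after a crash that left a buffer tlog behind. */
    static boolean demo() {
        try {
            Path dir = Files.createTempDirectory("tlog-sketch");
            Path stale = Files.createFile(dir.resolve(BUFFER_PREFIX + "7"));
            StaleBufferTlogSketch log = new StaleBufferTlogSketch(dir);
            log.ensureLog();
            boolean staleGone = !Files.exists(stale);
            Path live = log.newBufferTlog();
            log.ensureLog();
            boolean liveKept = Files.exists(live);
            return staleGone && liveKept;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

In this sketch the leftover file is simply dropped rather than replayed, and a buffer that is still being written survives a later ensureLog() call.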

[[email protected]] My only concern is backward compatibility: the patch
removes FLAG_GAP and related code. But I think that when users upgrade Solr,
their nodes must be in the active state, so the new version won't have to read
old tlogs.
[~erickerickson] The CDCR tests pass, but I'm not sure the changes to
CdcrUpdateLog are correct. Could you take a look at the patch?


> Write buffering updates to another tlog
> ---------------------------------------
>
>                 Key: SOLR-9922
>                 URL: https://issues.apache.org/jira/browse/SOLR-9922
>             Project: Solr
>          Issue Type: Improvement
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Cao Manh Dat
>         Attachments: SOLR-9922.patch, SOLR-9922.patch
>
>
> Currently, we write buffered updates to the current tlog without applying
> them to the index, and then rely on log replay to apply them. But at the same
> time, other updates are also written to the current tlog and applied to the
> index.
> For example, during peersync, if new updates arrive at the replica, we end up
> with this tlog:
> tlog : old1, new1, new2, old2, new3, old3
> The old updates belong to peersync and are applied to the index.
> The new updates are buffered updates and are not applied to the index.
> Writing all of these updates to the same tlog makes the code base very
> complex. We should write buffered updates to a separate tlog file.
> Doing so will make our code base simpler. It will also make replica recovery
> for SOLR-9835 easier, because after peersync succeeds we can copy the new
> updates from the temporary file to the current tlog. For example:
> tlog : old1, old2, old3
> temporary tlog : new1, new2, new3
> -->
> tlog : old1, old2, old3, new1, new2, new3
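
The proposal in the description can be modelled with two in-memory lists. This is only a sketch under stated assumptions: SeparateBufferTlogSketch and all of its method names are hypothetical and are not part of Solr's UpdateLog API; only the update names (old1, new1, ...) come from the example above.

```java
import java.util.ArrayList;
import java.util.List;

/** Toy in-memory model of the proposal; hypothetical, not Solr's UpdateLog. */
final class SeparateBufferTlogSketch {
    private final List<String> tlog = new ArrayList<>();       // updates applied to the index
    private final List<String> bufferTlog = new ArrayList<>(); // buffered, not yet applied
    private boolean buffering = false;

    void startBuffering() { buffering = true; }

    /** Update fetched from a peer during peersync: applied and logged in the main tlog. */
    void addPeerSyncUpdate(String update) { tlog.add(update); }

    /** Incoming update: goes to the separate buffer tlog while buffering is on. */
    void addIncomingUpdate(String update) {
        if (buffering) {
            bufferTlog.add(update);
        } else {
            tlog.add(update);
        }
    }

    /** After peersync succeeds, append the buffered updates in arrival order. */
    void finishBuffering() {
        tlog.addAll(bufferTlog);
        bufferTlog.clear();
        buffering = false;
    }

    List<String> tlog() { return new ArrayList<>(tlog); }
    List<String> bufferTlog() { return new ArrayList<>(bufferTlog); }

    public static void main(String[] args) {
        SeparateBufferTlogSketch log = new SeparateBufferTlogSketch();
        log.startBuffering();
        // Same interleaved arrival order as in the description.
        log.addPeerSyncUpdate("old1");
        log.addIncomingUpdate("new1");
        log.addIncomingUpdate("new2");
        log.addPeerSyncUpdate("old2");
        log.addIncomingUpdate("new3");
        log.addPeerSyncUpdate("old3");
        log.finishBuffering();
        System.out.println("tlog : " + log.tlog());
        // prints: tlog : [old1, old2, old3, new1, new2, new3]
    }
}
```

Because the two kinds of updates never share a file in this model, the main tlog needs no per-entry marker to tell applied updates from buffered ones.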


