[jira] [Updated] (LUCENE-6966) Contribution: Codec for index-level encryption

2016-11-15 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-6966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated LUCENE-6966:
--
Attachment: Encryption Codec Documentation.pdf

Initial technical documentation.

> Contribution: Codec for index-level encryption
> --
>
> Key: LUCENE-6966
> URL: https://issues.apache.org/jira/browse/LUCENE-6966
> Project: Lucene - Core
>  Issue Type: New Feature
>  Components: modules/other
>    Reporter: Renaud Delbru
>  Labels: codec, contrib
> Attachments: Encryption Codec Documentation.pdf, LUCENE-6966-1.patch, 
> LUCENE-6966-2-docvalues.patch, LUCENE-6966-2.patch
>
>
> We would like to contribute a codec, developed as part of an engagement with a 
> customer, that enables the encryption of sensitive data in the index. We think 
> this could be of interest to the community.
> Below is a description of the project.
> h1. Introduction
> In comparison with approaches where all data is encrypted (e.g., file system 
> encryption, index output / directory encryption), encryption at the codec level 
> enables more fine-grained control over which blocks of data are encrypted. This 
> is more efficient since less data has to be encrypted. It also gives more 
> flexibility, such as the ability to select which fields to encrypt.
> Some of the requirements for this project were:
> * The performance impact of the encryption should be reasonable.
> * The user can choose which fields to encrypt.
> * Key management: During the life cycle of the index, the user can provide a 
> new version of their encryption key. Multiple key versions should co-exist in 
> one index.
> h1. What is supported?
> - Block tree terms index and dictionary
> - Compressed stored fields format
> - Compressed term vectors format
> - Doc values format (prototype based on an encrypted index output) - this 
> will be submitted as a separate patch
> - Index upgrader: command to upgrade all the index segments with the latest 
> key version available.
> h1. How is it implemented?
> h2. Key Management
> One index segment is encrypted with a single key version. An index can have 
> multiple segments, each one encrypted using a different key version. The key 
> version for a segment is stored in the segment info.
> The provided codec is abstract, and a subclass is responsible for providing an 
> implementation of the cipher factory. The cipher factory is responsible for 
> creating a cipher instance for a given key version.
> h2. Encryption Model
> The encryption model is based on AES/CBC with padding. The initialisation 
> vector (IV) is reused for performance reasons, but only on a per-format and 
> per-segment basis.
> While IV reuse is usually considered bad practice, the CBC mode is somewhat 
> resilient to IV reuse. The only "leak" of information this could lead to is 
> the ability to tell that two encrypted blocks of data start with the same 
> prefix. However, it is unlikely that two data blocks in an index segment will 
> start with the same data:
> - Stored Fields Format: Each encrypted data block is a compressed block 
> (~4kb) of one or more documents. It is unlikely that two compressed blocks 
> start with the same data prefix.
> - Term Vectors: Each encrypted data block is a compressed block (~4kb) of 
> terms and payloads from one or more documents. It is unlikely that two 
> compressed blocks start with the same data prefix.
> - Term Dictionary Index: The term dictionary index is encoded and encrypted 
> in a single data block.
> - Term Dictionary Data: Each data block of the term dictionary encodes a set 
> of suffixes. It is unlikely to have two dictionary data blocks sharing the 
> same prefix within the same segment.
> - DocValues: A DocValues file is composed of multiple encrypted data 
> blocks. It is unlikely to have two data blocks sharing the same prefix within 
> the same segment (each one encodes a list of values associated with a 
> field).
> To the best of our knowledge, this model should be safe. However, it would be 
> good if someone with security expertise in the community could review and 
> validate it. 
> h1. Performance
> We report here a performance benchmark we did on an early prototype based on 
> Lucene 4.x. The benchmark was performed on the Wikipedia dataset where all 
> the fields (id, title, body, date) were encrypted. Only the block tree terms 
> and compressed stored fields format were tested at that time. 
> h2. Indexing
> The indexing throughput slightly decreased and is roughly 15% less.

[jira] [Commented] (LUCENE-6966) Contribution: Codec for index-level encryption

2016-11-15 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-6966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15667213#comment-15667213
 ] 

Renaud Delbru commented on LUCENE-6966:
---

Is there still interest from the community in considering this patch as a 
contribution? Even if there are limitations, and it will therefore not cover 
all possible scenarios, we think this provides an initial set of core features 
and a good starting point for future work. We have received multiple personal 
requests for this patch, which shows there is real interest in such a 
feature. I am also attaching initial technical documentation that explains 
how to use the codec and clarifies its current known limitations.


[jira] [Commented] (LUCENE-6966) Contribution: Codec for index-level encryption

2016-05-16 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-6966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15284788#comment-15284788
 ] 

Renaud Delbru commented on LUCENE-6966:
---

I think the latest patch is ready for commit. Any objections?


[jira] [Updated] (LUCENE-6966) Contribution: Codec for index-level encryption

2016-05-06 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-6966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated LUCENE-6966:
--
Attachment: LUCENE-6966-2-docvalues.patch

Here is a separate patch (to apply on top of LUCENE-6966-2) for the doc values 
format. It is a prototype based on an encrypted index input/output. The 
encrypted index output writes encrypted data blocks of fixed size. Each data 
block has its own initialization vector.
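As a rough illustration of this scheme, here is a minimal sketch (not code from the patch; the class and method names are invented, and the fixed-size framing and actual index I/O are omitted). Each block is encrypted with AES/CBC under a fresh IV, and the IV is written in clear in the block header so the read side can initialise the decryption cipher:

```java
import java.nio.charset.StandardCharsets;
import java.security.SecureRandom;
import java.util.Arrays;
import javax.crypto.Cipher;
import javax.crypto.spec.IvParameterSpec;
import javax.crypto.spec.SecretKeySpec;

// Hypothetical sketch: each data block is encrypted with its own IV,
// and the IV is stored in clear in the block header.
public class EncryptedBlock {
  private static final int IV_LENGTH = 16; // AES block size
  private static final SecureRandom RANDOM = new SecureRandom();

  // Returns IV || ciphertext for one data block.
  public static byte[] encryptBlock(byte[] key, byte[] plainBlock) throws Exception {
    byte[] iv = new byte[IV_LENGTH];
    RANDOM.nextBytes(iv); // fresh IV per block
    Cipher cipher = Cipher.getInstance("AES/CBC/PKCS5Padding");
    cipher.init(Cipher.ENCRYPT_MODE, new SecretKeySpec(key, "AES"), new IvParameterSpec(iv));
    byte[] encrypted = cipher.doFinal(plainBlock);
    byte[] block = new byte[IV_LENGTH + encrypted.length];
    System.arraycopy(iv, 0, block, 0, IV_LENGTH);
    System.arraycopy(encrypted, 0, block, IV_LENGTH, encrypted.length);
    return block;
  }

  // Reads the IV back from the block header and decrypts the payload.
  public static byte[] decryptBlock(byte[] key, byte[] block) throws Exception {
    IvParameterSpec iv = new IvParameterSpec(block, 0, IV_LENGTH);
    Cipher cipher = Cipher.getInstance("AES/CBC/PKCS5Padding");
    cipher.init(Cipher.DECRYPT_MODE, new SecretKeySpec(key, "AES"), iv);
    return cipher.doFinal(block, IV_LENGTH, block.length - IV_LENGTH);
  }

  public static void main(String[] args) throws Exception {
    byte[] key = new byte[16]; // demo key only; real keys come from key management
    byte[] data = "some doc values block".getBytes(StandardCharsets.UTF_8);
    byte[] block = encryptBlock(key, data);
    System.out.println(Arrays.equals(decryptBlock(key, block), data)); // prints: true
  }
}
```

Because each block carries its own IV, blocks can be decrypted independently, which matches random access into a doc values file.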


[jira] [Commented] (LUCENE-6966) Contribution: Codec for index-level encryption

2016-04-28 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-6966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15262520#comment-15262520
 ] 

Renaud Delbru commented on LUCENE-6966:
---

Hi [~joel.bernstein],

{quote}
1) With the latest patch do you feel the major concerns have been addressed.
{quote}

Yes, the latest patch does not reuse IVs anymore, but instead uses a different IV 
for each data block. It also introduces an API so that one can control how IVs 
are generated and how the cipher is instantiated.
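Such an API could be sketched as follows (hypothetical code with invented names, not the actual classes from the patch): a factory that controls both cipher instantiation and IV generation, resolving key material from a key version.

```java
import java.security.GeneralSecurityException;
import java.security.SecureRandom;
import java.util.Map;
import javax.crypto.Cipher;
import javax.crypto.spec.IvParameterSpec;
import javax.crypto.spec.SecretKeySpec;

// Hypothetical pluggable factory: subclasses decide how ciphers are
// instantiated and how IVs are generated.
interface CipherFactory {
  Cipher newCipher(int mode, int keyVersion, byte[] iv) throws GeneralSecurityException;
  byte[] newIv();
}

// Example implementation using AES/CBC with a random 16-byte IV and a
// simple key-version-to-key map.
class AesCbcCipherFactory implements CipherFactory {
  private final Map<Integer, byte[]> keysByVersion; // key version -> raw AES key
  private final SecureRandom random = new SecureRandom();

  AesCbcCipherFactory(Map<Integer, byte[]> keysByVersion) {
    this.keysByVersion = keysByVersion;
  }

  @Override
  public Cipher newCipher(int mode, int keyVersion, byte[] iv) throws GeneralSecurityException {
    Cipher cipher = Cipher.getInstance("AES/CBC/PKCS5Padding");
    cipher.init(mode, new SecretKeySpec(keysByVersion.get(keyVersion), "AES"),
        new IvParameterSpec(iv));
    return cipher;
  }

  @Override
  public byte[] newIv() {
    byte[] iv = new byte[16]; // AES block size
    random.nextBytes(iv);
    return iv;
  }
}
```

A format implementation would call `newIv()` once per data block and store the IV alongside the encrypted payload.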

{quote}
2) From my initial reading of the patch it seemed like everything in the patch 
was pluggable. Does this need to be committed to be usable? Or can it be hosted 
on another project?

3) Because it's such a large patch and codecs change over time, does it present 
a burden to maintain with the core Lucene project? Along these lines is it more 
appropriate from a maintenance standpoint to be maintained by people who are 
really motivated to have this feature. Alfresco engineers would likely 
participate in an outside project if one existed.
{quote}

The patch follows the standard rules of Lucene codecs, so yes, it is fully 
pluggable. As with other codecs, the burden of maintaining it should be 
low. It is a set of Lucene *Format classes that are loosely coupled with 
other parts of the Lucene code. It will likely require maintenance only when 
Lucene's high-level Codec and Format APIs change.

The patch is large because we had to copy some of the original Lucene 
*Format classes, as those classes were final and not extensible. Updating them 
with the latest improvements made to the original classes might require a bit 
more effort, but in my experience it has so far been straightforward.


[jira] [Commented] (SOLR-6465) CDCR: fall back to whole-index replication when tlogs are insufficient

2016-04-20 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15249622#comment-15249622
 ] 

Renaud Delbru commented on SOLR-6465:
-

It would be great indeed to be able to simplify the code as you proposed, if we 
can rely on a bootstrap method. Below are some observations that might be 
useful.

One of the concerns I have relates to the default size limit of the update 
logs. By default, the update log keeps 10 tlog files or 100 records. This will 
likely be too small a buffer for cdcr, and there is a risk of a continuous 
cycle of bootstrapping replication. One could increase the values of 
"numRecordsToKeep" and "maxNumLogsToKeep" in solrconfig to accommodate the cdcr 
requirements, but these are additional parameters that the user needs to take 
into consideration, and they make configuration more complex. I am wondering if 
we could find a more appropriate default value for cdcr?
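For reference, these limits are set on the update log in solrconfig.xml; a configuration raising them might look like this (the values here are illustrative, not a recommendation):

{code:xml}
<updateLog class="solr.CdcrUpdateLog">
  <str name="dir">${solr.ulog.dir:}</str>
  <!-- keep more records / tlog files so cdcr has enough buffer -->
  <int name="numRecordsToKeep">1000</int>
  <int name="maxNumLogsToKeep">100</int>
</updateLog>
{code}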

The issue with increasing these limits in the original update log, compared to 
the cdcr update log, is that the original update log will not clean up old tlog 
files that are no longer necessary for replication (it will keep all tlogs up 
to the limit). For example, if one increases maxNumLogsToKeep to 100 and 
numRecordsToKeep to 1000, then the node will always keep 100 tlog files or 1000 
records in the update logs, even if all of them have been replicated to the 
target clusters. This might cause unexpected issues related to disk space or 
performance.

The CdcrUpdateLog managed this by allowing a variable-size update log that 
removes a tlog once it has been fully replicated. But that brings us back to 
where we were, with all the added management around the cdcr update log, i.e., 
buffer, lastprocessedversion, CdcrLogSynchronizer, etc.

h4. Cdcr Buffer

If we get rid of the cdcr update log logic, then we can also get rid of the 
Cdcr Buffer (buffer state, buffer commands, etc.).

h4. CdcrUpdateLog

I am not sure we can get rid of the CdcrUpdateLog entirely. It includes 
logic, such as the sub-reader and forward seek, that is necessary for sending 
batch updates. Maybe this logic can be moved into the UpdateLog?

h4. CdcrLogSynchronizer

I think it is safe to get rid of this. If a leader goes down while a cdcr 
reader is forwarding updates, the new leader will likely miss the tlogs 
necessary to resume where the cdcr reader stopped. But in that case, it can 
fall back to bootstrapping.

h4. Tlog Replication

If the tlogs are not replicated during a bootstrap, then the tlogs on the 
target will not be in sync. Could this cause any issues on the target cluster, 
e.g., in case of a recovery?
If the target is itself configured as a source (i.e., daisy chain), this will 
probably cause issues. The update logs will likely contain gaps, and it will be 
very difficult for the source to know that there is a gap. Therefore, it might 
forward incomplete updates. But this might be a feature we could drop, as 
suggested in one of your comments on the cwiki.

> CDCR: fall back to whole-index replication when tlogs are insufficient
> --
>
> Key: SOLR-6465
> URL: https://issues.apache.org/jira/browse/SOLR-6465
> Project: Solr
>  Issue Type: Sub-task
>Reporter: Yonik Seeley
> Attachments: SOLR-6465.patch, SOLR-6465.patch
>
>
> When the peer-shard doesn't have transaction logs to forward all the needed 
> updates to bring a peer up to date, we need to fall back to normal 
> replication.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-6465) CDCR: fall back to whole-index replication when tlogs are insufficient

2016-04-19 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247855#comment-15247855
 ] 

Renaud Delbru commented on SOLR-6465:
-

[~shalinmangar], would the goal be to rely solely on the bootstrapping method 
to replicate indexes, instead of on the updates forwarding method (i.e., cdcr 
update logs)? Or would it be a combination of bootstrapping and updates 
forwarding (based on the original update log, not the cdcr one)?




[jira] [Updated] (LUCENE-6966) Contribution: Codec for index-level encryption

2016-04-05 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-6966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated LUCENE-6966:
--
Attachment: LUCENE-6966-2.patch

This patch includes changes so that every encrypted data block uses a new IV. 
The IV is encoded in the header of the data block. The CipherFactory has been 
extended so that users can decide how to instantiate a cipher and how to 
generate new IVs.

The performance impact of storing and using a unique IV per block is minimal. 
The results of the benchmark below (performed on the full Wikipedia dataset) 
show that there is no significant difference in QPS:

{noformat}
Task                 QPS 6966-before (StdDev)   QPS 6966-after (StdDev)            Pct diff
Respell                    20.56 (11.2%)             19.18  (7.9%)      -6.7% ( -23% -   13%)
Fuzzy2                     33.98 (11.7%)             32.76 (11.0%)      -3.6% ( -23% -   21%)
Fuzzy1                     31.13 (11.2%)             30.05  (8.2%)      -3.5% ( -20% -   17%)
PKLookup                  125.62 (13.0%)            121.38  (8.8%)      -3.4% ( -22% -   21%)
Wildcard                   35.10 (11.7%)             34.36  (8.2%)      -2.1% ( -19% -   20%)
OrNotHighMed               25.90 (11.4%)             25.86 (10.5%)      -0.2% ( -19% -   24%)
OrNotHighHigh              15.26 (12.1%)             15.28 (10.8%)       0.2% ( -20% -   26%)
OrHighNotHigh               9.80 (12.4%)              9.82 (12.0%)       0.2% ( -21% -   28%)
OrHighNotMed               13.01 (13.4%)             13.06 (13.0%)       0.4% ( -22% -   30%)
LowTerm                   252.64 (12.5%)            253.90  (8.7%)       0.5% ( -18% -   24%)
OrHighNotLow               35.63 (13.5%)             35.83 (13.4%)       0.6% ( -23% -   31%)
Prefix3                    21.70 (13.3%)             21.86  (9.7%)       0.7% ( -19% -   27%)
MedTerm                    83.04 (11.7%)             83.73  (8.0%)       0.8% ( -16% -   23%)
AndHighHigh                15.41 (10.6%)             15.61  (7.9%)       1.3% ( -15% -   22%)
LowSloppyPhrase            68.89 (12.5%)             69.90  (9.0%)       1.5% ( -17% -   26%)
AndHighLow                294.02 (11.6%)            299.04  (8.3%)       1.7% ( -16% -   24%)
OrHighMed                  10.92 (14.4%)             11.13 (10.8%)       1.9% ( -20% -   31%)
OrHighHigh                  9.45 (14.6%)              9.63 (10.9%)       1.9% ( -20% -   32%)
MedSpanNear                69.01 (11.9%)             70.39  (8.4%)       2.0% ( -16% -   25%)
AndHighMed                 45.16 (12.4%)             46.17  (9.1%)       2.2% ( -17% -   27%)
HighTerm                   16.61 (13.3%)             16.99  (9.5%)       2.3% ( -18% -   28%)
LowPhrase                   3.03 (11.1%)              3.10  (9.2%)       2.3% ( -16% -   25%)
HighPhrase                 11.82 (13.0%)             12.10  (9.6%)       2.4% ( -17% -   28%)
MedPhrase                   7.49 (12.1%)              7.67  (9.1%)       2.4% ( -16% -   26%)
OrNotHighLow              424.80 (11.1%)            434.97  (8.2%)       2.4% ( -15% -   24%)
OrHighLow                  25.08 (12.0%)             25.70 (11.7%)       2.5% ( -18% -   29%)
HighSloppyPhrase            4.01 (13.7%)              4.11  (9.7%)       2.5% ( -18% -   30%)
MedSloppyPhrase             6.61 (12.9%)              6.78  (9.2%)       2.5% ( -17% -   28%)
LowSpanNear                15.52 (11.8%)             15.91  (8.6%)       2.5% ( -16% -   26%)
IntNRQ                      3.76 (16.4%)              3.86 (13.1%)       2.7% ( -23% -   38%)
HighSpanNear                4.40 (12.8%)              4.52  (9.1%)       2.8% ( -16% -   28%)
{noformat}

I took the opportunity to run another benchmark comparing this patch against 
Lucene's master. We can see that queries on low-frequency terms (probably 
because the dictionary lookup becomes more costly than the reading of the 
postings list) and queries that need to scan a large portion of the dictionary 
are the most impacted.

{noformat}
Task                 QPS master (StdDev)   QPS 6966 (StdDev)             Pct diff
Fuzzy1                  55.08 (15.5%)         35.89  (8.2%)   -34.8% ( -50% -  -13%)
Respell                 39.31 (16.9%)         28.47  (8.2%)   -27.6% ( -45% -   -3%)
Fuzzy2                  35.33 (16.8%)         28.21  (8.8%)   -20.1% ( -39% -    6%)
Wildcard                11.13 (18.9%)          9.95  (7.9%)   -10.6% ( -31% -   19%)
AndHighLow             304.79 (17.7%)        277.30 (10.4%)    -9.0% ( -31% -   23%)
OrNotHighLow           240.56 (16.8%)        226.64 (10.2%)    -5.8% ( -28% -   25%)
PKLookup               129.54 (20.1%)        122.47  (8.3%)    -5.5% ( -28% -   28%)
{noformat}

[jira] [Updated] (LUCENE-6966) Contribution: Codec for index-level encryption

2016-03-24 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-6966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated LUCENE-6966:
--
Attachment: LUCENE-6966-1.patch

This patch contains the current state of the codec for index-level encryption. 
It is up to date with the latest version of the lucene-solr master branch. The 
patch does not yet include the ability for users to choose which cipher to 
use; I'll submit a new patch tackling this in the coming week.
The full Lucene test suite has been executed against this codec using the 
command:
{code}
ant -Dtests.codec=EncryptedLucene60 test
{code}
Only one test fails, TestSizeBoundedForceMerge#testByteSizeLimit, which is 
expected: this test is incompatible with the codec.

The doc values format (prototype based on an encrypted index output) is not 
included in this patch, and will be submitted as a separate patch in the 
coming days.


[jira] [Commented] (LUCENE-6966) Contribution: Codec for index-level encryption

2016-03-21 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-6966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15204549#comment-15204549
 ] 

Renaud Delbru commented on LUCENE-6966:
---

Karl, the patch will not include a ready-to-use FSDirectory implementation, but 
the doc values format is based on an encrypted index input and output 
implementation which can easily be reused in an implementation of FSDirectory.
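For illustration, the same layering can be sketched with plain java.io streams and the JDK's CipherOutputStream/CipherInputStream. This is only an analogue of the idea under a hypothetical class name, not the patch's actual IndexOutput/IndexInput implementation: every byte written through the wrapper is transparently encrypted, and reads are transparently decrypted.

```java
import javax.crypto.Cipher;
import javax.crypto.CipherInputStream;
import javax.crypto.CipherOutputStream;
import javax.crypto.spec.IvParameterSpec;
import javax.crypto.spec.SecretKeySpec;
import java.io.InputStream;
import java.io.OutputStream;

// Illustrative analogue of an encrypted index output/input built on
// plain java.io streams (the patch wraps Lucene's IndexOutput/IndexInput
// instead, but the layering is the same).
public class EncryptedStreams {

    /** Wraps an output stream so all written bytes are AES/CBC-encrypted. */
    public static OutputStream encrypting(OutputStream out, SecretKeySpec key,
                                          IvParameterSpec iv) throws Exception {
        Cipher c = Cipher.getInstance("AES/CBC/PKCS5Padding");
        c.init(Cipher.ENCRYPT_MODE, key, iv);
        return new CipherOutputStream(out, c);
    }

    /** Wraps an input stream so all read bytes are transparently decrypted. */
    public static InputStream decrypting(InputStream in, SecretKeySpec key,
                                         IvParameterSpec iv) throws Exception {
        Cipher c = Cipher.getInstance("AES/CBC/PKCS5Padding");
        c.init(Cipher.DECRYPT_MODE, key, c.getParameters() == null ? iv : iv);
        return new CipherInputStream(in, c);
    }
}
```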


[jira] [Commented] (LUCENE-6966) Contribution: Codec for index-level encryption

2016-03-19 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-6966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15197431#comment-15197431
 ] 

Renaud Delbru commented on LUCENE-6966:
---

Thanks for all of the feedback. Based on everyone's comments, it seems that 
different encryption algorithms might be better suited to different situations. 
Rather than implement a one-size-fits-all solution, it may be better not to 
enforce any single cipher, and instead leave users the flexibility to choose 
the cipher they find most appropriate.

If everyone is okay with this approach, I will update the code appropriately.


[jira] [Commented] (LUCENE-6966) Contribution: Codec for index-level encryption

2016-01-08 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-6966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15089007#comment-15089007
 ] 

Renaud Delbru commented on LUCENE-6966:
---

I agree with you that if we add encryption to Lucene, it should always be 
secure. That's why I opened up the discussion with the community, in order to 
review and agree on which approach to adopt.
With respect to IV reuse in CBC mode, a potential leak of information occurs 
when two messages share a common prefix, as it will reveal the presence and 
length of that prefix.
Now, if we look at each format separately and at what type of message is 
encrypted in each one, we can assess the risk:
- Term Dictionary Index: the entire term dictionary index in a segment is 
encrypted as one single message - the risk is null.
- Term Dictionary Data: each suffix-bytes blob is encrypted as one message - 
I would assume that the probability of two suffix-bytes blobs sharing the 
same prefix or being identical is pretty low, but I might be wrong.
- Stored Fields Format: each compressed doc chunk is encrypted as one message - 
a doc chunk can contain the exact same data (e.g., if multiple documents 
contain the exact same fields and values). This is more likely to happen, but 
it sounds like an edge case.
- Term Vectors: each compressed terms-and-payloads blob of a doc chunk is 
encrypted as one message - same issue as with the Stored Fields Format.

If the risk of reusing IVs in Stored Fields / Term Vectors is not acceptable, 
one solution is to add a randomly generated header to each compressed doc 
chunk that will serve as a unique IV. What do you think?
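That proposal can be sketched as follows (illustrative class and method names, not the patch's API): each chunk is encrypted under a freshly generated random IV, which is stored as a 16-byte header in front of the ciphertext so the reader can recover it.

```java
import javax.crypto.Cipher;
import javax.crypto.spec.IvParameterSpec;
import javax.crypto.spec.SecretKeySpec;
import java.security.SecureRandom;

// Sketch of the random-header idea: a fresh IV per compressed doc chunk,
// prepended to the ciphertext, so identical chunks no longer encrypt to
// identical bytes. Names are hypothetical.
public class RandomHeaderChunkCipher {
    private static final int IV_LENGTH = 16; // AES block size
    private final SecretKeySpec key;
    private final SecureRandom random = new SecureRandom();

    public RandomHeaderChunkCipher(byte[] keyBytes) {
        this.key = new SecretKeySpec(keyBytes, "AES");
    }

    /** Encrypts a chunk under a fresh random IV, prepended to the output. */
    public byte[] encryptChunk(byte[] chunk) throws Exception {
        byte[] iv = new byte[IV_LENGTH];
        random.nextBytes(iv);
        Cipher c = Cipher.getInstance("AES/CBC/PKCS5Padding");
        c.init(Cipher.ENCRYPT_MODE, key, new IvParameterSpec(iv));
        byte[] ct = c.doFinal(chunk);
        byte[] out = new byte[IV_LENGTH + ct.length];
        System.arraycopy(iv, 0, out, 0, IV_LENGTH);
        System.arraycopy(ct, 0, out, IV_LENGTH, ct.length);
        return out;
    }

    /** Reads the IV header back and decrypts the remaining bytes. */
    public byte[] decryptChunk(byte[] data) throws Exception {
        Cipher c = Cipher.getInstance("AES/CBC/PKCS5Padding");
        c.init(Cipher.DECRYPT_MODE, key, new IvParameterSpec(data, 0, IV_LENGTH));
        return c.doFinal(data, IV_LENGTH, data.length - IV_LENGTH);
    }
}
```

The storage cost is 16 bytes per chunk, which is negligible against ~4kb compressed blocks.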


[jira] [Commented] (LUCENE-6966) Contribution: Codec for index-level encryption

2016-01-07 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-6966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15087144#comment-15087144
 ] 

Renaud Delbru commented on LUCENE-6966:
---

Discussion copied from the following [dev 
thread|http://mail-archives.apache.org/mod_mbox/lucene-dev/201601.mbox/%3C568D2289.5080408@siren.solutions%3E]
 

{quote}
I would strongly recommend against "invent your own mode", and instead
using standardized schemes/modes (e.g. XTS).

Separate from that, I don't understand the reasoning to do it at the
codec level. seems quite a bit more messy and complicated than the
alternatives, such as block device level (e.g. dm-crypt), or
filesystem level (e.g. ext4 filesystem encryption), which have the
advantage of the filesystem cache actually working.
{quote}

[~rcmuir], 

Yes, you are right. This approach is more complex than plain fs level 
encryption, but it enables more fine-grained control over what is encrypted. 
With fs level encryption, for example, it would not be possible to choose 
which fields to encrypt; all the data is encrypted regardless of whether it is 
sensitive or not. In such a scenario the full posting lists will be encrypted, 
which is unnecessary, and you'll pay the cost of encrypting them.
It is true that if the filesystem caches unencrypted pages, then with a warm 
cache you will likely get better performance. However, this also means that 
most of the index data will reside in memory in unencrypted form; if the 
server is compromised, this makes life easier for the attacker. There is also 
the (small) issue of swap, which can end up containing a large portion of the 
index unencrypted. This can be solved by using an encrypted swap, but then the 
data is encrypted with a single key rather than a per-user key, and it adds 
complexity to the management of the system.
Highly sensitive installations can make the trade-off between performance and 
security. There are some applications for Solr that are not served by the other 
approaches.

This codec was developed in the context of a large multi-tenant architecture, 
where each user has their own index / collection. Each user has their own key, 
and can update it at any time.
While it seems it would be possible with ext4 to handle a per-user key (e.g., 
one key per directory), it makes the key and index management more complex 
(especially in SolrCloud), which is not adequate for some environments.
Also, fs level encryption does not allow the management of multiple key 
versions in one index: if the user changes their key, we have to re-encrypt 
the full directory, which is not acceptable performance-wise for some 
environments.

The codec level encryption approach is more adequate for some environments than 
the fs level encryption approach. Also, it is to be noted that this codec does 
not affect the rest of Lucene/Solr. Users will be able to choose which approach 
is more adequate for their environment. This gives more options to Lucene/Solr 
users.


Re: Contribution: Codec for index-level encryption

2016-01-07 Thread Renaud Delbru

Hi Robert,


P.S.: I have created the issue LUCENE-6966 and moved the discussion there, as 
it is simpler for external people to participate in the discussion.


Regards
--
Renaud Delbru

On 06/01/16 15:32, Robert Muir wrote:

I would strongly recommend against "invent your own mode", and instead
using standardized schemes/modes (e.g. XTS).

Separate from that, I don't understand the reasoning to do it at the
codec level. seems quite a bit more messy and complicated than the
alternatives, such as block device level (e.g. dm-crypt), or
filesystem level (e.g. ext4 filesystem encryption), which have the
advantage of the filesystem cache actually working.



[jira] [Created] (LUCENE-6966) Contribution: Codec for index-level encryption

2016-01-07 Thread Renaud Delbru (JIRA)
Renaud Delbru created LUCENE-6966:
-

 Summary: Contribution: Codec for index-level encryption
 Key: LUCENE-6966
 URL: https://issues.apache.org/jira/browse/LUCENE-6966
 Project: Lucene - Core
  Issue Type: New Feature
  Components: modules/other
Reporter: Renaud Delbru


We would like to contribute a codec that enables the encryption of sensitive 
data in the index that has been developed as part of an engagement with a 
customer. We think that this could be of interest for the community.

Below is a description of the project.

h1. Introduction

In comparison with approaches where all data is encrypted (e.g., file system 
encryption, index output / directory encryption), encryption at the codec level 
enables more fine-grained control over which blocks of data are encrypted. This 
is more efficient since less data has to be encrypted, and it also gives more 
flexibility, such as the ability to select which fields to encrypt.

Some of the requirements for this project were:

* The performance impact of the encryption should be reasonable.
* The user can choose which field to encrypt.
* Key management: During the life cycle of the index, the user can provide a 
new version of their encryption key. Multiple key versions should be able to 
co-exist in one index.

h1. What is supported?

- Block tree terms index and dictionary
- Compressed stored fields format
- Compressed term vectors format
- Doc values format (prototype based on an encrypted index output) - this will 
be submitted as a separate patch
- Index upgrader: command to upgrade all the index segments with the latest key 
version available.

h1. How is it implemented?

h2. Key Management

One index segment is encrypted with a single key version. An index can have 
multiple segments, each one encrypted using a different key version. The key 
version for a segment is stored in the segment info.

The provided codec is abstract, and a subclass is responsible for providing an 
implementation of the cipher factory. The cipher factory is responsible for 
creating a cipher instance based on a given key version.
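As a rough sketch of this contract (illustrative names only, not the patch's actual interfaces), a concrete factory might map the key version recorded in the segment info to an initialised javax.crypto Cipher:

```java
import javax.crypto.Cipher;
import javax.crypto.spec.IvParameterSpec;
import javax.crypto.spec.SecretKeySpec;
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch of the cipher-factory split described above: the
// abstract codec asks the factory for a Cipher, given the key version
// stored in a segment's info.
public class KeyVersionCipherFactory {
    private final Map<Integer, byte[]> keysByVersion = new HashMap<>();
    private final byte[] iv; // per-format / per-segment IV, as in the model

    public KeyVersionCipherFactory(byte[] iv) {
        this.iv = iv;
    }

    /** Registers a key for a given version; old versions remain usable. */
    public void registerKey(int version, byte[] keyBytes) {
        keysByVersion.put(version, keyBytes);
    }

    /** Returns a cipher initialised for the requested key version and mode. */
    public Cipher create(int keyVersion, int opMode) throws Exception {
        byte[] keyBytes = keysByVersion.get(keyVersion);
        if (keyBytes == null) {
            throw new IllegalArgumentException("Unknown key version: " + keyVersion);
        }
        Cipher cipher = Cipher.getInstance("AES/CBC/PKCS5Padding");
        cipher.init(opMode, new SecretKeySpec(keyBytes, "AES"),
                    new IvParameterSpec(iv));
        return cipher;
    }
}
```

Keeping old key versions registered is what lets segments encrypted under different versions co-exist until the index upgrader re-encrypts them.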

h2. Encryption Model

The encryption model is based on AES/CBC with padding. The initialisation 
vector (IV) is reused for performance reasons, but only on a per-format and 
per-segment basis.

While IV reuse is usually considered a bad practice, the CBC mode is somewhat 
resilient to it. The only "leak" of information this could lead to is the 
ability to tell that two encrypted blocks of data start with the same prefix. 
However, it is unlikely that two data blocks in an index segment will start 
with the same data:

- Stored Fields Format: Each encrypted data block is a compressed block (~4kb) 
of one or more documents. It is unlikely that two compressed blocks start with 
the same data prefix.

- Term Vectors: Each encrypted data block is a compressed block (~4kb) of terms 
and payloads from one or more documents. It is unlikely that two compressed 
blocks start with the same data prefix.

- Term Dictionary Index: The term dictionary index is encoded and encrypted in 
one single data block.

- Term Dictionary Data: Each data block of the term dictionary encodes a set of 
suffixes. It is unlikely to have two dictionary data blocks sharing the same 
prefix within the same segment.

- DocValues: A DocValues file is composed of multiple encrypted data blocks. 
It is unlikely to have two data blocks sharing the same prefix within the same 
segment (each one encodes a list of values associated with a field).

To the best of our knowledge, this model should be safe. However, it would be 
good if someone with security expertise in the community could review and 
validate it. 
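The prefix-leak property described above can be demonstrated with the JDK's javax.crypto API. This is an illustrative sketch, not code from the patch: with a reused key and IV, two plaintexts that share their first 16-byte block produce an identical first ciphertext block, while any difference within that block changes it.

```java
import javax.crypto.Cipher;
import javax.crypto.spec.IvParameterSpec;
import javax.crypto.spec.SecretKeySpec;
import java.util.Arrays;

// Illustrative sketch (not code from the patch): with AES/CBC and a reused
// key + IV, equal first plaintext blocks yield equal first ciphertext blocks.
class CbcPrefixLeak {

    // Encrypt with AES/CBC and PKCS5 padding under a fixed key and IV.
    static byte[] encrypt(byte[] key, byte[] iv, byte[] plain) throws Exception {
        Cipher c = Cipher.getInstance("AES/CBC/PKCS5Padding");
        c.init(Cipher.ENCRYPT_MODE, new SecretKeySpec(key, "AES"),
               new IvParameterSpec(iv));
        return c.doFinal(plain);
    }

    // True when the first AES block (16 bytes) of both ciphertexts matches,
    // i.e. the two plaintexts shared their first 16 bytes.
    static boolean sameFirstBlock(byte[] a, byte[] b) {
        return Arrays.equals(Arrays.copyOf(a, 16), Arrays.copyOf(b, 16));
    }
}
```

This is exactly why the argument above rests on index data blocks being unlikely to share a block-aligned prefix within a segment.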

h1. Performance

We report here a performance benchmark we did on an early prototype based on 
Lucene 4.x. The benchmark was performed on the Wikipedia dataset where all the 
fields (id, title, body, date) were encrypted. Only the block tree terms and 
compressed stored fields formats were tested at that time.

h2. Indexing

The indexing throughput decreased slightly and is roughly 15% lower than with 
base Lucene.

The merge time increased by roughly 35%.

There was no significant difference in terms of index size.

h2. Query Throughput

With respect to query throughput, we observed no significant impact on the 
following queries: term query, boolean query, phrase query, numeric range 
query.

We observed the following performance impact for queries that need to scan a 
larger portion of the term dictionary:

- prefix query: decrease of ~25%
- wildcard query (e.g., “fu*r”): decrease of ~60%
- fuzzy query (distance 1): decrease of ~40%
- fuzzy query (distance 2): decrease of ~80%

We can see that the decrease in performance is relative to the size of the 
dictionary scan.

h2. Document Retrieval

We observed a decrease of document retrieval performance.

Contribution: Codec for index-level encryption

2016-01-06 Thread Renaud Delbru
- keep the order of fields, since non-encrypted and encrypted fields are 
stored in separate blocks.


- the current implementation of the cipher factory does not enforce the 
use of AES/CBC. We are planning to add this to the final version of the 
patch.


- the current implementation does not change the IV per segment. We are 
planning to add this to the final version of the patch.


- the current implementation of compressed stored fields decrypts a full 
compressed block even when only a small portion of it needs to be decompressed 
(high impact when storing very small documents). We are planning to add this 
optimisation to the final version of the patch. The overall document 
retrieval performance might improve with this optimisation.


The codec has been implemented as a contrib. Given that most of the 
classes were final, we had to copy most of the original code from the 
extended formats. At a later stage, we could think of opening some of 
these classes to extend them properly in order to reduce code 
duplication and simplify code maintenance.


--
Renaud Delbru


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-8263) Tlog replication could interfere with the replay of buffered updates

2015-12-07 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15044802#comment-15044802
 ] 

Renaud Delbru commented on SOLR-8263:
-

While patch SOLR-8263-trunk-3, which added the dedup logic for the buffered 
updates, seems straightforward, it introduced an issue which could lead to 
loss of documents.
The dedup logic used the version of the last operation from the tlog files 
transferred from the master as its starting point. However, these tlog files 
were not in sync with the index commit point; they were likely ahead of it 
(i.e., they contained operations that occurred after the index commit point). 
The starting point of the dedup logic was therefore ahead of the index commit 
point, and the logic dropped all operations that occurred between the index 
commit point and the time the tlog files were transferred from the master.
To solve this, we had to modify the ReplicationHandler to filter out tlog 
files that were not associated with a given commit point. To find the tlog 
files associated with an index commit point, we fetch the max version of the 
index commit using VersionInfo.getMaxVersionFromIndex and use this version 
number to discard tlog files. A tlog file name encodes the version of its 
starting operation (this was originally used for seeking more efficiently 
across multiple tlog files), and we use this starting version to discard 
tlogs that were created after the commit point (i.e., if starting version > 
max version).
The new patch committed by Erick includes this approach.
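The filtering step can be sketched as follows. The file-name format assumed here (a trailing ".&lt;startingVersion&gt;" suffix) is an illustration only; the real naming scheme lives in Solr's UpdateLog and ReplicationHandler code.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch of the tlog filtering described above. The file-name
// format (trailing ".<startingVersion>" suffix) is an assumption; the real
// naming scheme lives in Solr's UpdateLog.
class TlogFilter {

    // Parse the starting-operation version encoded in the tlog file name.
    static long startingVersion(String tlogName) {
        return Long.parseLong(tlogName.substring(tlogName.lastIndexOf('.') + 1));
    }

    // Keep only tlog files whose starting version does not exceed the max
    // version of the index commit point being replicated.
    static List<String> filterByCommitPoint(List<String> tlogNames,
                                            long maxCommitVersion) {
        List<String> kept = new ArrayList<>();
        for (String name : tlogNames) {
            if (startingVersion(name) <= maxCommitVersion) {
                kept.add(name);
            }
        }
        return kept;
    }
}
```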


> Tlog replication could interfere with the replay of buffered updates
> 
>
> Key: SOLR-8263
> URL: https://issues.apache.org/jira/browse/SOLR-8263
> Project: Solr
>  Issue Type: Sub-task
>    Reporter: Renaud Delbru
>Assignee: Erick Erickson
> Fix For: 5.5, 6.0
>
> Attachments: SOLR-6273-plus-8263-5x.patch, SOLR-8263-trunk-1.patch, 
> SOLR-8263-trunk-2.patch, SOLR-8263-trunk-3.patch, SOLR-8263.patch
>
>
> The current implementation of the tlog replication might interfere with the 
> replay of the buffered updates. The current tlog replication works as follows:
> 1) Fetch the tlog files from the master.
> 2) Reset the update log before switching the tlog directory.
> 3) Switch the tlog directory and re-initialise the update log with the new 
> directory.
> Currently there is no logic to keep "buffered updates" while resetting and 
> reinitializing the update log.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-8292) TransactionLog.next() does not honor contract and return null for EOF

2015-12-03 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15037616#comment-15037616
 ] 

Renaud Delbru commented on SOLR-8292:
-

Perhaps related to SOLR-4116?

> TransactionLog.next() does not honor contract and return null for EOF
> -
>
> Key: SOLR-8292
> URL: https://issues.apache.org/jira/browse/SOLR-8292
> Project: Solr
>  Issue Type: Bug
>Reporter: Erick Erickson
>
> This came to light in CDCR testing, which stresses this code a lot, there's a 
> stack trace showing this line (641 trunk) throwing an EOF exception:
> o = codec.readVal(fis);
> At first I thought to just wrap reading fis in a try/catch and return null, 
> but looking at the code a bit more I'm not so sure, that seems like it'd mask 
> what looks at first glance like a bug in the logic.
> A few lines earlier (633-4) there's these lines:
> // shouldn't currently happen - header and first record are currently written 
> at the same time
> if (fis.position() >= fos.size()) {
> Why are we comparing the input file position against the size of the 
> output file? Maybe because the 'i' key is right next to the 'o' key? The 
> comment hints that it's checking for the ability to read the first record in 
> input stream along with the header. And perhaps there's a different issue 
> here because the expectation clearly is that the first record should be there 
> if the header is.
> So what's the right thing to do? Wrap in a try/catch and return null for EOF? 
> Change the test? Do both?
> I can take care of either, but wanted a clue whether the comparison of fis to 
> fos is intended.






[jira] [Updated] (SOLR-8263) Tlog replication could interfere with the replay of buffered updates

2015-11-24 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-8263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-8263:

Attachment: SOLR-8263-trunk-3.patch

[~shalinmangar] [~erickerickson] A new patch including the dedup logic for the 
buffered updates. I have launched a few runs without any issues. The change is 
minimal, but it might be good to beast it one last time?

> Tlog replication could interfere with the replay of buffered updates
> 
>
> Key: SOLR-8263
> URL: https://issues.apache.org/jira/browse/SOLR-8263
> Project: Solr
>  Issue Type: Sub-task
>    Reporter: Renaud Delbru
>Assignee: Erick Erickson
> Attachments: SOLR-8263-trunk-1.patch, SOLR-8263-trunk-2.patch, 
> SOLR-8263-trunk-3.patch
>
>
> The current implementation of the tlog replication might interfere with the 
> replay of the buffered updates. The current tlog replication works as follows:
> 1) Fetch the tlog files from the master.
> 2) Reset the update log before switching the tlog directory.
> 3) Switch the tlog directory and re-initialise the update log with the new 
> directory.
> Currently there is no logic to keep "buffered updates" while resetting and 
> reinitializing the update log.






[jira] [Comment Edited] (SOLR-8263) Tlog replication could interfere with the replay of buffered updates

2015-11-24 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15024736#comment-15024736
 ] 

Renaud Delbru edited comment on SOLR-8263 at 11/24/15 4:07 PM:
---

[~shalinmangar] [~erickerickson] A new patch including the dedup logic for the 
buffered updates. I have launched a few runs without any issues. The changes 
are minimal, but it might be good to beast it one last time?


was (Author: rendel):
[~shalinmangar] [~erickerickson] A new patch including the dedup logic for the 
buffered updates. I have launched a few run without any issue. The change is 
minimal, but it might be good to beast it a last time ?

> Tlog replication could interfere with the replay of buffered updates
> 
>
> Key: SOLR-8263
> URL: https://issues.apache.org/jira/browse/SOLR-8263
> Project: Solr
>  Issue Type: Sub-task
>    Reporter: Renaud Delbru
>Assignee: Erick Erickson
> Attachments: SOLR-8263-trunk-1.patch, SOLR-8263-trunk-2.patch, 
> SOLR-8263-trunk-3.patch
>
>
> The current implementation of the tlog replication might interfere with the 
> replay of the buffered updates. The current tlog replication works as follows:
> 1) Fetch the tlog files from the master.
> 2) Reset the update log before switching the tlog directory.
> 3) Switch the tlog directory and re-initialise the update log with the new 
> directory.
> Currently there is no logic to keep "buffered updates" while resetting and 
> reinitializing the update log.






[jira] [Commented] (SOLR-8263) Tlog replication could interfere with the replay of buffered updates

2015-11-24 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15024675#comment-15024675
 ] 

Renaud Delbru commented on SOLR-8263:
-

[~shalinmangar] Yes, you understood the sequence correctly. To be more 
precise, here is how it works:
1) The tlog files of the leader are downloaded into a temporary directory.
2) After the files have been downloaded properly, a write lock is acquired by 
the IndexFetcher. The original tlog directory is renamed as a backup directory, 
and the temporary directory is renamed as the active tlog directory.
3) The update log is reset with the new active tlog directory. During this 
reset, the recovery info is used to read the backup buffered tlog file, and 
every buffered operation is copied to the new buffered tlog.
4) The write lock is released, and the recovery operation continues and 
applies the buffered updates.

Indeed, the buffered tlog can contain operations duplicated in the replica's 
update log. During the recovery operation, the replica might receive from the 
leader some operations that will be buffered, but these might also be present 
in one of the tlogs downloaded from the leader. Apart from the disk space 
used by these duplicate operations and the additional network transfer, there 
is no harm, as the duplicates will be ignored by the peer cluster. 
We could improve the tlog recovery operation to de-duplicate the buffered tlog 
while copying the buffered updates: check the version of the latest 
operations in the downloaded tlog, and skip operations from the buffered tlog 
if their version is lower than the latest known one. It should be a relatively 
small patch. I can try to work on that in the next few days and submit 
something, if that's fine with you and [~erickerickson]?
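The proposed version-based skip could look roughly like this. Operations are reduced to bare version numbers for illustration; the real implementation would work on tlog entries, and the class and method names here are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of the proposed de-duplication: while copying buffered
// updates into the new buffered tlog, skip operations whose version is not
// newer than the latest version seen in the tlogs downloaded from the leader.
class BufferedTlogDedup {

    static List<Long> copySkippingDuplicates(List<Long> bufferedVersions,
                                             long latestDownloadedVersion) {
        List<Long> copied = new ArrayList<>();
        for (long v : bufferedVersions) {
            if (v > latestDownloadedVersion) {
                copied.add(v);  // only operations newer than the downloaded tlogs
            }
        }
        return copied;
    }
}
```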



> Tlog replication could interfere with the replay of buffered updates
> 
>
> Key: SOLR-8263
> URL: https://issues.apache.org/jira/browse/SOLR-8263
> Project: Solr
>  Issue Type: Sub-task
>    Reporter: Renaud Delbru
>Assignee: Erick Erickson
> Attachments: SOLR-8263-trunk-1.patch, SOLR-8263-trunk-2.patch
>
>
> The current implementation of the tlog replication might interfere with the 
> replay of the buffered updates. The current tlog replication works as follows:
> 1) Fetch the tlog files from the master.
> 2) Reset the update log before switching the tlog directory.
> 3) Switch the tlog directory and re-initialise the update log with the new 
> directory.
> Currently there is no logic to keep "buffered updates" while resetting and 
> reinitializing the update log.






[jira] [Commented] (SOLR-8292) TransactionLog.next() does not honor contract and return null for EOF

2015-11-18 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15010699#comment-15010699
 ] 

Renaud Delbru commented on SOLR-8292:
-

I have checked on the cdcr code side, and whenever a log reader is used, it is 
by a single thread only. So the problem might lie somewhere else.

> TransactionLog.next() does not honor contract and return null for EOF
> -
>
> Key: SOLR-8292
> URL: https://issues.apache.org/jira/browse/SOLR-8292
> Project: Solr
>  Issue Type: Bug
>Reporter: Erick Erickson
>
> This came to light in CDCR testing, which stresses this code a lot, there's a 
> stack trace showing this line (641 trunk) throwing an EOF exception:
> o = codec.readVal(fis);
> At first I thought to just wrap reading fis in a try/catch and return null, 
> but looking at the code a bit more I'm not so sure, that seems like it'd mask 
> what looks at first glance like a bug in the logic.
> A few lines earlier (633-4) there's these lines:
> // shouldn't currently happen - header and first record are currently written 
> at the same time
> if (fis.position() >= fos.size()) {
> Why are we comparing the input file position against the size of the 
> output file? Maybe because the 'i' key is right next to the 'o' key? The 
> comment hints that it's checking for the ability to read the first record in 
> input stream along with the header. And perhaps there's a different issue 
> here because the expectation clearly is that the first record should be there 
> if the header is.
> So what's the right thing to do? Wrap in a try/catch and return null for EOF? 
> Change the test? Do both?
> I can take care of either, but wanted a clue whether the comparison of fis to 
> fos is intended.






[jira] [Commented] (SOLR-8292) TransactionLog.next() does not honor contract and return null for EOF

2015-11-16 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15006519#comment-15006519
 ] 

Renaud Delbru commented on SOLR-8292:
-

I have reviewed all the methods writing records to the tlog file, and all of 
them are properly synchronised with the flushing of the output stream. 
However, access to the input stream is not synchronised. Could it be that one 
concurrent thread changed the fis position while another was trying to read a 
record? CdcrLogReader#resetToLastPosition could interfere with 
TransactionLog.LogReader#next.


> TransactionLog.next() does not honor contract and return null for EOF
> -
>
> Key: SOLR-8292
> URL: https://issues.apache.org/jira/browse/SOLR-8292
> Project: Solr
>  Issue Type: Bug
>Reporter: Erick Erickson
>
> This came to light in CDCR testing, which stresses this code a lot, there's a 
> stack trace showing this line (641 trunk) throwing an EOF exception:
> o = codec.readVal(fis);
> At first I thought to just wrap reading fis in a try/catch and return null, 
> but looking at the code a bit more I'm not so sure, that seems like it'd mask 
> what looks at first glance like a bug in the logic.
> A few lines earlier (633-4) there's these lines:
> // shouldn't currently happen - header and first record are currently written 
> at the same time
> if (fis.position() >= fos.size()) {
> Why are we comparing the input file position against the size of the 
> output file? Maybe because the 'i' key is right next to the 'o' key? The 
> comment hints that it's checking for the ability to read the first record in 
> input stream along with the header. And perhaps there's a different issue 
> here because the expectation clearly is that the first record should be there 
> if the header is.
> So what's the right thing to do? Wrap in a try/catch and return null for EOF? 
> Change the test? Do both?
> I can take care of either, but wanted a clue whether the comparison of fis to 
> fos is intended.






[jira] [Commented] (SOLR-8292) TransactionLog.next() does not honor contract and return null for EOF

2015-11-16 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15006537#comment-15006537
 ] 

Renaud Delbru commented on SOLR-8292:
-

The contract of the log reader is that it should not be used by more than 
one thread (see the comment on TransactionLog#getReader). I'll double check 
that the cdcr code respects this.

> TransactionLog.next() does not honor contract and return null for EOF
> -
>
> Key: SOLR-8292
> URL: https://issues.apache.org/jira/browse/SOLR-8292
> Project: Solr
>  Issue Type: Bug
>Reporter: Erick Erickson
>
> This came to light in CDCR testing, which stresses this code a lot, there's a 
> stack trace showing this line (641 trunk) throwing an EOF exception:
> o = codec.readVal(fis);
> At first I thought to just wrap reading fis in a try/catch and return null, 
> but looking at the code a bit more I'm not so sure, that seems like it'd mask 
> what looks at first glance like a bug in the logic.
> A few lines earlier (633-4) there's these lines:
> // shouldn't currently happen - header and first record are currently written 
> at the same time
> if (fis.position() >= fos.size()) {
> Why are we comparing the input file position against the size of the 
> output file? Maybe because the 'i' key is right next to the 'o' key? The 
> comment hints that it's checking for the ability to read the first record in 
> input stream along with the header. And perhaps there's a different issue 
> here because the expectation clearly is that the first record should be there 
> if the header is.
> So what's the right thing to do? Wrap in a try/catch and return null for EOF? 
> Change the test? Do both?
> I can take care of either, but wanted a clue whether the comparison of fis to 
> fos is intended.






EOF contract in TransactionLog

2015-11-13 Thread Renaud Delbru

Dear all,

in one of the unit tests of CDCR, we stumbled upon the following issue:

 [junit4]   2> java.io.EOFException
 [junit4]   2>at 
org.apache.solr.common.util.FastInputStream.readByte(FastInputStream.java:208)
 [junit4]   2>at 
org.apache.solr.common.util.JavaBinCodec.readVal(JavaBinCodec.java:198)
 [junit4]   2>at 
org.apache.solr.update.TransactionLog$LogReader.next(TransactionLog.java:641)
 [junit4]   2>at 
org.apache.solr.update.CdcrTransactionLog$CdcrLogReader.next(CdcrTransactionLog.java:154)


According to the comment on the LogReader#next() method, the contract should 
be to return null when EOF is reached. However, this does not seem to be 
respected, as per the stack trace. Is it a bug, and should I open an issue to 
fix it? Or is it just the method comment that is out of date (and should 
probably be fixed as well)?
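For reference, the contract being discussed, returning null at EOF instead of propagating EOFException, can be sketched as follows. This is illustrative only and does not mirror TransactionLog's actual record format or internals.

```java
import java.io.DataInputStream;
import java.io.EOFException;
import java.io.IOException;

// Illustrative sketch: next() returns null at EOF instead of letting
// EOFException escape. Does not mirror TransactionLog's record format.
class EofSafeReader {

    static Long next(DataInputStream in) throws IOException {
        try {
            return in.readLong();  // read one (fake) record
        } catch (EOFException e) {
            return null;           // honor the null-at-EOF contract
        }
    }
}
```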


Thanks
--
Renaud Delbru


[jira] [Updated] (SOLR-8263) Tlog replication could interfere with the replay of buffered updates

2015-11-13 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-8263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-8263:

Attachment: SOLR-8263-trunk-2.patch

A new version of the patch (this replaces the previous one), which includes a 
fix related to the write lock.
In the previous patch, the write lock was removed accidentally while 
re-initialising the update log with the new set of tlog files (the init method 
was creating a new instance of the VersionInfo). As a consequence, there was a 
small time frame during which updates were lost (a batch of documents was 
missed in 1 out of 10 runs). The fix introduces a new init method that 
preserves the original VersionInfo instance and therefore preserves the write 
lock.
I have run the test 50 times without seeing the issue anymore.

> Tlog replication could interfere with the replay of buffered updates
> 
>
> Key: SOLR-8263
> URL: https://issues.apache.org/jira/browse/SOLR-8263
> Project: Solr
>  Issue Type: Sub-task
>    Reporter: Renaud Delbru
>Assignee: Erick Erickson
> Attachments: SOLR-8263-trunk-1.patch, SOLR-8263-trunk-2.patch
>
>
> The current implementation of the tlog replication might interfere with the 
> replay of the buffered updates. The current tlog replication works as follows:
> 1) Fetch the tlog files from the master.
> 2) Reset the update log before switching the tlog directory.
> 3) Switch the tlog directory and re-initialise the update log with the new 
> directory.
> Currently there is no logic to keep "buffered updates" while resetting and 
> reinitializing the update log.






[jira] [Comment Edited] (SOLR-8263) Tlog replication could interfere with the replay of buffered updates

2015-11-13 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004348#comment-15004348
 ] 

Renaud Delbru edited comment on SOLR-8263 at 11/13/15 5:38 PM:
---

A new version of the patch (this replaces the previous one), which includes a 
fix related to the write lock.
In the previous patch, the write lock was removed accidentally while 
re-initialising the update log with the new set of tlog files (the init method 
was creating a new instance of the VersionInfo). As a consequence, there was a 
small time frame during which updates were lost (a batch of documents was 
missed in 1 out of 10 runs). The fix introduces a new init method that 
preserves the original VersionInfo instance and therefore preserves the write 
lock.
I have run the test 50 times without seeing the issue anymore.


was (Author: rendel):
A new version of the patch (this replaces the previous one) which includes a 
fix related to the write lock.
In the previous patch, the write lock was removed accidentally while 
re-initialising the update log with the new set of tlog files (the init method 
was creating a new instance of the VersionInfo). As a consequence there was a 
small time frame where updates were lost (a batch of documents were missed in 1 
over 10 runs). The fix introduces a new init method that preserves the original 
VersionInfo instance and therefore preserves the write lock.
I have run the test 50 times without seeing anymore the issue.

> Tlog replication could interfere with the replay of buffered updates
> 
>
> Key: SOLR-8263
> URL: https://issues.apache.org/jira/browse/SOLR-8263
> Project: Solr
>  Issue Type: Sub-task
>    Reporter: Renaud Delbru
>Assignee: Erick Erickson
> Attachments: SOLR-8263-trunk-1.patch, SOLR-8263-trunk-2.patch
>
>
> The current implementation of the tlog replication might interfere with the 
> replay of the buffered updates. The current tlog replication works as follows:
> 1) Fetch the tlog files from the master.
> 2) Reset the update log before switching the tlog directory.
> 3) Switch the tlog directory and re-initialise the update log with the new 
> directory.
> Currently there is no logic to keep "buffered updates" while resetting and 
> reinitializing the update log.






[jira] [Updated] (SOLR-8263) Tlog replication could interfere with the replay of buffered updates

2015-11-12 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-8263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-8263:

Attachment: SOLR-8263-trunk-1.patch

[~shalinmangar] [~erickerickson] An initial patch for this issue. It includes 
a unit test that reproduces the described issue, and an initial fix.
The index fetcher now takes care of moving the buffered updates from the 
previous update log to the new one. During the move, the index fetcher blocks 
updates to ensure that no buffered updates are missed.

> Tlog replication could interfere with the replay of buffered updates
> 
>
> Key: SOLR-8263
> URL: https://issues.apache.org/jira/browse/SOLR-8263
> Project: Solr
>  Issue Type: Sub-task
>    Reporter: Renaud Delbru
>Assignee: Erick Erickson
> Attachments: SOLR-8263-trunk-1.patch
>
>
> The current implementation of the tlog replication might interfere with the 
> replay of the buffered updates. The current tlog replication works as follows:
> 1) Fetch the tlog files from the master.
> 2) Reset the update log before switching the tlog directory.
> 3) Switch the tlog directory and re-initialise the update log with the new 
> directory.
> Currently there is no logic to keep "buffered updates" while resetting and 
> reinitializing the update log.






[jira] [Comment Edited] (SOLR-8263) Tlog replication could interfere with the replay of buffered updates

2015-11-12 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15002018#comment-15002018
 ] 

Renaud Delbru edited comment on SOLR-8263 at 11/12/15 12:50 PM:


[~shalinmangar] [~erickerickson] An initial patch for this issue. It includes 
a unit test that reproduces the described issue, and an initial fix.
The index fetcher is now taking care of moving the buffered updates from the 
previous update log to the new one. During the move, the index fetcher is 
blocking updates to ensure that no buffered updates are missed.


was (Author: rendel):
[~shalinmangar][~erickerickson] An initial first patch for this issue. It 
includes a unit test that was able to produce the describe issue, and an 
initial fix for the issue.
The index fetcher is now taking care of moving the buffered updates of the 
previous update log to the new one. During the move, the index fetcher is 
blocking updates to ensure that no buffered updates will be missed.

> Tlog replication could interfere with the replay of buffered updates
> 
>
> Key: SOLR-8263
> URL: https://issues.apache.org/jira/browse/SOLR-8263
> Project: Solr
>  Issue Type: Sub-task
>    Reporter: Renaud Delbru
>Assignee: Erick Erickson
> Attachments: SOLR-8263-trunk-1.patch
>
>
> The current implementation of the tlog replication might interfere with the 
> replay of the buffered updates. The current tlog replication works as follows:
> 1) Fetch the tlog files from the master.
> 2) Reset the update log before switching the tlog directory.
> 3) Switch the tlog directory and re-initialise the update log with the new 
> directory.
> Currently there is no logic to keep "buffered updates" while resetting and 
> reinitializing the update log.






[jira] [Created] (SOLR-8263) Tlog replication could interfere with the replay of buffered updates

2015-11-09 Thread Renaud Delbru (JIRA)
Renaud Delbru created SOLR-8263:
---

 Summary: Tlog replication could interfere with the replay of 
buffered updates
 Key: SOLR-8263
 URL: https://issues.apache.org/jira/browse/SOLR-8263
 Project: Solr
  Issue Type: Sub-task
Reporter: Renaud Delbru


The current implementation of the tlog replication might interfere with the 
replay of the buffered updates. The current tlog replication works as follows:
1) Fetch the tlog files from the master.
2) Reset the update log before switching the tlog directory.
3) Switch the tlog directory and re-initialise the update log with the new 
directory.
Currently there is no logic to keep "buffered updates" while resetting and 
reinitializing the update log.






[jira] [Updated] (SOLR-6273) Cross Data Center Replication

2015-11-02 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-6273:

Attachment: SOLR-6273-trunk-testfix6.patch

[~erickerickson] Please find attached your patch with some fixes.
The cause of the NPE was that some replication handler tests were not running 
in cloud mode, and therefore the update log was null. I have added a fix for 
that. I have also fixed some merge issues with the latest trunk. The full Solr 
test suite was executed successfully.

[~shalinmangar] Regarding the potential issue with the transaction log 
replication, I will have a look this week. Should I open a sub-issue to track 
this separately?

> Cross Data Center Replication
> -
>
> Key: SOLR-6273
> URL: https://issues.apache.org/jira/browse/SOLR-6273
> Project: Solr
>  Issue Type: New Feature
>Reporter: Yonik Seeley
>Assignee: Erick Erickson
> Attachments: SOLR-6273-trunk-testfix1.patch, 
> SOLR-6273-trunk-testfix2.patch, SOLR-6273-trunk-testfix3.patch, 
> SOLR-6273-trunk-testfix6.patch, SOLR-6273-trunk.patch, SOLR-6273-trunk.patch, 
> SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch, 
> forShalin.patch
>
>
> This is the master issue for Cross Data Center Replication (CDCR)
> described at a high level here: 
> http://heliosearch.org/solr-cross-data-center-replication/






[jira] [Comment Edited] (SOLR-6273) Cross Data Center Replication

2015-11-02 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14985508#comment-14985508
 ] 

Renaud Delbru edited comment on SOLR-6273 at 11/2/15 4:44 PM:
--

[~erickerickson] Please find attached your patch with some fixes.
The cause of the NPE was that some replication handler tests were not running 
in cloud mode, and therefore the update log was null. I have added a simple fix 
for that issue. I have also fixed some merge issues with the latest trunk. The 
full Solr test suite was executed successfully.

[~shalinmangar] Regarding the potential issue with the transaction log 
replication, I will have a look this week. Should I open a sub-issue to track 
this separately?


was (Author: rendel):
[~erickerickson] Find attached your patch with some fixes.
The cause of the NPE was that some replication handler tests were not running 
in cloud mode, and therefore the update log was null. I have added a fix in 
that. I have also fixed some merge issues with the latest trunk. The full Solr 
test suite was executed successfully.

[~shalinmangar] Regarding the potential issue with the transaction log 
replication, I will have a look this week. Should I open a sub-issue to track 
this separately ? 

> Cross Data Center Replication
> -
>
> Key: SOLR-6273
> URL: https://issues.apache.org/jira/browse/SOLR-6273
> Project: Solr
>  Issue Type: New Feature
>Reporter: Yonik Seeley
>Assignee: Erick Erickson
> Attachments: SOLR-6273-trunk-testfix1.patch, 
> SOLR-6273-trunk-testfix2.patch, SOLR-6273-trunk-testfix3.patch, 
> SOLR-6273-trunk-testfix6.patch, SOLR-6273-trunk.patch, SOLR-6273-trunk.patch, 
> SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch, 
> forShalin.patch
>
>
> This is the master issue for Cross Data Center Replication (CDCR)
> described at a high level here: 
> http://heliosearch.org/solr-cross-data-center-replication/






[jira] [Commented] (SOLR-6273) Cross Data Center Replication

2015-10-22 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14968987#comment-14968987
 ] 

Renaud Delbru commented on SOLR-6273:
-

The tlog replication is only relevant to the source cluster, as it ensures that 
tlogs are replicated between a master and slaves in case of a recovery (with a 
snappull). Without it, there are some scenarios where a slave can end up with 
an incomplete update log; if that slave becomes the master, we will miss some 
updates and the target cluster will become inconsistent with respect to the 
source cluster.


> Cross Data Center Replication
> -
>
> Key: SOLR-6273
> URL: https://issues.apache.org/jira/browse/SOLR-6273
> Project: Solr
>  Issue Type: New Feature
>Reporter: Yonik Seeley
>Assignee: Erick Erickson
> Attachments: SOLR-6273-trunk-testfix1.patch, 
> SOLR-6273-trunk-testfix2.patch, SOLR-6273-trunk-testfix3.patch, 
> SOLR-6273-trunk.patch, SOLR-6273-trunk.patch, SOLR-6273.patch, 
> SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch
>
>
> This is the master issue for Cross Data Center Replication (CDCR)
> described at a high level here: 
> http://heliosearch.org/solr-cross-data-center-replication/






[jira] [Commented] (SOLR-6273) Cross Data Center Replication

2015-10-22 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14969006#comment-14969006
 ] 

Renaud Delbru commented on SOLR-6273:
-

Yes, I think we should probably change the default value of the scheduler to 
1ms unless we change the model to a streaming one. 1000ms is far too high as a 
default value.

> Cross Data Center Replication
> -
>
> Key: SOLR-6273
> URL: https://issues.apache.org/jira/browse/SOLR-6273
> Project: Solr
>  Issue Type: New Feature
>Reporter: Yonik Seeley
>Assignee: Erick Erickson
> Attachments: SOLR-6273-trunk-testfix1.patch, 
> SOLR-6273-trunk-testfix2.patch, SOLR-6273-trunk-testfix3.patch, 
> SOLR-6273-trunk.patch, SOLR-6273-trunk.patch, SOLR-6273.patch, 
> SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch
>
>
> This is the master issue for Cross Data Center Replication (CDCR)
> described at a high level here: 
> http://heliosearch.org/solr-cross-data-center-replication/






[jira] [Commented] (SOLR-6273) Cross Data Center Replication

2015-10-22 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14968970#comment-14968970
 ] 

Renaud Delbru commented on SOLR-6273:
-

[~shalinmangar] thanks for looking into this.

Regarding performance (2 and 3), it is true that the right batch size and 
scheduler delay are very important for optimal performance. With the proper 
batch sizes and scheduler delays, we have seen very low update latency between 
the source and target clusters. In your setup, one document was approximately 
0.2kb in size, therefore a batch was ~14kb, which should correspond to a 
transfer rate of ~14mb/s. With such a transfer rate, the replication should 
have been done in a few seconds or minutes, not hours. Could you give more 
information about your setup / benchmark? Was replication turned off while 
you were indexing on the source, or did you turn it on afterwards?
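A quick back-of-the-envelope check of the figures above. The document size comes from the discussion; the batch size of ~70 documents and the 1ms scheduler delay are assumptions chosen to reproduce the quoted ~14kb and ~14mb/s numbers:

```java
// Sanity-check the transfer-rate arithmetic: docs per batch and the
// scheduler delay are illustrative assumptions, not measured values.
public class ThroughputEstimate {
    public static void main(String[] args) {
        double docSizeKb = 0.2;   // approximate document size (from the thread)
        int docsPerBatch = 70;    // assumed batch size in documents
        double scheduleMs = 1.0;  // assumed scheduler delay between batches

        double batchKb = docSizeKb * docsPerBatch;        // ~14 kB per batch
        double batchesPerSecond = 1000.0 / scheduleMs;    // 1000 batches/s
        double rateMbPerSec = batchKb * batchesPerSecond / 1000.0;

        System.out.printf("batch=%.0fkB rate=%.0fMB/s%n", batchKb, rateMbPerSec);
    }
}
```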

In terms of moving from a batch model to a pure streaming one, this would 
probably simplify the configuration on the user side, but in terms of 
performance I am not sure - maybe some other people can give their opinion 
here. A batch model might not use that much memory (if properly configured), 
and the same goes for transfer speed (if the batch size is properly configured 
too). Another way to simplify the configuration for the user is, as you 
proposed, to have a configurable transfer rate with some logic to automatically 
adjust the batch size and scheduler delay based on that rate.

About 5, I think transfer rate is a good addition. Latency could be computed as 
well, since the QUEUES monitoring action returns the last document timestamp.


> Cross Data Center Replication
> -
>
> Key: SOLR-6273
> URL: https://issues.apache.org/jira/browse/SOLR-6273
> Project: Solr
>  Issue Type: New Feature
>Reporter: Yonik Seeley
>Assignee: Erick Erickson
> Attachments: SOLR-6273-trunk-testfix1.patch, 
> SOLR-6273-trunk-testfix2.patch, SOLR-6273-trunk-testfix3.patch, 
> SOLR-6273-trunk.patch, SOLR-6273-trunk.patch, SOLR-6273.patch, 
> SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch
>
>
> This is the master issue for Cross Data Center Replication (CDCR)
> described at a high level here: 
> http://heliosearch.org/solr-cross-data-center-replication/






[jira] [Commented] (SOLR-6273) Cross Data Center Replication

2015-10-22 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14969188#comment-14969188
 ] 

Renaud Delbru commented on SOLR-6273:
-

This is the first time I have seen this issue.
How did you perform the reload? Did you delete the source collection before 
the reload, or just reload and overwrite the existing documents?

> Cross Data Center Replication
> -
>
> Key: SOLR-6273
> URL: https://issues.apache.org/jira/browse/SOLR-6273
> Project: Solr
>  Issue Type: New Feature
>Reporter: Yonik Seeley
>Assignee: Erick Erickson
> Attachments: SOLR-6273-trunk-testfix1.patch, 
> SOLR-6273-trunk-testfix2.patch, SOLR-6273-trunk-testfix3.patch, 
> SOLR-6273-trunk.patch, SOLR-6273-trunk.patch, SOLR-6273.patch, 
> SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch
>
>
> This is the master issue for Cross Data Center Replication (CDCR)
> described at a high level here: 
> http://heliosearch.org/solr-cross-data-center-replication/






[jira] [Commented] (SOLR-6273) Cross Data Center Replication

2015-10-22 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14969182#comment-14969182
 ] 

Renaud Delbru commented on SOLR-6273:
-

That's a good point, and I think the current implementation might interfere 
with the replay of the buffered updates. The current tlog replication works as 
follows:
1) Fetch the tlog files from the master
2) Reset the update log before switching the tlog directory
3) Switch the tlog directory and re-initialise the update log with the new 
directory.
Currently there is no logic to keep "buffered updates" while resetting and 
reinitializing the update log. It looks like the tlog replication still needs 
some work.

> Cross Data Center Replication
> -
>
> Key: SOLR-6273
> URL: https://issues.apache.org/jira/browse/SOLR-6273
> Project: Solr
>  Issue Type: New Feature
>Reporter: Yonik Seeley
>Assignee: Erick Erickson
> Attachments: SOLR-6273-trunk-testfix1.patch, 
> SOLR-6273-trunk-testfix2.patch, SOLR-6273-trunk-testfix3.patch, 
> SOLR-6273-trunk.patch, SOLR-6273-trunk.patch, SOLR-6273.patch, 
> SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch
>
>
> This is the master issue for Cross Data Center Replication (CDCR)
> described at a high level here: 
> http://heliosearch.org/solr-cross-data-center-replication/






Re: Strange comment in CdcrReplicationHandlerTest.java

2015-07-30 Thread Renaud Delbru
Yes, my apologies for this, I didn't catch this one when I reviewed the 
code before commit.

--
Renaud Delbru

On 29/07/15 23:29, Erick Erickson wrote:

Standard Apache license; this was just a couple of erroneous lines at the top, 
I suspect auto-added by his IDE. I missed it too.

Will fix this in the next week, I'm traveling right now.

To wit:

/*
  * Licensed to the Apache Software Foundation (ASF) under one or more
  * contributor license agreements.  See the NOTICE file distributed with
  * this work for additional information regarding copyright ownership.
  * The ASF licenses this file to You under the Apache License, Version 2.0
  * (the "License"); you may not use this file except in compliance with
  * the License.  You may obtain a copy of the License at
  *
  * http://www.apache.org/licenses/LICENSE-2.0
  *
  * Unless required by applicable law or agreed to in writing, software
  * distributed under the License is distributed on an "AS IS" BASIS,
  * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  * See the License for the specific language governing permissions and
  * limitations under the License.
  */

On Wed, Jul 29, 2015 at 4:27 PM, Uwe Schindler u...@thetaphi.de wrote:

RAT would only fail if the license header is missing completely. I don't think it checks 
for copyright notices.

If there is no license header, we should check our RAT config! What does it 
list as license for that file?

Uwe

-
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de



-Original Message-
From: Erick Erickson [mailto:erickerick...@gmail.com]
Sent: Wednesday, July 29, 2015 7:36 PM
To: dev@lucene.apache.org
Subject: Re: Strange comment in CdcrReplicationHandlerTest.java

Yeah, I wondered that myself.

On Wed, Jul 29, 2015 at 1:35 PM, Ramkumar R. Aiyengar
andyetitmo...@gmail.com wrote:

Hmm.. I would have expected rat to fail this in precommit actually..

On 29 Jul 2015 18:01, Timothy Potter thelabd...@gmail.com wrote:


Why is this in the code?

/**
  * Copyright (c) 2015 Renaud Delbru. All Rights Reserved.
  */
package org.apache.solr.cloud;




[jira] [Commented] (SOLR-6461) peer cluster configuration

2015-07-03 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14613126#comment-14613126
 ] 

Renaud Delbru commented on SOLR-6461:
-

Yes, it has been fixed by the superset issue SOLR-6273. The configuration of 
the target clusters is done through the Replica Parameters (see 
[manual|https://docs.google.com/document/d/1DZHUFM3z9OX171DeGjcLTRI9uULM-NB1KsCSpVL3Zy0/edit#]).
 It consists of 3 parameters: zkHost to indicate the address of the ZooKeeper 
of the target cluster, source to indicate the source collection to replicate, 
and target to indicate the target collection that will receive the updates. 

 peer cluster configuration
 --

 Key: SOLR-6461
 URL: https://issues.apache.org/jira/browse/SOLR-6461
 Project: Solr
  Issue Type: Sub-task
Reporter: Yonik Seeley

 From http://heliosearch.org/solr-cross-data-center-replication/#Overview
 Clusters will be configured to know about each other, most likely through 
 keeping a cluster peer list in zookeeper. One essential piece of information 
 will be the zookeeper quorum address for each cluster peer. Any node in one 
 cluster can know the configuration of another cluster via a zookeeper 
 client.






[jira] [Commented] (SOLR-6273) Cross Data Center Replication

2015-07-03 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14613124#comment-14613124
 ] 

Renaud Delbru commented on SOLR-6273:
-

Hi Martin,

The google doc is up to date with the current implementation. One suggestion 
concerns tuning the performance of the replication. The performance of the 
replication depends on the Replicator Parameters. In your scenario, the two 
main parameters will be schedule and batchSize. If you would like to see a 
very small latency between replication batches, you can decrease the schedule 
parameter from 1000ms to 1ms. To improve the network IO, you can also try to 
increase the batchSize parameter to a larger number (if your documents are a 
few kb or less, you can try increasing it to 500, 1000 or more). 

To measure the impact these parameters have on the replication performance, 
you can use the monitoring API, e.g., ?action=QUEUES, to retrieve some stats 
about the replication queue. The queue size will tell you how far your replica 
lags behind the source cluster. If the replication is not fast enough, you'll 
see the queue size increasing. The idea is to tune the schedule and batchSize 
parameters until you find the optimal values for your collection and setup, 
and the queue stays relatively small and stable.
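For illustration, the schedule and batchSize knobs discussed above sit in the CDCR request handler configuration in solrconfig.xml. The snippet below is a sketch based on the parameter names used in this thread; the host names and values are placeholders, not recommendations:

```xml
<requestHandler name="/cdcr" class="solr.CdcrRequestHandler">
  <!-- Replica parameters: where to push updates -->
  <lst name="replica">
    <str name="zkHost">target-zk:2181</str>   <!-- ZooKeeper of the target cluster -->
    <str name="source">collection1</str>      <!-- source collection to replicate -->
    <str name="target">collection1</str>      <!-- target collection receiving updates -->
  </lst>
  <!-- Replicator parameters: the tuning knobs discussed above -->
  <lst name="replicator">
    <str name="threadPoolSize">2</str>
    <str name="schedule">10</str>    <!-- delay between batches, in ms -->
    <str name="batchSize">500</str>  <!-- documents per batch -->
  </lst>
</requestHandler>
```

The queue lag can then be inspected with the monitoring action, e.g. a GET request to /solr/collection1/cdcr?action=QUEUES.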

 Cross Data Center Replication
 -

 Key: SOLR-6273
 URL: https://issues.apache.org/jira/browse/SOLR-6273
 Project: Solr
  Issue Type: New Feature
Reporter: Yonik Seeley
Assignee: Erick Erickson
 Attachments: SOLR-6273-trunk-testfix1.patch, 
 SOLR-6273-trunk-testfix2.patch, SOLR-6273-trunk.patch, SOLR-6273-trunk.patch, 
 SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch


 This is the master issue for Cross Data Center Replication (CDCR)
 described at a high level here: 
 http://heliosearch.org/solr-cross-data-center-replication/






[jira] [Updated] (SOLR-6273) Cross Data Center Replication

2015-06-03 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-6273:

Attachment: SOLR-6273-trunk-testfix2.patch

[~erickerickson], I have attached a new patch regarding the unit test failures 
from the Jenkins job. It is likely that the errors we saw are due to the 
Jenkins server being under heavy load and therefore less responsive, which 
might trigger race conditions in the assertions of the unit tests.
I have added various safeguard methods to the unit test framework, so that it 
will wait for the completion of particular tasks (cdcr state replication, 
update log cleaning, etc.) and fail after a given timeout (15s).
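Such a safeguard typically boils down to a poll-until-condition-or-timeout helper. The sketch below is a hypothetical illustration of the pattern, not the actual test-framework code:

```java
import java.util.function.BooleanSupplier;

// Poll a condition until it holds, failing after a timeout - the same
// shape as the wait-for-task safeguards described above.
public class WaitUtil {
    static void waitFor(BooleanSupplier condition, long timeoutMs)
            throws InterruptedException {
        long deadline = System.currentTimeMillis() + timeoutMs;
        while (!condition.getAsBoolean()) {
            if (System.currentTimeMillis() > deadline) {
                throw new AssertionError("condition not met within " + timeoutMs + " ms");
            }
            Thread.sleep(100); // poll interval
        }
    }

    public static void main(String[] args) throws InterruptedException {
        long start = System.currentTimeMillis();
        // stand-in condition that becomes true after ~300 ms
        waitFor(() -> System.currentTimeMillis() - start > 300, 15_000);
        System.out.println("done");
    }
}
```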

 Cross Data Center Replication
 -

 Key: SOLR-6273
 URL: https://issues.apache.org/jira/browse/SOLR-6273
 Project: Solr
  Issue Type: New Feature
Reporter: Yonik Seeley
Assignee: Erick Erickson
 Attachments: SOLR-6273-trunk-testfix1.patch, 
 SOLR-6273-trunk-testfix2.patch, SOLR-6273-trunk.patch, SOLR-6273-trunk.patch, 
 SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch


 This is the master issue for Cross Data Center Replication (CDCR)
 described at a high level here: 
 http://heliosearch.org/solr-cross-data-center-replication/






[jira] [Updated] (SOLR-6273) Cross Data Center Replication

2015-05-26 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-6273:

Attachment: SOLR-6273-trunk-testfix1.patch

[~erickerickson], I was able to reproduce the issues from the failed Jenkins 
build. After replicating the tlog files, the update log of the slave is not 
properly re-initialised, and it still contains references to the previous 
tlog files. I have attached a fix for this.

 Cross Data Center Replication
 -

 Key: SOLR-6273
 URL: https://issues.apache.org/jira/browse/SOLR-6273
 Project: Solr
  Issue Type: New Feature
Reporter: Yonik Seeley
Assignee: Erick Erickson
 Attachments: SOLR-6273-trunk-testfix1.patch, SOLR-6273-trunk.patch, 
 SOLR-6273-trunk.patch, SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch, 
 SOLR-6273.patch


 This is the master issue for Cross Data Center Replication (CDCR)
 described at a high level here: 
 http://heliosearch.org/solr-cross-data-center-replication/






[jira] [Commented] (SOLR-6273) Cross Data Center Replication

2015-05-21 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14554099#comment-14554099
 ] 

Renaud Delbru commented on SOLR-6273:
-

[~erickerickson] I have checked the new patch on the latest trunk. The unit 
tests seem to properly run with the latest changes. Thanks for porting this to 
trunk.

 Cross Data Center Replication
 -

 Key: SOLR-6273
 URL: https://issues.apache.org/jira/browse/SOLR-6273
 Project: Solr
  Issue Type: New Feature
Reporter: Yonik Seeley
Assignee: Erick Erickson
 Attachments: SOLR-6273-trunk.patch, SOLR-6273-trunk.patch, 
 SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch


 This is the master issue for Cross Data Center Replication (CDCR)
 described at a high level here: 
 http://heliosearch.org/solr-cross-data-center-replication/






[jira] [Updated] (SOLR-6273) Cross Data Center Replication

2015-04-27 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-6273:

Attachment: SOLR-6273.patch

Here is a new patch with the following changes:

- Renamed 'slice' to 'shard'

- Removed an optimisation in the replication of tlog files which could lead to 
duplicate tlog entries on a slave node. We were trying to avoid transferring 
tlog files that were already present on the slave nodes in order to reduce 
network transfer. However, tlog files between the master and slave can differ, 
overlap, etc., making the comparison difficult to achieve. We removed this 
optimisation, and now during a recovery the tlog replication transfers all 
the tlog files from the master to the slave and replaces all the existing 
tlog files on the slave node.

 Cross Data Center Replication
 -

 Key: SOLR-6273
 URL: https://issues.apache.org/jira/browse/SOLR-6273
 Project: Solr
  Issue Type: New Feature
Reporter: Yonik Seeley
Assignee: Erick Erickson
 Attachments: SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch, 
 SOLR-6273.patch


 This is the master issue for Cross Data Center Replication (CDCR)
 described at a high level here: 
 http://heliosearch.org/solr-cross-data-center-replication/






[jira] [Commented] (SOLR-6273) Cross Data Center Replication

2015-04-22 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14507324#comment-14507324
 ] 

Renaud Delbru commented on SOLR-6273:
-

Hi,

[~erickerickson]: From the original subtasks, the ones that are not covered 
by this patch are SOLR-6465 and SOLR-6466.

[~grishick]: The current patch does not cover the auto-provisioning of 
collections / live configuration of peer clusters. I think this issue should be 
tackled as part of SOLR-6466.

[~janhoy]: Could you point to where *slice* is being used instead of *shard*? 
It should not be a problem to change that.


 Cross Data Center Replication
 -

 Key: SOLR-6273
 URL: https://issues.apache.org/jira/browse/SOLR-6273
 Project: Solr
  Issue Type: New Feature
Reporter: Yonik Seeley
Assignee: Erick Erickson
 Attachments: SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch


 This is the master issue for Cross Data Center Replication (CDCR)
 described at a high level here: 
 http://heliosearch.org/solr-cross-data-center-replication/






[jira] [Updated] (SOLR-6273) Cross Data Center Replication

2015-04-15 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-6273:

Attachment: SOLR-6273.patch

A new version of the patch. The patch has been created from the latest 
branch_5x. The full Solr test suite has been executed successfully (there were 
a few timeouts in some of the tests, but this seems unrelated to this patch). 
The principal change in this new version is a fix for the replication of 
tlog files. The {{ReplicationHandler}} and {{IndexFetcher}} have been modified 
to replicate tlog files during a recovery (only if CDCR is activated). Some 
unit tests covering various scenarios can be found
in {{core/src/test/org/apache/solr/cloud/CdcrReplicationHandlerTest.java}}.
In addition to the suite of automated unit tests, this version has been tested 
in various real deployments. One client has extensively tested the robustness 
and performance of CDCR in pre-prod and is satisfied with the results. 

We think that the code is in a relatively good state to be pushed to Solr. How 
can we move forward from here?

 Cross Data Center Replication
 -

 Key: SOLR-6273
 URL: https://issues.apache.org/jira/browse/SOLR-6273
 Project: Solr
  Issue Type: New Feature
Reporter: Yonik Seeley
 Attachments: SOLR-6273.patch, SOLR-6273.patch, SOLR-6273.patch


 This is the master issue for Cross Data Center Replication (CDCR)
 described at a high level here: 
 http://heliosearch.org/solr-cross-data-center-replication/






Re: Behavior of JettySolrRunner#start wrt Solr data/tlog directories

2015-03-23 Thread Renaud Delbru

Hi Alan,

Thanks for your feedback,
Indeed, after your reply I investigated a bit more and discovered that it was 
the UpdateHandler's init that was clearing the tlog directory when a 
non-persistent directory factory is used. The solution is to switch to a 
persistent directory factory for my tests.
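For reference, a persistent on-disk directory factory can be selected in solrconfig.xml. This is an illustrative snippet, not taken from the test in question:

```xml
<!-- Use an on-disk, persistent directory factory so that the data/ and
     tlog/ directories survive a JettySolrRunner restart, instead of a
     RAM-based factory whose contents are discarded on shutdown. -->
<directoryFactory name="DirectoryFactory"
                  class="solr.StandardDirectoryFactory"/>
```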

--
Renaud Delbru

On 03/23/2015 04:32 PM, Alan Woodward wrote:

Hi Renaud,

I don't think there's anything special in JettySolrRunner that cleans up
old directories, and the various Replication tests do something very
similar to what you want here - are you sure it's the JSR code that's
removing files here?

Alan Woodward
www.flax.co.uk http://www.flax.co.uk


On 23 Mar 2015, at 16:20, Renaud Delbru wrote:


Dear all,

I am currently working on the SOLR-6273 (CDCR) and I am currently
facing an issue with the Solr test framework. I am trying to write a
unit test where the slave node is stopped then restarted during the
execution of the unit test, in order to verify the replication of tlog
files (something that is introduced by CDCR). The scenario is the
following:
- instantiate a master and slave node
- send a first batch of updates to the master
- stop the slave
- send a second batch of updates to the master
- restart the slave in order to trigger replication
- verify that the update logs between the master and slaves are
properly replicated.

The problem I am facing is that whenever I restart the slave, using
the SolrJettyRunner.start() method, the Solr data directory and tlog
subdirectory are cleaned up, and not reused. Therefore I am unable to
test the scenario where the slave has some partial tlog files.

Is there a way to tell the jetty server to reuse the Solr data
directory / tlog directory instead of erasing it ? Or is there another
way to emulate that a slave node is down ?

Thanks
--
Renaud Delbru




Behavior of JettySolrRunner#start wrt Solr data/tlog directories

2015-03-23 Thread Renaud Delbru

Dear all,

I am currently working on the SOLR-6273 (CDCR) and I am currently facing 
an issue with the Solr test framework. I am trying to write a unit test 
where the slave node is stopped then restarted during the execution of 
the unit test, in order to verify the replication of tlog files 
(something that is introduced by CDCR). The scenario is the following:

- instantiate a master and slave node
- send a first batch of updates to the master
- stop the slave
- send a second batch of updates to the master
- restart the slave in order to trigger replication
- verify that the update logs between the master and slaves are properly 
replicated.


The problem I am facing is that whenever I restart the slave, using the 
SolrJettyRunner.start() method, the Solr data directory and tlog 
subdirectory are cleaned up, and not reused. Therefore I am unable to 
test the scenario where the slave has some partial tlog files.


Is there a way to tell the jetty server to reuse the Solr data directory 
/ tlog directory instead of erasing it ? Or is there another way to 
emulate that a slave node is down ?


Thanks
--
Renaud Delbru




[jira] [Updated] (SOLR-6460) Keep transaction logs around longer

2014-12-11 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-6460:

Attachment: SOLR-6460.patch

The latest version of the update log extension for cdcr. In addition to the 
previously described features, we extended the transaction log to compute and 
store the number of records in a tlog file. 
The patch SOLR-6819 is required for executing the unit tests.

 Keep transaction logs around longer
 ---

 Key: SOLR-6460
 URL: https://issues.apache.org/jira/browse/SOLR-6460
 Project: Solr
  Issue Type: Sub-task
Reporter: Yonik Seeley
 Attachments: SOLR-6460.patch, SOLR-6460.patch, SOLR-6460.patch, 
 SOLR-6460.patch


 Transaction logs are currently deleted relatively quickly... but we need to 
 keep them around much longer to be used as a source for cross-datacenter 
 recovery.  This will also be useful in the future for enabling peer-sync to 
 use more historical updates before falling back to replication.






[jira] [Commented] (SOLR-6460) Keep transaction logs around longer

2014-12-10 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14241396#comment-14241396
 ] 

Renaud Delbru commented on SOLR-6460:
-

CDCR depends on those modifications, but this extension is not dependent on 
CDCR.
All the modifications were implemented as an extension of the original update 
log. The reason to keep it separated was to avoid pushing unexpected problems 
into the other parts of SolrCloud. 
This extension can easily be integrated into the original update log / 
transaction log. Maybe this could be done once we are more confident 
with it.


 Keep transaction logs around longer
 ---

 Key: SOLR-6460
 URL: https://issues.apache.org/jira/browse/SOLR-6460
 Project: Solr
  Issue Type: Sub-task
Reporter: Yonik Seeley
 Attachments: SOLR-6460.patch, SOLR-6460.patch, SOLR-6460.patch


 Transaction logs are currently deleted relatively quickly... but we need to 
 keep them around much longer to be used as a source for cross-datacenter 
 recovery.  This will also be useful in the future for enabling peer-sync to 
 use more historical updates before falling back to replication.






[jira] [Updated] (SOLR-6819) Being able to configure the updates log implementation from solrconfig.xml

2014-12-10 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-6819:

Attachment: SOLR-6819.patch

A new patch that reverts the original behaviour of the update handler regarding 
the hdfs update log instantiation, in order to avoid backward-compatibility problems.

 Being able to configure the updates log implementation from solrconfig.xml
 --

 Key: SOLR-6819
 URL: https://issues.apache.org/jira/browse/SOLR-6819
 Project: Solr
  Issue Type: Improvement
  Components: SolrCloud, update
Affects Versions: Trunk
Reporter: Renaud Delbru
 Attachments: SOLR-6819.patch, SOLR-6819.patch


 CDCR requires its own implementation of the updates log. At the moment, there 
 is no way to configure the class to use when instantiating the updates log. 
 The UpdateHandler is deciding to instantiate the base class UpdateLog or its 
 hdfs version HdfsUpdateLog based on the directory path prefix.
 We can extend the UpdateHandler to allow for a class parameter to be defined 
 for the updateLog section of the solrconfig.xml. For example, the relevant 
 part of the solrconfig.xml will look like:
 {code:xml}
 <updateHandler class="solr.DirectUpdateHandler2">
   <updateLog class="solr.CdcrUpdateLog">
     <str name="dir">${solr.ulog.dir:}</str>
   </updateLog>
 </updateHandler>
 {code}
 where the updateLog entry has a class parameter indicating that the 
 CdcrUpdateLog implementation must be used.






[jira] [Created] (SOLR-6823) Improve extensibility of DistributedUpdateProcessor regarding version processing

2014-12-05 Thread Renaud Delbru (JIRA)
Renaud Delbru created SOLR-6823:
---

 Summary: Improve extensibility of DistributedUpdateProcessor 
regarding version processing
 Key: SOLR-6823
 URL: https://issues.apache.org/jira/browse/SOLR-6823
 Project: Solr
  Issue Type: Improvement
  Components: SolrCloud, update
Affects Versions: Trunk
Reporter: Renaud Delbru


As described in SOLR-6462, 
{quote}
doDeleteByQuery() is structured differently than processAdd() and 
processDelete() in DistributedUpdateProcessor. We refactored doDeleteByQuery() 
by extracting a portion of its code into a helper method versionDeleteByQuery() 
which is then overridden in the CdcrUpdateProcessor. This way doDeleteByQuery() 
is structurally similar to the other two cases and we are able to keep the CDCR 
logic completely separated.
{quote}

This issue provides a patch for the DistributedUpdateProcessor for trunk.
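
The refactoring described above can be sketched roughly as follows. This is a hypothetical simplification, not the actual Solr code: class and method names follow the quote, but the method bodies are illustrative placeholders.

```java
// Hypothetical sketch of the refactoring quoted above: the versioning part of
// doDeleteByQuery() is pulled into a protected helper, versionDeleteByQuery(),
// which CdcrUpdateProcessor overrides. Bodies are illustrative only.
class DistributedUpdateProcessor {
    void doDeleteByQuery(String query) {
        // ... distributed bookkeeping elided ...
        long version = versionDeleteByQuery(query);
        // ... apply the delete locally under `version` ...
    }

    // Extracted helper: assigns a version to the delete-by-query.
    protected long versionDeleteByQuery(String query) {
        return System.nanoTime(); // placeholder version-clock logic
    }
}

class CdcrUpdateProcessor extends DistributedUpdateProcessor {
    @Override
    protected long versionDeleteByQuery(String query) {
        // CDCR-specific handling (e.g. keeping a version already assigned by
        // a peer cluster) hooks in here, leaving the base class untouched.
        return super.versionDeleteByQuery(query);
    }
}
```

Because the override point is a single helper, doDeleteByQuery() now mirrors the structure of processAdd() and processDelete(), as the quote describes.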






[jira] [Updated] (SOLR-6823) Improve extensibility of DistributedUpdateProcessor regarding version processing

2014-12-05 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-6823:

Attachment: SOLR-6823.patch

 Improve extensibility of DistributedUpdateProcessor regarding version 
 processing
 

 Key: SOLR-6823
 URL: https://issues.apache.org/jira/browse/SOLR-6823
 Project: Solr
  Issue Type: Improvement
  Components: SolrCloud, update
Affects Versions: Trunk
Reporter: Renaud Delbru
 Attachments: SOLR-6823.patch


 As described in SOLR-6462, 
 {quote}
 doDeleteByQuery() is structured differently than processAdd() and 
 processDelete() in DistributedUpdateProcessor. We refactored 
 doDeleteByQuery() by extracting a portion of its code into a helper method 
 versionDeleteByQuery() which is then overridden in the CdcrUpdateProcessor. 
 This way doDeleteByQuery() is structurally similar to the other two cases and 
 we are able to keep the CDCR logic completely separated.
 {quote}
 This issue provides a patch for the DistributedUpdateProcessor for trunk.






[jira] [Updated] (SOLR-6273) Cross Data Center Replication

2014-12-05 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-6273:

Attachment: SOLR-6273.patch

The initial patch for CDCR for trunk. It contains a working version of 
cross data center replication for active-passive scenarios. The 
CdcrRequestHandler provides an API to control and monitor the replication. 
Documentation on how to configure CDCR and its API can be found 
[here|https://docs.google.com/document/d/1DZHUFM3z9OX171DeGjcLTRI9uULM-NB1KsCSpVL3Zy0/edit?usp=sharing].
This patch includes the following patches: SOLR-6621, SOLR-6819, SOLR-6823, and a few 
minor modifications to the UpdateLog and TransactionLog classes. Other than that, the 
rest of the CDCR code simply extends the Solr Core code.

 Cross Data Center Replication
 

 Key: SOLR-6273
 URL: https://issues.apache.org/jira/browse/SOLR-6273
 Project: Solr
  Issue Type: New Feature
Reporter: Yonik Seeley
 Attachments: SOLR-6273.patch


 This is the master issue for Cross Data Center Replication (CDCR)
 described at a high level here: 
 http://heliosearch.org/solr-cross-data-center-replication/






[jira] [Updated] (SOLR-6823) Improve extensibility of DistributedUpdateProcessor regarding version processing

2014-12-05 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-6823:

Description: 
As described in SOLR-6462, 
{quote}
doDeleteByQuery() is structured differently than processAdd() and 
processDelete() in DistributedUpdateProcessor. We refactored doDeleteByQuery() 
by extracting a portion of its code into a helper method versionDeleteByQuery() 
which is then overridden in the CdcrUpdateProcessor. This way doDeleteByQuery() 
is structurally similar to the other two cases and we are able to keep the CDCR 
logic completely separated.
{quote}

This issue provides a patch for the DistributedUpdateProcessor for trunk.

  was:
As described in 6462, 
{quote}
doDeleteByQuery() is structured differently than processAdd() and 
processDelete() in DistributedUpdateProcessor. We refactored doDeleteByQuery() 
by extracting a portion of its code into a helper method versionDeleteByQuery() 
which is then overriden in the CdcrUpdateProcessor. This way doDeleteByQuery() 
is structurally similar to the other two cases and we are able to keep the CDCR 
logic completely separated.
{quote}

This issue provides a patch for the DisitrbutedUpdateProcessor for trunk.


 Improve extensibility of DistributedUpdateProcessor regarding version 
 processing
 

 Key: SOLR-6823
 URL: https://issues.apache.org/jira/browse/SOLR-6823
 Project: Solr
  Issue Type: Improvement
  Components: SolrCloud, update
Affects Versions: Trunk
Reporter: Renaud Delbru
 Attachments: SOLR-6823.patch


 As described in SOLR-6462, 
 {quote}
 doDeleteByQuery() is structured differently than processAdd() and 
 processDelete() in DistributedUpdateProcessor. We refactored 
 doDeleteByQuery() by extracting a portion of its code into a helper method 
 versionDeleteByQuery() which is then overridden in the CdcrUpdateProcessor. 
 This way doDeleteByQuery() is structurally similar to the other two cases and 
 we are able to keep the CDCR logic completely separated.
 {quote}
 This issue provides a patch for the DistributedUpdateProcessor for trunk.






[jira] [Comment Edited] (SOLR-6273) Cross Data Center Replication

2014-12-05 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14235662#comment-14235662
 ] 

Renaud Delbru edited comment on SOLR-6273 at 12/5/14 3:58 PM:
--

The initial patch for CDCR for trunk. It contains a working version of 
cross data center replication for active-passive scenarios. The 
CdcrRequestHandler provides an API to control and monitor the replication. 
Documentation on how to configure CDCR and its API can be found 
[here|https://docs.google.com/document/d/1DZHUFM3z9OX171DeGjcLTRI9uULM-NB1KsCSpVL3Zy0/edit?usp=sharing].
This patch includes the following patches: SOLR-6621, SOLR-6819, SOLR-6823, and 
a few minor modifications to the UpdateLog and TransactionLog classes. Other 
than that, the rest of the CDCR code simply extends the Solr Core code.


was (Author: rendel):
The initial patch for cdcr for trunk. It contains a working version of the 
cross data center replication for active-passive scenarios. The 
CdcrRequestHandler provides an API to control and monitor the replication. A 
documentation on how to configure cdcr and of the API can be found 
[here|https://docs.google.com/document/d/1DZHUFM3z9OX171DeGjcLTRI9uULM-NB1KsCSpVL3Zy0/edit?usp=sharing].
This patch includes the following patches: 6621, 6819, 6823, and a few minor 
modifications on the UpdateLog and TransactionLog classes. Other than that, the 
rest of the CDCR code simply extends the Solr Core code.

 Cross Data Center Replication
 

 Key: SOLR-6273
 URL: https://issues.apache.org/jira/browse/SOLR-6273
 Project: Solr
  Issue Type: New Feature
Reporter: Yonik Seeley
 Attachments: SOLR-6273.patch


 This is the master issue for Cross Data Center Replication (CDCR)
 described at a high level here: 
 http://heliosearch.org/solr-cross-data-center-replication/






[jira] [Created] (SOLR-6819) Being able to configure the updates log implementation from solrconfig.xml

2014-12-04 Thread Renaud Delbru (JIRA)
Renaud Delbru created SOLR-6819:
---

 Summary: Being able to configure the updates log implementation 
from solrconfig.xml
 Key: SOLR-6819
 URL: https://issues.apache.org/jira/browse/SOLR-6819
 Project: Solr
  Issue Type: Improvement
  Components: SolrCloud, update
Affects Versions: Trunk
Reporter: Renaud Delbru


CDCR requires its own implementation of the updates log. At the moment, there 
is no way to configure the class to use when instantiating the updates log. The 
UpdateHandler is deciding to instantiate the base class UpdateLog or its hdfs 
version HdfsUpdateLog based on the directory path prefix.
We can extend the UpdateHandler to allow for a class parameter to be defined 
for the updateLog section of the solrconfig.xml. For example, the relevant part 
of the solrconfig.xml will look like:
{code:xml}
<updateHandler class="solr.DirectUpdateHandler2">
  <updateLog class="solr.CdcrUpdateLog">
    <str name="dir">${solr.ulog.dir:}</str>
  </updateLog>
</updateHandler>
{code}
where the updateLog entry has a class parameter indicating that the 
CdcrUpdateLog implementation must be used.






[jira] [Updated] (SOLR-6819) Being able to configure the updates log implementation from solrconfig.xml

2014-12-04 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-6819:

Attachment: SOLR-6819.patch

The patch implementing the extension to configure the class of the updates log. 
This also makes the update log initialisation part of the UpdateHandler 
cleaner. Specific instructions for the configuration of the HdfsUpdateLog have 
been moved within HdfsUpdateLog itself.

 Being able to configure the updates log implementation from solrconfig.xml
 --

 Key: SOLR-6819
 URL: https://issues.apache.org/jira/browse/SOLR-6819
 Project: Solr
  Issue Type: Improvement
  Components: SolrCloud, update
Affects Versions: Trunk
Reporter: Renaud Delbru
 Attachments: SOLR-6819.patch


 CDCR requires its own implementation of the updates log. At the moment, there 
 is no way to configure the class to use when instantiating the updates log. 
 The UpdateHandler is deciding to instantiate the base class UpdateLog or its 
 hdfs version HdfsUpdateLog based on the directory path prefix.
 We can extend the UpdateHandler to allow for a class parameter to be defined 
 for the updateLog section of the solrconfig.xml. For example, the relevant 
 part of the solrconfig.xml will look like:
 {code:xml}
 <updateHandler class="solr.DirectUpdateHandler2">
   <updateLog class="solr.CdcrUpdateLog">
     <str name="dir">${solr.ulog.dir:}</str>
   </updateLog>
 </updateHandler>
 {code}
 where the updateLog entry has a class parameter indicating that the 
 CdcrUpdateLog implementation must be used.






[jira] [Commented] (SOLR-6621) SolrZkClient does not guarantee that a watch object will only be triggered once for a given notification

2014-10-16 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14173785#comment-14173785
 ] 

Renaud Delbru commented on SOLR-6621:
-

I have added some comments, and created a pull request at:
https://github.com/apache/lucene-solr/pull/100

 SolrZkClient does not guarantee that a watch object will only be triggered 
 once for a given notification
 

 Key: SOLR-6621
 URL: https://issues.apache.org/jira/browse/SOLR-6621
 Project: Solr
  Issue Type: Bug
  Components: SolrCloud
Affects Versions: Trunk
Reporter: Renaud Delbru
 Attachments: SOLR-6621


 The SolrZkClient provides methods such as getData or exists. The problem is 
 that the client automatically wraps the provided watcher with a new watcher 
 (see 
 [here|https://github.com/apache/lucene-solr/blob/6ead83a6fafbdd6c444e2a837b09eccf34a255ef/solr/solrj/src/java/org/apache/solr/common/cloud/SolrZkClient.java#L255])
  which breaks the guarantee that a watch object, or function/context pair, 
 will only be triggered once for a given notification. This creates 
 undesirable effects when we are registering the same watch in the Watcher 
 callback method.
 A possible solution would be to introduce a SolrZkWatcher class, that will 
 take care of submitting the job to the zkCallbackExecutor. Components in 
 SolrCloud will extend this class and implement their own callback method. 
 This will ensure that the watcher object that zookeeper receives remains the 
 same.
 See SOLR-6462 for background information.






[jira] [Updated] (SOLR-6621) SolrZkClient does not guarantee that a watch object will only be triggered once for a given notification

2014-10-14 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-6621:

Attachment: SOLR-6621

Hi,

Were you thinking of something like this?

 SolrZkClient does not guarantee that a watch object will only be triggered 
 once for a given notification
 

 Key: SOLR-6621
 URL: https://issues.apache.org/jira/browse/SOLR-6621
 Project: Solr
  Issue Type: Bug
  Components: SolrCloud
Affects Versions: Trunk
Reporter: Renaud Delbru
 Attachments: SOLR-6621


 The SolrZkClient provides methods such as getData or exists. The problem is 
 that the client automatically wraps the provided watcher with a new watcher 
 (see 
 [here|https://github.com/apache/lucene-solr/blob/6ead83a6fafbdd6c444e2a837b09eccf34a255ef/solr/solrj/src/java/org/apache/solr/common/cloud/SolrZkClient.java#L255])
  which breaks the guarantee that a watch object, or function/context pair, 
 will only be triggered once for a given notification. This creates 
 undesirable effects when we are registering the same watch in the Watcher 
 callback method.
 A possible solution would be to introduce a SolrZkWatcher class, that will 
 take care of submitting the job to the zkCallbackExecutor. Components in 
 SolrCloud will extend this class and implement their own callback method. 
 This will ensure that the watcher object that zookeeper receives remains the 
 same.
 See SOLR-6462 for background information.






[jira] [Commented] (SOLR-6621) SolrZkClient does not guarantee that a watch object will only be triggered once for a given notification

2014-10-14 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14170918#comment-14170918
 ] 

Renaud Delbru commented on SOLR-6621:
-

OK, if this looks good, I'll add some documentation on the wrapWatcher method and 
upload a new patch.

 SolrZkClient does not guarantee that a watch object will only be triggered 
 once for a given notification
 

 Key: SOLR-6621
 URL: https://issues.apache.org/jira/browse/SOLR-6621
 Project: Solr
  Issue Type: Bug
  Components: SolrCloud
Affects Versions: Trunk
Reporter: Renaud Delbru
 Attachments: SOLR-6621


 The SolrZkClient provides methods such as getData or exists. The problem is 
 that the client automatically wraps the provided watcher with a new watcher 
 (see 
 [here|https://github.com/apache/lucene-solr/blob/6ead83a6fafbdd6c444e2a837b09eccf34a255ef/solr/solrj/src/java/org/apache/solr/common/cloud/SolrZkClient.java#L255])
  which breaks the guarantee that a watch object, or function/context pair, 
 will only be triggered once for a given notification. This creates 
 undesirable effects when we are registering the same watch in the Watcher 
 callback method.
 A possible solution would be to introduce a SolrZkWatcher class, that will 
 take care of submitting the job to the zkCallbackExecutor. Components in 
 SolrCloud will extend this class and implement their own callback method. 
 This will ensure that the watcher object that zookeeper receives remains the 
 same.
 See SOLR-6462 for background information.






[jira] [Created] (SOLR-6621) SolrZkClient does not guarantee that a watch object will only be triggered once for a given notification

2014-10-13 Thread Renaud Delbru (JIRA)
Renaud Delbru created SOLR-6621:
---

 Summary: SolrZkClient does not guarantee that a watch object will 
only be triggered once for a given notification
 Key: SOLR-6621
 URL: https://issues.apache.org/jira/browse/SOLR-6621
 Project: Solr
  Issue Type: Bug
  Components: SolrCloud
Affects Versions: Trunk
Reporter: Renaud Delbru


The SolrZkClient provides methods such as getData or exists. The problem is 
that the client automatically wraps the provided watcher with a new watcher 
(see 
[here|https://github.com/apache/lucene-solr/blob/6ead83a6fafbdd6c444e2a837b09eccf34a255ef/solr/solrj/src/java/org/apache/solr/common/cloud/SolrZkClient.java#L255])
 which breaks the guarantee that a watch object, or function/context pair, 
will only be triggered once for a given notification. This creates undesirable 
effects when we are registering the same watch in the Watcher callback method.

A possible solution would be to introduce a SolrZkWatcher class that will take 
care of submitting the job to the zkCallbackExecutor. Components in SolrCloud 
will extend this class and implement their own callback method. This will 
ensure that the watcher object that zookeeper receives remains the same.

See SOLR-6462 for background information.
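
The proposed fix can be sketched as follows. This is a hypothetical, dependency-free illustration: the Watcher interface is simplified here (the real one is org.apache.zookeeper.Watcher), and the executor setup is an assumption, not the actual Solr code.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

// Simplified stand-in for org.apache.zookeeper.Watcher.
interface Watcher {
    void process(String event);
}

// Hypothetical sketch of the proposed SolrZkWatcher: the component itself is
// the Watcher object handed to ZooKeeper, so re-registering it never changes
// the object identity ZooKeeper uses, and only the *processing* of the event
// is handed off to the callback executor.
abstract class SolrZkWatcher implements Watcher {
    // Daemon thread so this sketch does not keep the JVM alive.
    private final ExecutorService zkCallbackExecutor =
        Executors.newSingleThreadExecutor(r -> {
            Thread t = new Thread(r, "zkCallback");
            t.setDaemon(true);
            return t;
        });

    @Override
    public final void process(String event) {
        zkCallbackExecutor.submit(() -> onEvent(event));
    }

    // Components implement their callback (and any re-registration) here.
    protected abstract void onEvent(String event);
}
```

Since ZooKeeper always sees the same object, the once-per-notification guarantee is preserved even when the component re-registers itself from inside its callback.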






[jira] [Commented] (SOLR-6462) forward updates asynchronously to peer clusters/leaders

2014-10-13 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14169296#comment-14169296
 ] 

Renaud Delbru commented on SOLR-6462:
-

I have started to implement the CDCR request handler that will handle CDCR 
life-cycle actions and forward updates to the peer clusters.
While trying to implement the synchronisation of the life-cycle status amongst 
all the nodes of a cluster by using zookeeper, I have encountered a limitation 
of the SolrZkClient. The SolrZkClient provides methods such as getData or 
exists. The problem is that the client automatically wraps the provided watcher 
with a new watcher (see 
[here|https://github.com/apache/lucene-solr/blob/6ead83a6fafbdd6c444e2a837b09eccf34a255ef/solr/solrj/src/java/org/apache/solr/common/cloud/SolrZkClient.java#L255])
 which breaks the guarantee that a watch object, or function/context pair, 
will only be triggered once for a given notification. This creates undesirable 
effects when we are registering the same watch in the Watcher callback method.

I have created issue SOLR-6621 to report the problem.

 forward updates asynchronously to peer clusters/leaders
 ---

 Key: SOLR-6462
 URL: https://issues.apache.org/jira/browse/SOLR-6462
 Project: Solr
  Issue Type: Sub-task
Reporter: Yonik Seeley

 http://heliosearch.org/solr-cross-data-center-replication/#UpdateFlow
 - An update will be received by the shard leader and versioned
 - The update will be sent from the leader to its replicas
 - Concurrently, the update will be sent (synchronously or asynchronously) to the 
 shard leader in other clusters
 - The shard leader in the other cluster will receive the already versioned update 
 (and not re-version it), and forward the update to its replicas






[jira] [Updated] (SOLR-6460) Keep transaction logs around longer

2014-10-02 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-6460:

Attachment: SOLR-6460.patch

Here is the latest patch which includes an optimisation to reduce the number of 
opened files and some code cleaning. To summarise, the current patch provides 
the following:

h4. Cleaning of Old Transaction Logs

The CdcrUpdateLog removes old tlogs based on pointers instead of a fixed size 
limit.

h4. Log Reader

The CdcrUpdateLog provides a log reader with scan and seek operations. A log 
reader is associated to a log pointer, and is taking care of the life-cycle of 
the pointer.

h4. Log Index

To improve the efficiency of the seek operation of the log reader, an index of 
transaction log files has been added. This index enables quick lookup of a 
tlog file based on a version number. It is implemented by adding a 
version number to the tlog filename and by leveraging the file system index. 
This solution was chosen as it was simpler and more robust than managing a 
separate disk-based index.
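
As a rough illustration (hypothetical code, assuming the tlog.${logId}.${startVersion} naming introduced by this patch), the lookup reduces to a floor search over the start versions parsed from the file names:

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch of the filename-based log index: because each file name
// carries its start version (tlog.<logId>.<startVersion>), finding the file
// that may contain a given version is a floor lookup over the parsed names,
// with no separate disk-based index to maintain.
class TlogIndex {
    // Returns the tlog file with the greatest startVersion <= version,
    // or null if every file starts after the requested version.
    static String lookup(List<String> tlogFileNames, long version) {
        TreeMap<Long, String> byStartVersion = new TreeMap<>();
        for (String name : tlogFileNames) {
            String[] parts = name.split("\\.");  // ["tlog", logId, startVersion]
            byStartVersion.put(Long.parseLong(parts[2]), name);
        }
        Map.Entry<Long, String> entry = byStartVersion.floorEntry(version);
        return entry == null ? null : entry.getValue();
    }
}
```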

h4. Number of Opened Files

TransactionLog has been extended to automatically (1) close the output stream 
when its reference count reaches 0, and (2) reopen the output stream on demand. 
The new tlog (the current tlog being written) is kept open at all times. When a 
transaction log is pushed to the old tlog list, its reference count is 
decremented, which might trigger the closing of the output stream. 
The output stream is reopened in two cases:
* during recovery, to write a commit to the end of an uncapped tlog file;
* when a log reader is accessing it.

At the moment, the logic is split into two classes (TransactionLog and 
CdcrTransactionLog). We should probably merge the two in the final version.
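
The reference-counting behaviour described above might look roughly like this. This is a hypothetical sketch, not the actual TransactionLog code; the class and method names are illustrative.

```java
import java.io.File;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.OutputStream;

// Hypothetical sketch of the reference-counted output stream: decref() closes
// the underlying stream once the count reaches 0, and output() lazily reopens
// it in append mode when recovery or a log reader needs the file again.
class RefCountedLog {
    private final File file;
    private OutputStream out;
    private int refCount = 1;   // the live tlog starts referenced

    RefCountedLog(File file) throws IOException {
        this.file = file;
        this.out = new FileOutputStream(file, true);
    }

    synchronized void incref() {
        refCount++;
    }

    synchronized void decref() throws IOException {
        if (--refCount == 0 && out != null) {
            out.close();        // pushed to the old-tlog list: release the fd
            out = null;
        }
    }

    // Reopened on demand (recovery commit, or a log reader accessing it).
    synchronized OutputStream output() throws IOException {
        if (out == null) {
            out = new FileOutputStream(file, true);
            refCount++;         // the reopened stream holds a reference again
        }
        return out;
    }
}
```

This keeps the number of simultaneously open file descriptors bounded by the number of logs actually being written or read, rather than by the total number of retained tlog files.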

h4. Integration within the UpdateHandler

There is a nocommit in the UpdateHandler to force the instantiation of the 
CdcrUpdateLog instead of the UpdateLog. We need to decide how users will 
configure this and modify the UpdateHandler appropriately.


 Keep transaction logs around longer
 ---

 Key: SOLR-6460
 URL: https://issues.apache.org/jira/browse/SOLR-6460
 Project: Solr
  Issue Type: Sub-task
Reporter: Yonik Seeley
 Attachments: SOLR-6460.patch, SOLR-6460.patch, SOLR-6460.patch


 Transaction logs are currently deleted relatively quickly... but we need to 
 keep them around much longer to be used as a source for cross-datacenter 
 recovery.  This will also be useful in the future for enabling peer-sync to 
 use more historical updates before falling back to replication.






[jira] [Comment Edited] (SOLR-6460) Keep transaction logs around longer

2014-10-02 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14156672#comment-14156672
 ] 

Renaud Delbru edited comment on SOLR-6460 at 10/2/14 3:47 PM:
--

Here is the latest patch which includes an optimisation to reduce the number of 
opened files and some code cleaning. To summarise, the current patch provides 
the following:

h4. Cleaning of Old Transaction Logs

The CdcrUpdateLog removes old tlogs based on pointers instead of a fixed size 
limit.

h4. Log Reader

The CdcrUpdateLog provides a log reader with scan and seek operations. A log 
reader is associated to a log pointer, and is taking care of the life-cycle of 
the pointer.

h4. Log Index

To improve the efficiency of the seek operation of the log reader, an index of 
transaction log files has been added. This index enables quick lookup of a 
tlog file based on a version number. It is implemented by adding a 
version number to the tlog filename and by leveraging the file system index. 
This solution was chosen as it was simpler and more robust than managing a 
separate disk-based index.

h4. Number of Opened Files

TransactionLog has been extended to automatically (1) close the output stream 
when its reference count reaches 0, and (2) reopen the output stream on 
demand. 
The new tlog (the current tlog being written) is kept open at all times. When a 
transaction log is pushed to the old tlog list, its reference count is 
decremented, which might trigger the closing of the output stream. 
The output stream is reopened in two cases:
* during recovery, to write a commit to the end of an uncapped tlog file;
* when a log reader is accessing it.

At the moment, the logic is split into two classes (TransactionLog and 
CdcrTransactionLog). We should probably merge the two in the final version.

h4. Integration within the UpdateHandler

There is a nocommit in the UpdateHandler to force the instantiation of the 
CdcrUpdateLog instead of the UpdateLog. We need to decide how users will 
configure this and modify the UpdateHandler appropriately.



was (Author: rendel):
Here is the latest patch which includes an optimisation to reduce the number of 
opened files and some code cleaning. To summarise, the current patch provides 
the following:

h4. Cleaning of Old Transaction Logs

The CdcrUpdateLog removes old tlogs based on pointers instead of a fixed size 
limit.

h4. Log Reader

The CdcrUpdateLog provides a log reader with scan and seek operations. A log 
reader is associated to a log pointer, and is taking care of the life-cycle of 
the pointer.

h4. Log Index

To improve the efficiency of the seek operation of the log reader, an index of 
transaction log files have been added. This index enables to quickly lookup a 
tlog file based on a version number. This index is implemented by adding a 
version number to the tlog filename and by leveraging the file system index. 
This solution was choosen as it was simpler and more robust than managing a 
separate disk-based index.

h4. Number of Opened Files

TransactionLog has been extended to automatically (1) close the output stream 
when its refeference count reach 0, and (2) reopen the output stream on demand. 
The new tlog (the current tlog being written) is kept open at all time. When a 
transaction log is pushed to the old tlog list, its reference count is 
decremented, which might trigger the closing of the output stream. 
The output stream is reopened in two cases:
* during recovery, to write a commit to the end of an uncapped tlog file;
* when a log reader is accessing it.

At the moment, the logic is splitted into two classes (TransactionLog and 
CdcrTransactionLog). We should probably merge the two in the final version.

h4. Integration within the UpdateHandler

There is a nocommit in the UpdateHandler to force the instantiation of the 
CdcrUpdateLog instead of the UpdateLog. We need to decide how user will 
configure this and modify the UpdateHandler appropriately.


 Keep transaction logs around longer
 ---

 Key: SOLR-6460
 URL: https://issues.apache.org/jira/browse/SOLR-6460
 Project: Solr
  Issue Type: Sub-task
Reporter: Yonik Seeley
 Attachments: SOLR-6460.patch, SOLR-6460.patch, SOLR-6460.patch


 Transaction logs are currently deleted relatively quickly... but we need to 
 keep them around much longer to be used as a source for cross-datacenter 
 recovery.  This will also be useful in the future for enabling peer-sync to 
 use more historical updates before falling back to replication.






[jira] [Updated] (SOLR-6460) Keep transaction logs around longer

2014-09-30 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-6460:

Attachment: SOLR-6460.patch

A new patch that introduces efficient seeking over a list of transaction log 
files. Efficient seeking is achieved by adding metadata (the version number) 
to the tlog filename and by leveraging the filesystem's index. The tlog 
filename now has the following syntax: tlog.${logId}.${startVersion}.
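Under this naming scheme, a reader can locate the single file that may contain a given version without opening any file; a minimal sketch (hypothetical class, not part of the patch):

```java
import java.util.Map;
import java.util.NavigableMap;
import java.util.TreeMap;

// Key each tlog file on the startVersion embedded in its name; the file that
// may contain a version is the one with the greatest startVersion <= version.
class TlogIndex {
    private final NavigableMap<Long, String> byStartVersion = new TreeMap<>();

    void add(String fileName) {
        // Expected shape: tlog.<logId>.<startVersion>
        String[] parts = fileName.split("\\.");
        byStartVersion.put(Long.parseLong(parts[2]), fileName);
    }

    // Returns the only candidate file for `version`, or null if none.
    String fileFor(long version) {
        Map.Entry<Long, String> e = byStartVersion.floorEntry(version);
        return e == null ? null : e.getValue();
    }
}
```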






[jira] [Comment Edited] (SOLR-6460) Keep transaction logs around longer

2014-09-30 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14153430#comment-14153430
 ] 

Renaud Delbru edited comment on SOLR-6460 at 9/30/14 5:26 PM:
--

A new patch that introduces efficient seeking over a list of transaction log 
files. Efficient seeking is achieved by adding metadata (the version number) 
to the tlog filename and by leveraging the filesystem's index. The tlog 
filename now has the following syntax:
{noformat}
tlog.${logId}.${startVersion}
{noformat}




was (Author: rendel):
A new patch that introduces efficient seeking over a list of transaction log 
files. Efficient seeking is achieved by adding metadata (version number) to 
tlog filename and by leveraging the filesystem's index. The tlog filename has 
now the following syntax: tlog.${logId}.${startVersion}.







[jira] [Commented] (SOLR-6460) Keep transaction logs around longer

2014-09-24 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14146174#comment-14146174
 ] 

Renaud Delbru commented on SOLR-6460:
-

Hi, 

here is an initial analysis and proposal of the modifications to the UpdateLog 
for the CDCR scenario.
Most of the original workflow of the UpdateLog can be left untouched. However, 
it is necessary to keep the concept of a maximum number of records to keep 
(except for the cleaning of old transaction logs) so as not to interfere with 
the normal workflow.

h4. Cleaning of Old Transaction Logs

The logic to remove old tlog files should be modified so that it relies on 
pointers instead of a limit defined by the maximum number of records to keep.
The UpdateLog should be in charge of keeping the list of pointers and of 
managing their life cycle (or of delegating this to the LogReader, which is 
presented next). Such a pointer, denoted LogPointer, should be composed of a 
tlog file and an associated file pointer.
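A minimal sketch of the proposed LogPointer, an immutable (tlog file, file position) pair marking the next entry to read (names are hypothetical):

```java
// Hypothetical LogPointer: identifies a position within a transaction log.
final class LogPointer {
    final String tlogFile;  // which transaction log file
    final long position;    // byte offset of the next entry within that file

    LogPointer(String tlogFile, long position) {
        this.tlogFile = tlogFile;
        this.position = position;
    }

    // Advance within the same file, e.g. after reading one entry.
    LogPointer advanceTo(long newPosition) {
        return new LogPointer(tlogFile, newPosition);
    }

    @Override public String toString() { return tlogFile + "@" + position; }
}
```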

h4. Log Reader

The UpdateLog must provide a log reader, denoted LogReader, that will be used 
by the CDC Replicator to search, scan and read the update logs. The LogReader 
will wrap a LogPointer and hide its management (e.g., instantiation, increment, 
release).

The operations that must be provided by the LogReader are:
* Scan: move the LogPointer to the next entry
* Read: read the log entry pointed to by the LogPointer
* Lookup: look up a version number - this will be performed during the 
initialisation of the CDC Replicator / election of a new leader, and therefore 
rarely.

The LogReader must not only read old tlog files, but also the new tlog file 
(i.e., the transaction log being written). This requires specific logic, since 
a LogReader can be exhausted at time t1 and have new entries available at 
time t2.
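The shape of such a reader could look as follows; this is an illustrative sketch with a trivial in-memory implementation, not the proposed code. Note that next() returning null means "exhausted for now", not end-of-stream, since the current tlog may still grow:

```java
import java.util.List;

// Hypothetical LogReader API: scan/read collapsed into next(), lookup as seek().
interface LogReader {
    Long next();                // advance and return the next version, or null if exhausted for now
    boolean seek(long version); // position the reader on a given version number
}

// Trivial in-memory stand-in for illustration only.
class ListLogReader implements LogReader {
    private final List<Long> versions;
    private int pos = 0;

    ListLogReader(List<Long> versions) { this.versions = versions; }

    public Long next() {
        return pos < versions.size() ? versions.get(pos++) : null;
    }

    public boolean seek(long version) {
        int i = versions.indexOf(version);
        if (i < 0) return false;
        pos = i;
        return true;
    }
}
```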

h4. Log Index

In order to support efficient lookup of version numbers across a large number 
of tlog files, we need a pre-computed index of version numbers across tlog 
files.
The index could be designed as a list of tlog files, each associated with its 
lower and upper bounds in terms of version numbers. A search would then read 
this index to quickly find the tlog files containing a given version number, 
and then read those tlog files to find the associated entry.
However, a single tlog file can be large in certain scenarios. Therefore, we 
could add a secondary index per tlog file, containing a list of (version, 
pointer) pairs. This would allow the LogReader to quickly find an entry 
without having to scan the full tlog file. This index would be created and 
managed by the TransactionLog.
However, this secondary index duplicates the version number for each log 
entry. A possible optimisation is to modify the format of the transaction log 
so that the version number is not stored as part of the log entry.
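The two-level lookup could be sketched as follows (all names hypothetical): a primary index of per-file version bounds narrows the search to one file, and the secondary per-file index jumps close to the entry:

```java
import java.util.HashMap;
import java.util.Map;
import java.util.NavigableMap;
import java.util.TreeMap;

// Illustrative two-level version lookup: file -> [min, max] version bounds,
// plus a per-file map from version to byte offset within that file.
class VersionLookup {
    private final Map<String, long[]> bounds = new HashMap<>();
    private final Map<String, NavigableMap<Long, Long>> perFile = new HashMap<>();

    void addEntry(String file, long version, long offset) {
        bounds.merge(file, new long[]{version, version},
            (a, b) -> new long[]{Math.min(a[0], b[0]), Math.max(a[1], b[1])});
        perFile.computeIfAbsent(file, f -> new TreeMap<>()).put(version, offset);
    }

    // Returns "file@offset" for the indexed entry at or before `version`,
    // or null if no tlog file covers that version.
    String locate(long version) {
        for (Map.Entry<String, long[]> e : bounds.entrySet()) {
            long[] b = e.getValue();
            if (version >= b[0] && version <= b[1]) {
                Map.Entry<Long, Long> hit = perFile.get(e.getKey()).floorEntry(version);
                if (hit != null) return e.getKey() + "@" + hit.getValue();
            }
        }
        return null;
    }
}
```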

h4. Transaction Log

The TransactionLog class opens the tlog file in its constructor. This could 
be problematic with a large number of tlog files, as it would exhaust the 
available file descriptors. One possible solution is to create a subclass for 
read-only mode that does not open the file in the constructor; instead, the 
file would be opened and closed on demand via the TransactionLog#LogReader. 
The CDCR Update Log would take care of converting old transaction log objects 
into this read-only version.
However, this has indirect consequences on the initialisation of the 
UpdateLog, more precisely in the recovery phase (#recoverFromLog), as the 
UpdateLog might write a commit (line 1418) at the end of an old tlog during 
replay.

h4. Integration within the UpdateHandler

We will have to extend the UpdateHandler constructor so that the UpdateLog 
implementation can be switched based on configuration keys in the 
solrconfig.xml file.





[jira] [Updated] (SOLR-6460) Keep transaction logs around longer

2014-09-24 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-6460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated SOLR-6460:

Attachment: SOLR-6460.patch

Here is a first patch with an initial implementation of the CdcrUpdateLog, 
which includes:
* the cleaning of old logs based on log pointers;
* a log reader that reads both the old and new tlog files.
There are many nocommits and todos, but this might provide enough material for 
discussion.




[jira] [Commented] (SOLR-6463) track update progress

2014-09-18 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-6463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139076#comment-14139076
 ] 

Renaud Delbru commented on SOLR-6463:
-

Hi Yonik, All,

Here is a proposal for tracking update progress in CDCR. For now, we assume an 
active-passive scenario, where one source cluster forwards updates to one or 
more target clusters. I am looking forward to reading your feedback on this 
proposal.

h4. Updates Tracking & Pushing

CDCR replicates data updates from the source to the target Data Center by 
leveraging the Updates Log. A background thread regularly checks the Updates 
Log for new entries, and then forwards them to the target Data Center. The 
thread therefore needs to keep a checkpoint in the form of a pointer to the 
last update successfully processed in the Updates Log. Upon acknowledgement 
from the target Data Center that updates have been successfully processed, the 
Updates Log pointer is updated to reflect the current checkpoint.

This pointer must be synchronized across all the replicas. In the case where 
the leader goes down and a new leader is elected, the new leader will be able 
to resume replication to the last update by using this synchronized pointer. 
The strategy to synchronize such a pointer across replicas will be explained 
next.

If, for some reason, the target Data Center is offline or fails to process 
the updates, the thread will periodically try to contact it again and push 
the updates.
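The tracking loop above can be sketched as follows; all names are hypothetical, and batches stand in for real Updates Log entries. The key invariant is that the checkpoint advances only on acknowledgement, so an offline target simply leaves it in place for a later retry:

```java
import java.util.List;
import java.util.function.Predicate;
import java.util.stream.Collectors;

// Illustrative forwarder: push entries newer than the checkpoint and advance
// the checkpoint only once the target Data Center acknowledges them.
class Forwarder {
    private long checkpoint;                   // last version acked by the target
    private final Predicate<List<Long>> push;  // sends a batch; true on ack

    Forwarder(long start, Predicate<List<Long>> push) {
        this.checkpoint = start;
        this.push = push;
    }

    // One iteration of the background thread.
    void runOnce(List<Long> updatesLog) {
        List<Long> batch = updatesLog.stream()
            .filter(v -> v > checkpoint)
            .collect(Collectors.toList());
        if (!batch.isEmpty() && push.test(batch)) {
            checkpoint = batch.get(batch.size() - 1);  // advance on ack only
        }
    }

    long checkpoint() { return checkpoint; }
}
```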

h4. Synchronization of Update Checkpoints

A reliable synchronization of the update checkpoints between the shard leader 
and shard replicas is critical to avoid introducing inconsistency between the 
source and target Data Centers. Another important requirement is that the 
synchronization must be performed with minimal network traffic to maximize 
scalability.

In order to achieve this, the strategy is to:
* Uniquely identify each update operation. This unique identifier will serve as 
pointer. 
* Rely on two storages: an ephemeral storage on the source shard leader, and a 
persistent storage on the target cluster.

The shard leader in the source cluster will be in charge of generating a 
unique identifier for each update operation, and will keep an in-memory copy 
of the identifier of the last processed update. The identifier will be sent to 
the target cluster as part of the update request. On the target Data Center 
side, the shard leader will receive the update request, store it along with 
the unique identifier in the Updates Log, and replicate it to the other shards.

SolrCloud already provides a unique identifier for each update operation, 
i.e., a “version” number. This version number is generated using a time-based 
Lamport clock, which is incremented for each update operation sent. This 
provides a “happened-before” ordering of the update operations that will be 
leveraged in (1) the initialisation of the update checkpoint on the source 
cluster, and (2) the maintenance strategy of the Updates Log.

The persistent storage on the target cluster is used only during the election 
of a new shard leader on the source cluster. If a shard leader goes down on the 
source cluster and a new leader is elected, the new leader will contact the 
target cluster to retrieve the last update checkpoint and instantiate its 
ephemeral pointer. On such a request, the target cluster will retrieve the 
latest identifier received across all the shards, and send it back to the 
source cluster. To retrieve the latest identifier, every shard leader will 
look up the identifier of the first entry in its Updates Log and send it back 
to a coordinator. The coordinator will then select the highest among them.
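The coordinator's selection step reduces to taking the maximum over the per-shard reports; a one-line sketch with hypothetical names:

```java
import java.util.Collections;
import java.util.List;

// Illustrative: each target shard leader reports the version of the first
// entry in its Updates Log; the highest of these becomes the checkpoint
// returned to the source cluster.
class CheckpointCoordinator {
    static long selectCheckpoint(List<Long> firstVersionPerShard) {
        return Collections.max(firstVersionPerShard);
    }
}
```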

This strategy does not require any additional network traffic and ensures 
reliable pointer synchronization. Consistency is principally achieved by 
leveraging SolrCloud. The update workflow of SolrCloud ensures that every 
update is applied to the leader as well as to the replicas. If the leader 
goes down, a new leader is elected. During the leader election, a 
synchronisation is performed between the new leader and the other replicas. 
As a result, the new leader has an Updates Log consistent with the previous 
leader's. Having a consistent Updates Log means that:
* On the source cluster, the update checkpoint can be reused by the new leader.
* On the target cluster, the update checkpoint will be consistent between the 
previous and new leader. This ensures the correctness of the update checkpoint 
sent by a newly elected leader on the target cluster.

h6. Impact of Solr’s Update Reordering

The Updates Log can differ between the leader and the replicas, but not in an 
inconsistent way. During leader to replica synchronisation, Solr’s Distributed 
Update Processor will take care of reordering the update

[jira] [Created] (LUCENE-4919) IntsRef, BytesRef and CharsRef returns incorrect hashcode when filled with 0

2013-04-09 Thread Renaud Delbru (JIRA)
Renaud Delbru created LUCENE-4919:
-

 Summary: IntsRef, BytesRef and CharsRef returns incorrect hashcode 
when filled with 0
 Key: LUCENE-4919
 URL: https://issues.apache.org/jira/browse/LUCENE-4919
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/other
Affects Versions: 4.2
Reporter: Renaud Delbru
 Fix For: 4.3


IntsRef, BytesRef and CharsRef implementation does not follow the java 
Arrays.hashCode implementation, and returns incorrect hashcode when filled with 
0. 
For example, an IntsRef with { 0 } will return the same hashcode than an 
IntsRef with { 0, 0 }.
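The collision can be reproduced with a simplified stand-in for the *Ref hash of the time (a 31-multiplier hash seeded with 0; the actual Lucene code differs in details), contrasted with Arrays.hashCode, which seeds with 1:

```java
// Simplified stand-in for the old *Ref hashCode: a multiplicative hash seeded
// with 0, so every all-zero array hashes to 0 regardless of its length.
class HashDemo {
    static int refStyleHash(int[] a) {
        int h = 0;                    // seed 0: zero elements contribute nothing
        for (int v : a) h = 31 * h + v;
        return h;
    }
}
```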

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-4919) IntsRef, BytesRef and CharsRef returns incorrect hashcode when filled with 0

2013-04-09 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-4919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated LUCENE-4919:
--

Description: 
IntsRef, BytesRef and CharsRef implementation does not follow the java 
Arrays.hashCode implementation, and returns incorrect hashcode when filled with 
0. 
For example, an IntsRef with \{ 0 \} will return the same hashcode than an 
IntsRef with \{ 0, 0 \}.

  was:
IntsRef, BytesRef and CharsRef implementation does not follow the java 
Arrays.hashCode implementation, and returns incorrect hashcode when filled with 
0. 
For example, an IntsRef with { 0 } will return the same hashcode than an 
IntsRef with { 0, 0 }.





[jira] [Updated] (LUCENE-4919) IntsRef, BytesRef and CharsRef returns incorrect hashcode when filled with 0

2013-04-09 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-4919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated LUCENE-4919:
--

Attachment: LUCENE-4919.patch

Here is a patch for IntsRef, BytesRef and CharsRef, including unit tests. The 
new hashcode implementation is identical to the one found in Arrays.hashCode.




[jira] [Updated] (LUCENE-4919) IntsRef, BytesRef and CharsRef return incorrect hashcode when filled with 0

2013-04-09 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-4919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated LUCENE-4919:
--

Description: 
IntsRef, BytesRef and CharsRef implementation do not follow the java 
Arrays.hashCode implementation, and return incorrect hashcode when filled with 
0. 
For example, an IntsRef with \{ 0 \} will return the same hashcode than an 
IntsRef with \{ 0, 0 \}.

  was:
IntsRef, BytesRef and CharsRef implementation does not follow the java 
Arrays.hashCode implementation, and returns incorrect hashcode when filled with 
0. 
For example, an IntsRef with \{ 0 \} will return the same hashcode than an 
IntsRef with \{ 0, 0 \}.

Summary: IntsRef, BytesRef and CharsRef return incorrect hashcode when 
filled with 0  (was: IntsRef, BytesRef and CharsRef returns incorrect hashcode 
when filled with 0)




[jira] [Commented] (LUCENE-4919) IntsRef, BytesRef and CharsRef return incorrect hashcode when filled with 0

2013-04-09 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13626454#comment-13626454
 ] 

Renaud Delbru commented on LUCENE-4919:
---

Hi Robert,

From my understanding, this applies only to BytesRef (even if this behavior 
sounds dangerous to me). However, why do IntsRef and CharsRef follow the same 
behavior?




[jira] [Commented] (LUCENE-4919) IntsRef, BytesRef and CharsRef return incorrect hashcode when filled with 0

2013-04-09 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13626458#comment-13626458
 ] 

Renaud Delbru commented on LUCENE-4919:
---

I see that BytesRef is used in various contexts quite different from the 
TermsHash context. This hashcode behavior might cause unexpected problems, as 
I am sure most users of BytesRef are unaware of it.




[jira] [Commented] (LUCENE-4919) IntsRef, BytesRef and CharsRef return incorrect hashcode when filled with 0

2013-04-09 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13626471#comment-13626471
 ] 

Renaud Delbru commented on LUCENE-4919:
---

Ok, I understand, Robert. That sounds like a big task. I can try to make a 
first pass over it in the next few days if you think it is worth it 
(personally, I would feel more reassured knowing that the hashcode follows a 
more common behavior).




[jira] [Commented] (LUCENE-4919) IntsRef, BytesRef and CharsRef return incorrect hashcode when filled with 0

2013-04-09 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13626477#comment-13626477
 ] 

Renaud Delbru commented on LUCENE-4919:
---

@Simon: I discovered the issue when using IntsRef. During query processing, I 
am streaming arrays of integers using IntsRef. I was relying on the hashCode 
to compute a unique identifier for the content of a particular IntsRef until 
I started to see unexpected results in my unit tests. Then I saw that the same 
behaviour is found in the other *Ref classes.
I could live without it and bypass the problem by changing my implementation 
(and computing my own hash code myself). But I thought this behaviour is not 
very clear to the user and could potentially be dangerous, and therefore worth 
sharing with you.




[jira] [Commented] (LUCENE-4919) IntsRef, BytesRef and CharsRef return incorrect hashcode when filled with 0

2013-04-09 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13626480#comment-13626480
 ] 

Renaud Delbru commented on LUCENE-4919:
---

Maybe a simpler solution would be to clearly state this behavior in the 
javadoc of all these methods.




[jira] [Commented] (LUCENE-4919) IntsRef, BytesRef and CharsRef return incorrect hashcode when filled with 0

2013-04-09 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13626486#comment-13626486
 ] 

Renaud Delbru commented on LUCENE-4919:
---

I agree with you, Dawid, but this particular behaviour increases the chance 
of getting the same hash for certain types of input. Anyway, I think the 
general decision is not to change the hashCode behaviour ;o), and I am fine 
with that. Feel free to close the issue.
Thanks, and sorry for the distraction.




[jira] [Closed] (LUCENE-4919) IntsRef, BytesRef and CharsRef return incorrect hashcode when filled with 0

2013-04-09 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-4919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru closed LUCENE-4919.
-

Resolution: Not A Problem




[jira] [Commented] (LUCENE-4642) Add create(AttributeFactory) to TokenizerFactory and subclasses with ctors taking AttributeFactory, and remove Tokenizer's and subclasses' ctors taking AttributeSource

2013-03-20 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13607809#comment-13607809
 ] 

Renaud Delbru commented on LUCENE-4642:
---

Thanks for committing this, Steve and Robert. That's great.

 Add create(AttributeFactory) to TokenizerFactory and subclasses with ctors 
 taking AttributeFactory, and remove Tokenizer's and subclasses' ctors taking 
 AttributeSource
 ---

 Key: LUCENE-4642
 URL: https://issues.apache.org/jira/browse/LUCENE-4642
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/analysis
Affects Versions: 4.1
Reporter: Renaud Delbru
Assignee: Steve Rowe
  Labels: analysis, attribute, tokenizer
 Fix For: 5.0, 4.3

 Attachments: LUCENE-4642.patch, LUCENE-4642.patch, LUCENE-4642.patch, 
 LUCENE-4642.patch, 
 LUCENE-4642-single-create-method-on-TokenizerFactory-subclasses.patch, 
 LUCENE-4642-single-create-method-on-TokenizerFactory-subclasses.patch, 
 TrieTokenizerFactory.java.patch


 All tokenizer implementations have a constructor that takes a given 
 AttributeSource as parameter (LUCENE-1826).  These should be removed.
 TokenizerFactory does not provide an API to create tokenizers with a given 
 AttributeFactory, but quite a few tokenizers have constructors that take an 
 AttributeFactory.  TokenizerFactory should add a create(AttributeFactory) 
 method, as should subclasses for tokenizers with AttributeFactory accepting 
 ctors.




[jira] [Commented] (LUCENE-4642) TokenizerFactory should provide a create method with a given AttributeSource

2013-03-11 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13598688#comment-13598688
 ] 

Renaud Delbru commented on LUCENE-4642:
---

Hi Steve, I imagine things have been busy these past days with the 4.2 release. 
Do you need help to finalise this patch? Thanks.

 TokenizerFactory should provide a create method with a given AttributeSource
 

 Key: LUCENE-4642
 URL: https://issues.apache.org/jira/browse/LUCENE-4642
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/analysis
Affects Versions: 4.1
Reporter: Renaud Delbru
Assignee: Steve Rowe
  Labels: analysis, attribute, tokenizer
 Fix For: 4.2, 5.0

 Attachments: LUCENE-4642.patch, LUCENE-4642.patch, LUCENE-4642.patch, 
 TrieTokenizerFactory.java.patch


 All tokenizer implementations have a constructor that takes a given 
 AttributeSource as parameter (LUCENE-1826). However, the TokenizerFactory 
 does not provide an API to create tokenizers with a given AttributeSource.
 Side note: There are still a lot of tokenizers that do not provide 
 constructors that take AttributeSource and AttributeFactory.




[jira] [Commented] (LUCENE-4642) TokenizerFactory should provide a create method with a given AttributeSource

2013-02-26 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13587269#comment-13587269
 ] 

Renaud Delbru commented on LUCENE-4642:
---

Hi, any updates on the patch? Thanks.

 TokenizerFactory should provide a create method with a given AttributeSource
 

 Key: LUCENE-4642
 URL: https://issues.apache.org/jira/browse/LUCENE-4642
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/analysis
Affects Versions: 4.1
Reporter: Renaud Delbru
Assignee: Steve Rowe
  Labels: analysis, attribute, tokenizer
 Fix For: 4.2, 5.0

 Attachments: LUCENE-4642.patch, LUCENE-4642.patch, LUCENE-4642.patch, 
 TrieTokenizerFactory.java.patch


 All tokenizer implementations have a constructor that takes a given 
 AttributeSource as parameter (LUCENE-1826). However, the TokenizerFactory 
 does not provide an API to create tokenizers with a given AttributeSource.
 Side note: There are still a lot of tokenizers that do not provide 
 constructors that take AttributeSource and AttributeFactory.




[jira] [Commented] (LUCENE-4642) TokenizerFactory should provide a create method with a given AttributeSource

2013-02-14 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13578331#comment-13578331
 ] 

Renaud Delbru commented on LUCENE-4642:
---

Hi, would this patch be considered for inclusion at some point? Thanks.

 TokenizerFactory should provide a create method with a given AttributeSource
 

 Key: LUCENE-4642
 URL: https://issues.apache.org/jira/browse/LUCENE-4642
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/analysis
Affects Versions: 4.1
Reporter: Renaud Delbru
Assignee: Steve Rowe
  Labels: analysis, attribute, tokenizer
 Fix For: 4.2, 5.0

 Attachments: LUCENE-4642.patch, LUCENE-4642.patch, LUCENE-4642.patch, 
 TrieTokenizerFactory.java.patch


 All tokenizer implementations have a constructor that takes a given 
 AttributeSource as parameter (LUCENE-1826). However, the TokenizerFactory 
 does not provide an API to create tokenizers with a given AttributeSource.
 Side note: There are still a lot of tokenizers that do not provide 
 constructors that take AttributeSource and AttributeFactory.




[jira] [Updated] (LUCENE-4642) TokenizerFactory should provide a create method with a given AttributeSource

2013-02-03 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated LUCENE-4642:
--

Attachment: LUCENE-4642.patch

 TokenizerFactory should provide a create method with a given AttributeSource
 

 Key: LUCENE-4642
 URL: https://issues.apache.org/jira/browse/LUCENE-4642
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/analysis
Affects Versions: 4.1
Reporter: Renaud Delbru
Assignee: Steve Rowe
  Labels: analysis, attribute, tokenizer
 Fix For: 4.2, 5.0

 Attachments: LUCENE-4642.patch, LUCENE-4642.patch, LUCENE-4642.patch, 
 TrieTokenizerFactory.java.patch


 All tokenizer implementations have a constructor that takes a given 
 AttributeSource as parameter (LUCENE-1826). However, the TokenizerFactory 
 does not provide an API to create tokenizers with a given AttributeSource.
 Side note: There are still a lot of tokenizers that do not provide 
 constructors that take AttributeSource and AttributeFactory.




[jira] [Commented] (LUCENE-4642) TokenizerFactory should provide a create method with a given AttributeSource

2013-02-03 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13569883#comment-13569883
 ] 

Renaud Delbru commented on LUCENE-4642:
---

Hi,

I have submitted a patch which integrates:
- the patch from Uwe
- the removal of the Tokenizer(AttributeSource) constructor
- the addition of a TokenizerFactory.create(AttributeFactory) method
- some of the changes from Steve's previous patch (e.g., the 
TokenizerFactory.create method throws UnsupportedOperationException by default)

All test suites are passing.

 TokenizerFactory should provide a create method with a given AttributeSource
 

 Key: LUCENE-4642
 URL: https://issues.apache.org/jira/browse/LUCENE-4642
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/analysis
Affects Versions: 4.1
Reporter: Renaud Delbru
Assignee: Steve Rowe
  Labels: analysis, attribute, tokenizer
 Fix For: 4.2, 5.0

 Attachments: LUCENE-4642.patch, LUCENE-4642.patch, LUCENE-4642.patch, 
 TrieTokenizerFactory.java.patch


 All tokenizer implementations have a constructor that takes a given 
 AttributeSource as parameter (LUCENE-1826). However, the TokenizerFactory 
 does not provide an API to create tokenizers with a given AttributeSource.
 Side note: There are still a lot of tokenizers that do not provide 
 constructors that take AttributeSource and AttributeFactory.




[jira] [Commented] (LUCENE-4642) TokenizerFactory should provide a create method with a given AttributeSource

2013-01-28 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13564422#comment-13564422
 ] 

Renaud Delbru commented on LUCENE-4642:
---

Great, I think that AttributeFactory hack could work for us. Would you agree to 
add a TokenizerFactory.create(AttributeFactory) method? I could prepare a 
patch for that.

 TokenizerFactory should provide a create method with a given AttributeSource
 

 Key: LUCENE-4642
 URL: https://issues.apache.org/jira/browse/LUCENE-4642
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/analysis
Affects Versions: 4.1
Reporter: Renaud Delbru
Assignee: Steve Rowe
  Labels: analysis, attribute, tokenizer
 Fix For: 4.2, 5.0

 Attachments: LUCENE-4642.patch, LUCENE-4642.patch, 
 TrieTokenizerFactory.java.patch


 All tokenizer implementations have a constructor that takes a given 
 AttributeSource as parameter (LUCENE-1826). However, the TokenizerFactory 
 does not provide an API to create tokenizers with a given AttributeSource.
 Side note: There are still a lot of tokenizers that do not provide 
 constructors that take AttributeSource and AttributeFactory.




[jira] [Commented] (LUCENE-4642) TokenizerFactory should provide a create method with a given AttributeSource

2013-01-27 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13563784#comment-13563784
 ] 

Renaud Delbru commented on LUCENE-4642:
---

Hi Robert,

I understand your point of view. One possible alternative for simplifying the 
API would be to refactor the constructors taking AttributeSource/AttributeFactory 
into setters. After a quick look, this looks compatible with the existing 
tokenizers and tokenizer factories. 
Setting the AttributeSource/AttributeFactory of a tokenizer would be 
transparent (i.e., subclasses would not have to declare an explicit 
constructor), and specific extensions could still be implemented by subclasses 
(e.g., NumericTokenStream could override the setAttributeFactory method to wrap 
a given factory with NumericAttributeFactory).
For the tokenizer factories, we could then implement a create method with an 
AttributeSource/AttributeFactory parameter, which would call the abstract 
create method and then call setAttributeSource/setAttributeFactory on the newly 
created tokenizer.

What do you think? Did I miss something in my reasoning that could break 
something? 
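The setter-based refactoring proposed here can be sketched with simplified stand-ins for Lucene's classes (none of these signatures are Lucene's actual API; they only illustrate the idea of injecting the source after construction):

```java
// Sketch: a setter replaces the AttributeSource constructor, and the factory's
// concrete create(AttributeSource) builds the tokenizer, then injects the source.
public class SetterRefactorSketch {
    static class AttributeSource {}

    static abstract class Tokenizer {
        private AttributeSource attributes = new AttributeSource();
        // Subclasses could override this to wrap the incoming source,
        // the way a NumericTokenStream-like class might wrap a factory.
        void setAttributeSource(AttributeSource source) { this.attributes = source; }
        AttributeSource getAttributeSource() { return attributes; }
    }

    static abstract class TokenizerFactory {
        abstract Tokenizer create(); // the single abstract factory method
        final Tokenizer create(AttributeSource source) {
            Tokenizer t = create();          // subclass builds the tokenizer
            t.setAttributeSource(source);    // then the shared source is injected
            return t;
        }
    }

    public static void main(String[] args) {
        TokenizerFactory factory = new TokenizerFactory() {
            Tokenizer create() { return new Tokenizer() {}; }
        };
        AttributeSource shared = new AttributeSource();
        Tokenizer a = factory.create(shared);
        Tokenizer b = factory.create(shared);
        System.out.println(a.getAttributeSource() == b.getAttributeSource()); // prints "true"
    }
}
```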

 TokenizerFactory should provide a create method with a given AttributeSource
 

 Key: LUCENE-4642
 URL: https://issues.apache.org/jira/browse/LUCENE-4642
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/analysis
Affects Versions: 4.1
Reporter: Renaud Delbru
Assignee: Steve Rowe
  Labels: analysis, attribute, tokenizer
 Fix For: 4.2, 5.0

 Attachments: LUCENE-4642.patch, LUCENE-4642.patch


 All tokenizer implementations have a constructor that takes a given 
 AttributeSource as parameter (LUCENE-1826). However, the TokenizerFactory 
 does not provide an API to create tokenizers with a given AttributeSource.
 Side note: There are still a lot of tokenizers that do not provide 
 constructors that take AttributeSource and AttributeFactory.




[jira] [Commented] (LUCENE-4642) TokenizerFactory should provide a create method with a given AttributeSource

2013-01-25 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13562740#comment-13562740
 ] 

Renaud Delbru commented on LUCENE-4642:
---

Hi, 

are there still open questions on this issue that block the patch from being 
committed? 

 TokenizerFactory should provide a create method with a given AttributeSource
 

 Key: LUCENE-4642
 URL: https://issues.apache.org/jira/browse/LUCENE-4642
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/analysis
Affects Versions: 4.1
Reporter: Renaud Delbru
Assignee: Steve Rowe
  Labels: analysis, attribute, tokenizer
 Fix For: 4.2, 5.0

 Attachments: LUCENE-4642.patch, LUCENE-4642.patch


 All tokenizer implementations have a constructor that takes a given 
 AttributeSource as parameter (LUCENE-1826). However, the TokenizerFactory 
 does not provide an API to create tokenizers with a given AttributeSource.
 Side note: There are still a lot of tokenizers that do not provide 
 constructors that take AttributeSource and AttributeFactory.




[jira] [Commented] (LUCENE-4642) TokenizerFactory should provide a create method with a given AttributeSource

2013-01-25 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13562853#comment-13562853
 ] 

Renaud Delbru commented on LUCENE-4642:
---

@steve:

{quote}
have you looked at TeeSinkTokenFilter
{quote}

Yes, and from my current understanding it is similar to our current 
implementation. The problem with this approach is that the exchange of 
attributes is performed through the AttributeSource.State API with 
AttributeSource#captureState and AttributeSource#restoreState, which copies the 
values of all attribute implementations that the state contains. This is very 
inefficient, as it has to copy arrays and other objects (e.g., char term 
arrays) for every single token.

@robert:

Concerning the problem of UOEs, Steve's new patch reduces the number of UOEs to 
one, which is much more reasonable than my first approach. Looking at the 
current state of the Lucene trunk, there are already a lot of UOEs in many 
places, so I would suggest that this may not be a blocking problem (but I might 
be wrong).

Concerning the problem of constructor explosion, maybe we can find a consensus. 
Your proposal of removing Tokenizer(AttributeSource) cannot work for us, as we 
need it to share the same AttributeSource across multiple streams. However, as 
I proposed, removing Tokenizer(AttributeFactory) could work, as it can be 
emulated using Tokenizer(AttributeSource).
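A toy model of the per-token overhead being discussed, using an invented CharTermAttribute stand-in rather than Lucene's real class: a captureState/restoreState-style exchange deep-copies the term buffer for every token, while a shared attribute source lets both streams read the same object with no copy at all:

```java
import java.util.Arrays;

// Illustrative only: models the cost difference between copying attribute
// state per token and sharing a single attribute instance between streams.
public class StateCopyDemo {
    static class CharTermAttribute {
        char[] buffer = new char[16];
        int length;
        // State-capture-style exchange: allocates and copies the buffer.
        CharTermAttribute copy() {
            CharTermAttribute c = new CharTermAttribute();
            c.buffer = Arrays.copyOf(buffer, buffer.length);
            c.length = length;
            return c;
        }
    }

    public static void main(String[] args) {
        CharTermAttribute parent = new CharTermAttribute();
        parent.buffer[0] = 'a';
        parent.length = 1;

        // Copy per token: the child gets its own array (allocation each time).
        CharTermAttribute copied = parent.copy();
        System.out.println(copied.buffer != parent.buffer); // prints "true"

        // Shared source: the child reads the very same attribute instance.
        CharTermAttribute shared = parent;
        System.out.println(shared.buffer == parent.buffer); // prints "true"
    }
}
```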



 TokenizerFactory should provide a create method with a given AttributeSource
 

 Key: LUCENE-4642
 URL: https://issues.apache.org/jira/browse/LUCENE-4642
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/analysis
Affects Versions: 4.1
Reporter: Renaud Delbru
Assignee: Steve Rowe
  Labels: analysis, attribute, tokenizer
 Fix For: 4.2, 5.0

 Attachments: LUCENE-4642.patch, LUCENE-4642.patch


 All tokenizer implementations have a constructor that takes a given 
 AttributeSource as parameter (LUCENE-1826). However, the TokenizerFactory 
 does not provide an API to create tokenizers with a given AttributeSource.
 Side note: There are still a lot of tokenizers that do not provide 
 constructors that take AttributeSource and AttributeFactory.




[jira] [Commented] (LUCENE-4642) TokenizerFactory should provide a create method with a given AttributeSource

2013-01-21 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13558832#comment-13558832
 ] 

Renaud Delbru commented on LUCENE-4642:
---

{quote}
Personally: I think we should remove Tokenizer(AttributeSource): it bloats the 
APIs and causes ctor explosion.
{quote}

Why not the contrary instead, i.e., remove Tokenizer(AttributeFactory) and keep 
Tokenizer(AttributeSource), since AttributeFactory is a nested class of 
AttributeSource? Limiting the API to AttributeFactory only would restrict it 
unnecessarily, imho.

Our use case is to build advanced token streams, where one parent token 
stream can have multiple child token streams; the parent token stream shares 
its attribute source with the child token streams for performance reasons. 
Emulating this behaviour by copying attributes from stream to stream is really 
inefficient (our throughput is divided by at least 3).
A more concrete use case is the ability to create specific token streams for 
a particular token type. For example, our parent tokenizer tokenizes a string 
into a list of tokens, each one having a specific type. Each token is then 
processed downstream by child token streams, and the child token stream that 
processes a token depends on its token type attribute.
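The type-based routing described here can be sketched as follows; all names are invented for illustration and are not part of any Lucene API:

```java
import java.util.HashMap;
import java.util.Map;

// Sketch: a parent stream tags each token with a type, and a per-type table
// routes the token to the child stream responsible for that type.
public class TypeRoutingSketch {
    interface ChildStream { String process(String token); }

    public static void main(String[] args) {
        Map<String, ChildStream> childrenByType = new HashMap<>();
        childrenByType.put("NUM", token -> "num:" + token);
        childrenByType.put("WORD", token -> "word:" + token.toLowerCase());

        // Parent tokenizer output: (token, type) pairs.
        String[][] tokens = { { "42", "NUM" }, { "Lucene", "WORD" } };
        for (String[] t : tokens) {
            // The token type attribute selects the child stream.
            System.out.println(childrenByType.get(t[1]).process(t[0]));
        }
    }
}
```

In the real setup the parent and children would also share one attribute source, so routing a token involves no attribute copying.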

 TokenizerFactory should provide a create method with a given AttributeSource
 

 Key: LUCENE-4642
 URL: https://issues.apache.org/jira/browse/LUCENE-4642
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/analysis
Affects Versions: 4.1
Reporter: Renaud Delbru
Assignee: Steve Rowe
  Labels: analysis, attribute, tokenizer
 Fix For: 4.2, 5.0

 Attachments: LUCENE-4642.patch, LUCENE-4642.patch


 All tokenizer implementations have a constructor that takes a given 
 AttributeSource as parameter (LUCENE-1826). However, the TokenizerFactory 
 does not provide an API to create tokenizers with a given AttributeSource.
 Side note: There are still a lot of tokenizers that do not provide 
 constructors that take AttributeSource and AttributeFactory.




[jira] [Commented] (LUCENE-4642) TokenizerFactory should provide a create method with a given AttributeSource

2013-01-21 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13558850#comment-13558850
 ] 

Renaud Delbru commented on LUCENE-4642:
---

{quote}
Because its totally unrelated.
{quote}

Well, I think the user could simply create a new AttributeSource with a given 
AttributeFactory to emulate Tokenizer(AttributeFactory)? But that might 
put some burden on the user side.
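That emulation can be sketched with simplified stand-in classes (not Lucene's real API; the point is only that wrapping the factory in a source makes a factory-taking constructor redundant):

```java
// Sketch: instead of a Tokenizer(AttributeFactory) constructor, the caller
// wraps the factory in a new AttributeSource and passes that.
public class FactoryEmulationSketch {
    static class AttributeFactory {}

    static class AttributeSource {
        final AttributeFactory factory;
        AttributeSource(AttributeFactory factory) { this.factory = factory; }
    }

    static class Tokenizer {
        final AttributeSource attributes;
        Tokenizer(AttributeSource attributes) { this.attributes = attributes; }
    }

    public static void main(String[] args) {
        AttributeFactory myFactory = new AttributeFactory();
        // Emulate Tokenizer(AttributeFactory) with Tokenizer(AttributeSource):
        Tokenizer t = new Tokenizer(new AttributeSource(myFactory));
        System.out.println(t.attributes.factory == myFactory); // prints "true"
    }
}
```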

 TokenizerFactory should provide a create method with a given AttributeSource
 

 Key: LUCENE-4642
 URL: https://issues.apache.org/jira/browse/LUCENE-4642
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/analysis
Affects Versions: 4.1
Reporter: Renaud Delbru
Assignee: Steve Rowe
  Labels: analysis, attribute, tokenizer
 Fix For: 4.2, 5.0

 Attachments: LUCENE-4642.patch, LUCENE-4642.patch


 All tokenizer implementations have a constructor that takes a given 
 AttributeSource as parameter (LUCENE-1826). However, the TokenizerFactory 
 does not provide an API to create tokenizers with a given AttributeSource.
 Side note: There are still a lot of tokenizers that do not provide 
 constructors that take AttributeSource and AttributeFactory.




[jira] [Commented] (LUCENE-4642) TokenizerFactory should provide a create method with a given AttributeSource

2013-01-16 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13555146#comment-13555146
 ] 

Renaud Delbru commented on LUCENE-4642:
---

Could someone from the team tell us whether this patch may be considered for 
inclusion at some point? We currently need it in our project, so it is 
blocking our development. Thanks.

 TokenizerFactory should provide a create method with a given AttributeSource
 

 Key: LUCENE-4642
 URL: https://issues.apache.org/jira/browse/LUCENE-4642
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/analysis
Affects Versions: 4.1
Reporter: Renaud Delbru
  Labels: analysis, attribute, tokenizer
 Fix For: 4.2, 5.0

 Attachments: LUCENE-4642.patch


 All tokenizer implementations have a constructor that takes a given 
 AttributeSource as parameter (LUCENE-1826). However, the TokenizerFactory 
 does not provide an API to create tokenizers with a given AttributeSource.
 Side note: There are still a lot of tokenizers that do not provide 
 constructors that take AttributeSource and AttributeFactory.




[jira] [Commented] (LUCENE-4642) TokenizerFactory should provide a create method with a given AttributeSource

2013-01-02 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13542144#comment-13542144
 ] 

Renaud Delbru commented on LUCENE-4642:
---

Hi,

Any plans to commit this patch? Or is there additional work to do first?

Thanks

 TokenizerFactory should provide a create method with a given AttributeSource
 

 Key: LUCENE-4642
 URL: https://issues.apache.org/jira/browse/LUCENE-4642
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/analysis
Affects Versions: 4.1
Reporter: Renaud Delbru
  Labels: analysis, attribute, tokenizer
 Fix For: 4.2, 5.0

 Attachments: LUCENE-4642.patch


 All tokenizer implementations have a constructor that takes a given 
 AttributeSource as parameter (LUCENE-1826). However, the TokenizerFactory 
 does not provide an API to create tokenizers with a given AttributeSource.
 Side note: There are still a lot of tokenizers that do not provide 
 constructors that take AttributeSource and AttributeFactory.




[jira] [Created] (LUCENE-4642) TokenizerFactory should provide a create method with a given AttributeSource

2012-12-20 Thread Renaud Delbru (JIRA)
Renaud Delbru created LUCENE-4642:
-

 Summary: TokenizerFactory should provide a create method with a 
given AttributeSource
 Key: LUCENE-4642
 URL: https://issues.apache.org/jira/browse/LUCENE-4642
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/analysis
Affects Versions: 4.1
Reporter: Renaud Delbru
 Fix For: 4.1


All tokenizer implementations have a constructor that takes a given 
AttributeSource as parameter (LUCENE-1826). However, the TokenizerFactory does 
not provide an API to create tokenizers with a given AttributeSource.

Side note: There are still a lot of tokenizers that do not provide constructors 
that take AttributeSource and AttributeFactory.




[jira] [Updated] (LUCENE-4642) TokenizerFactory should provide a create method with a given AttributeSource

2012-12-20 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated LUCENE-4642:
--

Attachment: LUCENE-4642.patch

Patch adding #create(AttributeSource source, Reader reader) to the 
TokenizerFactory class and to all its subclasses.

Given that a lot of tokenizers do not have constructors that take a given 
AttributeSource, I have implemented the new create method in their 
respective factories to throw an UnsupportedOperationException.
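The default-UOE approach described in this patch can be sketched with simplified stand-ins (not the actual Lucene classes; only the shape of the API is illustrated):

```java
import java.io.Reader;
import java.io.StringReader;

// Sketch: TokenizerFactory gains a create(AttributeSource, Reader) variant,
// and factories whose tokenizer has no AttributeSource constructor throw.
public class UoeDefaultSketch {
    static class AttributeSource {}
    static class Tokenizer {}

    static abstract class TokenizerFactory {
        abstract Tokenizer create(Reader reader);
        // Default for factories that cannot honor a caller-supplied source.
        Tokenizer create(AttributeSource source, Reader reader) {
            throw new UnsupportedOperationException(
                getClass().getSimpleName() + " cannot use a given AttributeSource");
        }
    }

    public static void main(String[] args) {
        TokenizerFactory legacy = new TokenizerFactory() {
            Tokenizer create(Reader reader) { return new Tokenizer(); }
        };
        try {
            legacy.create(new AttributeSource(), new StringReader("text"));
        } catch (UnsupportedOperationException e) {
            System.out.println("unsupported"); // prints "unsupported"
        }
    }
}
```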

 TokenizerFactory should provide a create method with a given AttributeSource
 

 Key: LUCENE-4642
 URL: https://issues.apache.org/jira/browse/LUCENE-4642
 Project: Lucene - Core
  Issue Type: Improvement
  Components: modules/analysis
Affects Versions: 4.1
Reporter: Renaud Delbru
  Labels: analysis, attribute, tokenizer
 Fix For: 4.1

 Attachments: LUCENE-4642.patch


 All tokenizer implementations have a constructor that takes a given 
 AttributeSource as parameter (LUCENE-1826). However, the TokenizerFactory 
 does not provide an API to create tokenizers with a given AttributeSource.
 Side note: There are still a lot of tokenizers that do not provide 
 constructors that take AttributeSource and AttributeFactory.




[jira] [Updated] (LUCENE-4613) CompressingStoredFieldsWriter ignores the segment suffix if writing aborted

2012-12-12 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-4613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated LUCENE-4613:
--

Attachment: LUCENE-4613.patch

A first refactoring that tries to keep 
{{CompressingCodec#randomInstance(Random random)}} backward compatible. Let me 
know if this is good enough. Tests are passing, as is the specific 
TestIndexFileDeleter test case you previously reported.

 CompressingStoredFieldsWriter ignores the segment suffix if writing aborted
 ---

 Key: LUCENE-4613
 URL: https://issues.apache.org/jira/browse/LUCENE-4613
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/codecs
Affects Versions: 4.1
Reporter: Renaud Delbru
 Fix For: 4.1

 Attachments: LUCENE-4613.patch, LUCENE-4613.patch, LUCENE-4613.patch


 If the writing is aborted, CompressingStoredFieldsWriter does not remove 
 partially-written files as the segment suffix is not taken into consideration.




[jira] [Commented] (LUCENE-4613) CompressingStoredFieldsWriter ignores the segment suffix if writing aborted

2012-12-11 Thread Renaud Delbru (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-4613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13528847#comment-13528847
 ] 

Renaud Delbru commented on LUCENE-4613:
---

Ok, I'll upload something today.

 CompressingStoredFieldsWriter ignores the segment suffix if writing aborted
 ---

 Key: LUCENE-4613
 URL: https://issues.apache.org/jira/browse/LUCENE-4613
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/codecs
Affects Versions: 4.1
Reporter: Renaud Delbru
 Fix For: 4.1

 Attachments: LUCENE-4613.patch


 If the writing is aborted, CompressingStoredFieldsWriter does not remove 
 partially-written files as the segment suffix is not taken into consideration.




[jira] [Updated] (LUCENE-4613) CompressingStoredFieldsWriter ignores the segment suffix if writing aborted

2012-12-11 Thread Renaud Delbru (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-4613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Renaud Delbru updated LUCENE-4613:
--

Attachment: LUCENE-4613.patch

New patch with a unit test that checks that partially written files are removed 
if writing is aborted. 
I had to modify the API of CompressingStoredFieldsFormat a bit to make the test 
possible. Also, CompressingCodec now always adds a segment suffix. We 
might be able to improve this by randomly adding a segment suffix or not.

 CompressingStoredFieldsWriter ignores the segment suffix if writing aborted
 ---

 Key: LUCENE-4613
 URL: https://issues.apache.org/jira/browse/LUCENE-4613
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/codecs
Affects Versions: 4.1
Reporter: Renaud Delbru
 Fix For: 4.1

 Attachments: LUCENE-4613.patch, LUCENE-4613.patch


 If the writing is aborted, CompressingStoredFieldsWriter does not remove 
 partially-written files as the segment suffix is not taken into consideration.



