[ 
https://issues.apache.org/jira/browse/SOLR-3180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yonik Seeley updated SOLR-3180:
-------------------------------

    Attachment: test_report_1.txt

Unfortunately, the chaos monkey tests have gathered some rust due to not being 
run regularly.

I've added some instrumentation for one particular type of fail (number of docs 
mismatch between control and collection) and ran into something peculiar.

selected lines from test_report_1.txt
{code}
6952 T39 oasc.ZkController.register Register shard - core:c
ollection1 address:http://127.0.0.1:60440/ij/pf shardId:shard1

9734 T56 oasc.ZkController.register Register shard - core:c
ollection1 address:http://127.0.0.1:48048/ij/pf shardId:shard1

12184 T71 oasc.ZkController.register Register shard - core:
collection1 address:http://127.0.0.1:33264/ij/pf shardId:shard2

59069 T27 C12 P60440 /update {wt=javabin&version=2} {add=[50067 
(1421643212854919168)]} 0 3

89123 T48 C1 P48048 /update {wt=javabin&version=2} {add=[50067 
(1421643244360433664)]} 0 10

89354 T25 C12 P60440 /update {wt=javabin&version=2} {delete=[50067 
(-1421643244613140480)]} 0 1

89364 T47 C1 P48048 /update {wt=javabin&version=2} {delete=[50067 
(-1421643244618383360)]} 0 6

90114 T63 C3 P33264 /update {wt=javabin&version=2} {add=[50067]} 0 31041

217860 T10 oasc.AbstractFullDistribZkTestBase.checkShardConsistency SEVERE 
document count mismatch.  control=1460 sum(shards)=1461 cloudClient=1461
[junit4:junit4]   2> ###### Only in cloudDocList: [{id=50067}]
{code}

The only updates we see from the logs for id 50067 are to collection1 (none to 
the control shard).

Further, grepping for "Register shard" shows that they are all for collection1 
and none for the control collection.  Not sure how that's possible and only be 
off by 1, so I'm probably mis-interpreting the logging somehow...
                
> ChaosMonkey test failures
> -------------------------
>
>                 Key: SOLR-3180
>                 URL: https://issues.apache.org/jira/browse/SOLR-3180
>             Project: Solr
>          Issue Type: Bug
>          Components: SolrCloud
>            Reporter: Yonik Seeley
>         Attachments: test_report_1.txt
>
>
> Handle intermittent failures in the ChaosMonkey tests.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to