[ https://issues.apache.org/jira/browse/SOLR-3180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Yonik Seeley updated SOLR-3180:
-------------------------------

    Attachment: test_report_1.txt

Unfortunately, the chaos monkey tests have gathered some rust due to not being run regularly.
I've added some instrumentation for one particular type of failure (a mismatch in the number of docs between control and collection) and ran into something peculiar.

Selected lines from test_report_1.txt:
{code}
6952 T39 oasc.ZkController.register Register shard - core:collection1 address:http://127.0.0.1:60440/ij/pf shardId:shard1
9734 T56 oasc.ZkController.register Register shard - core:collection1 address:http://127.0.0.1:48048/ij/pf shardId:shard1
12184 T71 oasc.ZkController.register Register shard - core:collection1 address:http://127.0.0.1:33264/ij/pf shardId:shard2
59069 T27 C12 P60440 /update {wt=javabin&version=2} {add=[50067 (1421643212854919168)]} 0 3
89123 T48 C1 P48048 /update {wt=javabin&version=2} {add=[50067 (1421643244360433664)]} 0 10
89354 T25 C12 P60440 /update {wt=javabin&version=2} {delete=[50067 (-1421643244613140480)]} 0 1
89364 T47 C1 P48048 /update {wt=javabin&version=2} {delete=[50067 (-1421643244618383360)]} 0 6
90114 T63 C3 P33264 /update {wt=javabin&version=2} {add=[50067]} 0 31041
217860 T10 oasc.AbstractFullDistribZkTestBase.checkShardConsistency SEVERE document count mismatch. control=1460 sum(shards)=1461 cloudClient=1461
[junit4:junit4] 2> ###### Only in cloudDocList: [{id=50067}]
{code}

The only updates we see in the logs for id 50067 are to collection1 (none to the control shard).
Further, grepping for "Register shard" shows that the registrations are all for collection1 and none are for the control collection.
Not sure how that's possible while only being off by 1, so I'm probably misinterpreting the logging somehow...
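The manual grep for id 50067 described above could be scripted roughly as follows. This is only a sketch: it assumes every `/update` log line has exactly the shape shown in the excerpt (timestamp, thread, core tag, port, a single document id per add or delete), and the names `UPDATE_RE`, `Update`, `trace_doc` and `SAMPLE` are hypothetical helpers, not part of the Solr test framework.

```python
import re
from typing import Iterable, List, NamedTuple, Optional

# Assumed line shape, taken from the excerpt in this comment:
#   <ms> T<thread> C<core> P<port> /update {params} {add|delete=[<id> (<version>)]} <status> <qtime>
# The "(<version>)" part is optional; one line in the excerpt has none.
UPDATE_RE = re.compile(
    r"^(?P<ms>\d+)\s+T\d+\s+C\d+\s+P(?P<port>\d+)\s+/update\s+"
    r"\{[^}]*\}\s+\{(?P<op>add|delete)=\[(?P<id>\d+)"
    r"(?:\s+\((?P<version>-?\d+)\))?\]\}\s+(?P<status>\d+)\s+(?P<qtime>\d+)"
)


class Update(NamedTuple):
    ms: int                 # timestamp at the start of the log line
    port: int               # port of the node that logged the request
    op: str                 # "add" or "delete"
    version: Optional[int]  # None when the log line carries no version
    qtime: int              # last number on the log line


def trace_doc(lines: Iterable[str], doc_id: int) -> List[Update]:
    """Return all /update entries for doc_id, ordered by timestamp."""
    hits = []
    for line in lines:
        m = UPDATE_RE.match(line.strip())
        if m and m.group("id") == str(doc_id):
            version = m.group("version")
            hits.append(Update(
                ms=int(m.group("ms")),
                port=int(m.group("port")),
                op=m.group("op"),
                version=int(version) if version is not None else None,
                qtime=int(m.group("qtime")),
            ))
    return sorted(hits, key=lambda u: u.ms)


# The five /update lines quoted from test_report_1.txt.
SAMPLE = """\
59069 T27 C12 P60440 /update {wt=javabin&version=2} {add=[50067 (1421643212854919168)]} 0 3
89123 T48 C1 P48048 /update {wt=javabin&version=2} {add=[50067 (1421643244360433664)]} 0 10
89354 T25 C12 P60440 /update {wt=javabin&version=2} {delete=[50067 (-1421643244613140480)]} 0 1
89364 T47 C1 P48048 /update {wt=javabin&version=2} {delete=[50067 (-1421643244618383360)]} 0 6
90114 T63 C3 P33264 /update {wt=javabin&version=2} {add=[50067]} 0 31041
""".splitlines()

if __name__ == "__main__":
    for update in trace_doc(SAMPLE, 50067):
        print(update)
```

Grouping the result by `port` shows which nodes logged an update for the document; the "Register shard" lines could be matched against those ports in the same way to see which core each port belongs to.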
> ChaosMonkey test failures
> -------------------------
>
>                 Key: SOLR-3180
>                 URL: https://issues.apache.org/jira/browse/SOLR-3180
>             Project: Solr
>          Issue Type: Bug
>          Components: SolrCloud
>            Reporter: Yonik Seeley
>         Attachments: test_report_1.txt
>
>
> Handle intermittent failures in the ChaosMonkey tests.