[
https://issues.apache.org/jira/browse/SOLR-5628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13869826#comment-13869826
]
Hoss Man commented on SOLR-5628:
--------------------------------
One thing that may be making this test special is that at the end of the test,
docs have been added that are not committed -- which may be why more cloud
tests aren't exhibiting similar symptoms more often.
I attempted to clone this test and then prune it down to try and distill out
the bare bones of what was wrong, but ran into an interesting discovery: the
exact same test class, with only the name changed, doesn't fail using the same
seed.
(note: i've verified the following steps on branch_4x both using r1556643 and
r1557806)
{noformat}
hossman@frisbee:~/lucene/4x_dev$ cat
solr/core/src/test/org/apache/solr/cloud/TestDistribDocBasedVersion.java | perl
-pe 's/TestDistribDocBasedVersion/TestUncommitedDocsInCloudTest/s' >
solr/core/src/test/org/apache/solr/cloud/TestUncommitedDocsInCloudTest.java
hossman@frisbee:~/lucene/4x_dev$ diff
solr/core/src/test/org/apache/solr/cloud/TestDistribDocBasedVersion.java
solr/core/src/test/org/apache/solr/cloud/TestUncommitedDocsInCloudTest.java
37c37
< public class TestDistribDocBasedVersion extends AbstractFullDistribZkTestBase
{
---
> public class TestUncommitedDocsInCloudTest extends
> AbstractFullDistribZkTestBase {
55c55
< public TestDistribDocBasedVersion() {
---
> public TestUncommitedDocsInCloudTest() {
hossman@frisbee:~/lucene/4x_dev$ ant clean && cd solr/core && ant test
-Dtestcase=TestUncommitedDocsInCloudTest -Dtests.seed=791402573DC76F3C
-Dtests.multiplier=3 -Dtests.slow=true -Dtests.locale=ar_IQ
-Dtests.timezone=Antarctica/Rothera -Dtests.file.encoding=US-ASCII
...
[junit4] JVM J0: 0.41 .. 32.98 = 32.56s
[junit4] Execution time total: 33 seconds
[junit4] Tests summary: 1 suite, 1 test
[echo] 5 slowest tests:
[junit4:tophints] 231.44s | org.apache.solr.util.SimplePostToolTest
[junit4:tophints] 203.23s | org.apache.solr.cloud.BasicDistributedZkTest
[junit4:tophints] 126.58s | org.apache.solr.cloud.TriLevelCompositeIdRoutingTest
[junit4:tophints] 111.38s |
org.apache.solr.cloud.CollectionsAPIDistributedZkTest
[junit4:tophints] 106.55s | org.apache.solr.cloud.BasicDistributedZk2Test
{noformat}
> Cloud test harness manifesting reproducible failures in
> TestDistribDocBasedVersion
> ----------------------------------------------------------------------------------
>
> Key: SOLR-5628
> URL: https://issues.apache.org/jira/browse/SOLR-5628
> Project: Solr
> Issue Type: Bug
> Reporter: Hoss Man
>
> Jenkins uncovered a test seed that causes a reproducible IndexWriter
> assertion failure in TestDistribDocBasedVersion on the 4x branch.
> McCandless helped dig in and believe that something in the way the solr test
> framework is setup is causing the test to delete the index dirs before the
> IndexWriter is being closed. Meanwhile, it appears that recent changes to 4x
> have caused the nature of the failure to change, so that now -- in addition
> to the IndexWriter assertion failure -- the test cleanup also stalls out and
> the test runner has to terminate some stalled threads.
> details to following in comment, but here's the reproduce line...
> {noformat}
> ant test -Dtestcase=TestDistribDocBasedVersion -Dtests.seed=791402573DC76F3C
> -Dtests.multiplier=3 -Dtests.slow=true
> -Dtests.locale=ar_IQ -Dtests.timezone=Antarctica/Rothera
> -Dtests.file.encoding=US-ASCII
> {noformat}
> And the mail thread regarding this...
> https://mail-archives.apache.org/mod_mbox/lucene-dev/201401.mbox/%3Calpine.DEB.2.02.1401100930260.20275@frisbee%3E
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]