[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14165437#comment-14165437 ] ASF subversion and git services commented on SOLR-6157: --- Commit 1630542 from [~thelabdude] in branch 'dev/branches/branch_5x' [ https://svn.apache.org/r1630542 ] SOLR-6157: re-enable this test on branch_5x as it seems to be passing consistently now on trunk ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler Assignee: Timothy Potter Fix For: 4.10 See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14163612#comment-14163612 ] ASF subversion and git services commented on SOLR-6157: --- Commit 1630140 from [~thelabdude] in branch 'dev/trunk' [ https://svn.apache.org/r1630140 ] SOLR-6157: re-enable this test to see if it runs consistently on Jenkins (beast passed 20/20) ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler Assignee: Timothy Potter Fix For: 4.10 See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14050321#comment-14050321 ] ASF subversion and git services commented on SOLR-6157: --- Commit 1607420 from [~thelabdude] in branch 'dev/trunk' [ https://svn.apache.org/r1607420 ] SOLR-6157: Disable this test for now as it is still hanging on Jenkins. ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler Assignee: Timothy Potter Fix For: 4.10 See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14048956#comment-14048956 ] ASF subversion and git services commented on SOLR-6157: --- Commit 1607110 from [~thelabdude] in branch 'dev/trunk' [ https://svn.apache.org/r1607110 ] SOLR-6157: Refactor the ensureAllReplicasAreActive method into base class and ensure the ClusterState is updated to address intermittent test failures on Jenkins. ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler Assignee: Timothy Potter Fix For: 4.10 See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14041925#comment-14041925 ] Dawid Weiss commented on SOLR-6157: --- The hangs on Jenkins (FreeBSD) may be related to LUCENE-5786. I've updated RR and hopefully fixed this; let's see if it hangs again. ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler Assignee: Timothy Potter Fix For: 4.10 See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14040925#comment-14040925 ] Shalin Shekhar Mangar commented on SOLR-6157: - This might be related to LUCENE-5786 ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler Assignee: Timothy Potter Fix For: 4.10 See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14040926#comment-14040926 ] ASF subversion and git services commented on SOLR-6157: --- Commit 1604868 from [~thelabdude] in branch 'dev/trunk' [ https://svn.apache.org/r1604868 ] SOLR-6157: Hang occurred again, somewhere in tearDown ... changing the code to close the socket proxies after super.tearDown, if that doesn't work, I'll add the AwaitsFix back to the code to ignore this test. ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler Assignee: Timothy Potter Fix For: 4.10 See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14039104#comment-14039104 ] ASF subversion and git services commented on SOLR-6157: --- Commit 1604223 from [~thelabdude] in branch 'dev/branches/branch_4x' [ https://svn.apache.org/r1604223 ] SOLR-6157: Fix hanging unit test. ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler Assignee: Timothy Potter See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14035713#comment-14035713 ] Timothy Potter commented on SOLR-6157: -- It's been running on trunk for a couple of days now without a hang or failure. I think it may be resolved :-) Will backport to branch_4x and 4.9 release branch tomorrow (Thursday) if no hangs / failures today. ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler Assignee: Timothy Potter See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14032620#comment-14032620 ] ASF subversion and git services commented on SOLR-6157: --- Commit 1602924 from [~thelabdude] in branch 'dev/trunk' [ https://svn.apache.org/r1602924 ] SOLR-6157: Added some logging and re-opened the socket proxy to try to figure out why this test is hanging; reenabling temporarily to see if these changes help diagnose the cause of the hang. ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler Assignee: Timothy Potter See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14032631#comment-14032631 ] Timothy Potter commented on SOLR-6157: -- I'll keep an eye on this and if it hangs again, I'll re-disable it with the @AwaitsFix annotation. I should mention that this test does not do very much work at all (not doing too many iterations) and only indexes a few docs (tiny indexes). The only thing exotic it does is to use the SocketProxy class to introduce network partitions between the leader and replicas. I did fix one place where the proxy was closed but never re-opened so that might cause a hang during shutdown. ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler Assignee: Timothy Potter See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14027833#comment-14027833 ] Uwe Schindler commented on SOLR-6157: - Hi Dawid, I disabled the test with @AwaitsFix, so it will no longer hang on Jenkins. Maybe I try to reproduce this locally, passing -Dtests.slow=true (the problem with this test is: its also marked as @Slow, so the normal user would never run it. And nightly it hangs a lot of times). Maybe the test just does too many iterations or has too large indexes (I am not sure why it takes so long!). We may tone it down. ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler Assignee: Timothy Potter See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14026643#comment-14026643 ] Uwe Schindler commented on SOLR-6157: - Next one is also hanging: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10518/console ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14026650#comment-14026650 ] ASF subversion and git services commented on SOLR-6157: --- Commit 1601679 from [~thetaphi] in branch 'dev/branches/branch_4x' [ https://svn.apache.org/r1601679 ] SOLR-6157: Disable test that hangs indefinitely ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14026648#comment-14026648 ] ASF subversion and git services commented on SOLR-6157: --- Commit 1601678 from [~thetaphi] in branch 'dev/trunk' [ https://svn.apache.org/r1601678 ] SOLR-6157: Disable test that hangs indefinitely ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14026690#comment-14026690 ] Timothy Potter commented on SOLR-6157: -- Nothing special about this test so not sure why it would hang ... seems like a problem in the test framework itself. ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler Assignee: Timothy Potter See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14026969#comment-14026969 ] Dawid Weiss commented on SOLR-6157: --- The test framework has been pretty well tested and seems to be working fine. The timeout is set to an incredibly large value because Solr tests take so long. If you let it run until the timeout expires, you will get a stack trace of where each thread was. Uwe, could you send a signal to the hung process next time you see one? Then JVM logs will contain it and I can recover relevant stack traces. ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler Assignee: Timothy Potter See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6157) ReplicationFactorTest hangs
[ https://issues.apache.org/jira/browse/SOLR-6157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14026971#comment-14026971 ] Dawid Weiss commented on SOLR-6157: --- {code} @TimeoutSuite(millis = 2 * TimeUnits.HOUR) {code} So those tests went beyond the timeout...? Looks like JVM problems with halt(), regardless of what actually caused the stall. Uwe, if you see it next time, try to capture the stack trace (see if the JVM is responding to it at all). ReplicationFactorTest hangs --- Key: SOLR-6157 URL: https://issues.apache.org/jira/browse/SOLR-6157 Project: Solr Issue Type: Bug Components: replication (java) Reporter: Uwe Schindler Assignee: Timothy Potter See: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10517/ You can download all logs from there. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org