[ 
https://issues.apache.org/jira/browse/SOLR-13060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16722216#comment-16722216
 ] 

Steve Rowe commented on SOLR-13060:
-----------------------------------

bq. Just FYI: the upgrade of randomizedtesting does fix the suite timeout 
problem (I just tested it on by running SOLR-13074 with a suite timeout of 10 
seconds...).

Awesome, thanks!

bq. I think one hour is very generous for the sysout loop in SOLR-13074, so 
it'll be enough to fill the disk anyway.

All the examples I've seen have HEARTBEAT messages runing for 40k-50k seconds, 
an order of magnitude higher, which is why I set it to an hour.

bq. I'll work on truncating sysouts up to at most 1 gig, test it on that 
SOLR-13074, then maybe to fix the underlying cause of leaking threads.

Great!

bq. Until this is solved, I don't think it makes sense to run hdfs tests at all 
– they will hang and fill up disk space on jenkins.

+1, I'll apply the {{@AwaitsFix}} annotation.

> Some Nightly HDFS tests never terminate on ASF Jenkins, triggering whole-job 
> timeout, causing Jenkins to kill JVMs, causing dump files to be created that 
> fill all disk space, causing failure of all following jobs on the same node
> -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-13060
>                 URL: https://issues.apache.org/jira/browse/SOLR-13060
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: Tests
>            Reporter: Steve Rowe
>            Priority: Major
>         Attachments: 
> junit4-J0-20181210_065854_4175881849742830327151.spill.part1.gz
>
>
> The 3 tests that are affected: 
> * HdfsAutoAddReplicasIntegrationTest
> * HdfsCollectionsAPIDistributedZkTest
> * MoveReplicaHDFSTest 
> Instances from the dev list:
> 12/1: 
> https://lists.apache.org/thread.html/e04ad0f9113e15f77393ccc26e3505e3090783b1d61bd1c7ff03d33e@%3Cdev.lucene.apache.org%3E
> 12/5: 
> https://lists.apache.org/thread.html/d78c99255abfb5134803c2b77664c1a039d741f92d6e6fcbcc66cd14@%3Cdev.lucene.apache.org%3E
> 12/8: 
> https://lists.apache.org/thread.html/92ad03795ae60b1e94859d49c07740ca303f997ae2532e6f079acfb4@%3Cdev.lucene.apache.org%3E
> 12/8: 
> https://lists.apache.org/thread.html/26aace512bce0b51c4157e67ac3120f93a99905b40040bee26472097@%3Cdev.lucene.apache.org%3E
> 12/11: 
> https://lists.apache.org/thread.html/33558a8dd292fd966a7f476bf345b66905d99f7eb9779a4d17b7ec97@%3Cdev.lucene.apache.org%3E



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to