Hi Michael,

I believe the problem can be not related to RAM space, since today I run into 
the same issue on 200GB RAM machine.

I will try to run intensive/non-intensive tests separately.

Thank you,
Sergey


From: Michael Shuler
Sent: Thursday, October 19, 2017 3:58 PM
To: dev@cassandra.apache.org
Subject: Re: Cassandra DTests deadlocks on random test

7.6G RAM may be a little bit too small, we've seen similar random hangs
in the past on non-resource-intensive tests on m3.large. It doesn't
appear you are skipping resource-intensive tests. Our standard dtest
instance type has been an m3.xlarge, and the resource-intensive tests
are run m3.2xlarge. (or something comparable on RAM & SSD)

dtest run command (excludes resource-intensive tests):
https://github.com/apache/cassandra-builds/blob/master/build-scripts/cassandra-dtest.sh#L53

dtest-large run command (only resource-intensive tests):
https://github.com/apache/cassandra-builds/blob/master/build-scripts/cassandra-dtest.sh#L61

Running dtest without excluding resource-intensive will run everything.

If you wish to troubleshoot your tests when they hang, there should be
/tmp/dtest-XXXXX directories with the ccm cluster left on disk from the
hung test, since they never get to cleanup stage.

-- 
Michael

On 10/19/2017 05:29 AM, Sergey La wrote:
> Hi!
> 
> I have created the patch for the Cassandra version 3.0.14 and trying to
> test it using the cassandra dtests.
> 
> Problem is - dtests deadlocks at some random tests, time and again - on
> unpatched  3.0.14 version of Cassandra.
> 
> What I have done.
> 
> I cloned the cassandra repository (origin
> http://git-wip-us.apache.org/repos/asf/cassandra.git), and checked out to
> tags/cassandra-3.0.14 - head is on f3e38cb638113c2a23855a104d6082da5bc10ddb.
> 
> Then I have cloned the cassandra-dtest repo (origin  git://
> github.com/riptano/cassandra-dtest.git). Head is on
> 6843d76d0a85ad82edf889e8280b87786dc48486.
> 
> I setup dtests according to this instructions:
> https://github.com/riptano/cassandra-dtest/blob/master/INSTALL.md
> 
> In addition, I have setup JAVA8_HOME and JAVA_HOME variables to public jre
> of my 1.8.0_144 jdk.
> 
> I start testing using this command:
> JAVA8_HOME=$JAVA8_HOME nosetests --with-flaky  --with-xunit
> --xunit-file=out.xunit.xml  --force-flaky --max-runs=3 --verbose
> --debug-log=err.debug.nose.txt  1> out.txt 2> err.txt
> I run the tests on x86_64 CentOS 7 with 7.6G of RAM.
> 
> Problem symptoms:
> During the "normal" run (I have got only 1 "normal" run in 5 attempts),
> err.txt is updated constantly with name of the test recently completed, and
> in the end out.xunit.xml file appears, with test summary results. nosetests
> process exits.
> 
> During the "problem" run tests stop progressing (err.txt was not modified
> for 10 hours), out.xunit.xml is not appearing, nosetests process runs. I
> killed java processes, but nothing changed for 2 hours - nosetests process
> still runs, but files are unchanged.
> 
> Any help would be appreciated,
> Sergey
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
For additional commands, e-mail: dev-h...@cassandra.apache.org


Reply via email to