[ 
https://issues.apache.org/jira/browse/BIGTOP-1192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13887530#comment-13887530
 ] 

Mikhail Antonov edited comment on BIGTOP-1192 at 1/31/14 6:53 AM:
------------------------------------------------------------------

Hm, weird. Definitely shouldn't require particular order of execution, and I've 
never seen this kind of error on my machine, running tests for this class or 
whole bigtop-test-framework.

are you just running it like {{{ mvn -Dtest=ClusterFailuresTest test }}} ?

Oh, it's using timeouts, may be that's the reason..But 3 seconds "should be 
enough for everyone"


was (Author: mantonov):
Hm, weird. Definitely shouldn't require particular order of execution, and I've 
never seen this kind of error on my machine, running tests for this class or 
whole bigtop-test-framework.

are you just running it like {{{mvn -Dtest=ClusterFailuresTest test}}} ?

Oh, it's using timeouts, may be that's the reason..But 3 seconds "should be 
enough for everyone"

> Add utilities to facilitate cluster failure testing into bigtop-test-framework
> ------------------------------------------------------------------------------
>
>                 Key: BIGTOP-1192
>                 URL: https://issues.apache.org/jira/browse/BIGTOP-1192
>             Project: Bigtop
>          Issue Type: New Feature
>          Components: Tests
>    Affects Versions: 0.7.0
>            Reporter: Mikhail Antonov
>            Assignee: Mikhail Antonov
>              Labels: itest, smokes
>             Fix For: 0.8.0
>
>         Attachments: BIGTOP-1192.1.patch, BIGTOP-1192.2.patch, 
> BIGTOP-1192.3.patch, BIGTOP-1192.4.patch
>
>
> The goal is to provide Bigtop module maintainers with a set of set of util 
> classes to help develop smoke tests able to simulate certain failures during 
> smoke tests execution on a cluster.
> Summary of what is provided in current patch. 
> Following failure types are supported now:
>  - Service stopped and restarted (on given set of nodes)
>  - Service killed with 'kill -9' and started back up (on given set of nodes)
>  - Node inbound/outbound connections are shut down and brought back up (via 
> iptables).
>  
> System requirements to run smoke tests with failures.
>  *  password-less (PKI-based) root ssh to all nodes in cluster being tested 
> is assumed.
>  *  for local tests, like ClusterFailuresTest, one should have password-less 
> root ssh to localhost.
>  *  env variable BIGTOP_SMOKES_CLUSTER_IDENTITY_FILE should point to 
> according private key file.
> Further thoughts (not included in this patch)
>   Cluster provisioning
>    - Bigtop test framework (failures part of it) doesn't need to know about 
> cluster topology, as it simply executes set of SSH commands on remote hosts 
> (whose addresses are provided by specific
>    module smoke test developer). But the actual tests do need to know about 
> cluster topology to run sophisticated failure scenarios.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to