[
https://issues.apache.org/jira/browse/BIGTOP-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14209826#comment-14209826
]
jay vyas commented on BIGTOP-1521:
----------------------------------
*i agree* that timeouts can prevent stalls, and are a simple and easy way to
solve this.
*but* they are indirect... so lets make sure to add a good error message to
timeout failures though, so its clear, if a test times out, what the most
likely cause is.
> Bigtop smoke-tests hierarchy and fast failure
> ---------------------------------------------
>
> Key: BIGTOP-1521
> URL: https://issues.apache.org/jira/browse/BIGTOP-1521
> Project: Bigtop
> Issue Type: Bug
> Components: tests
> Affects Versions: 0.8.0
> Reporter: jay vyas
>
> *Problem* Sometimes YARN jobs can hang indefinetly, and in the case of the
> {{smoke-tests}} , we also can get an infinite hang it appears.
> This can be reproduced by simply messing up/deleting the core hadoop
> components from {{bigtop-deploy/vm/vagrant-puppet}}'s provision script puppet
> conf file {{provision.sh}} and running {{vagrant up}}.
> *Solution* Let add some smarts to the smoke tester - such that the basic yarn
> services (i. think hadoop-smoke in test-artifcacts does this maybe ) are
> confirmed before any yarn based tests are ran.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)