[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17754910#comment-17754910 ] Zoltan Haindrich commented on HIVE-26806: - made you an admin - didn't know you weren't one :D there are 2 ways to upgrade the plugin: * upgrade individually on the interface * upgrade by building a new htk-jenkins image (https://hub.docker.com/r/kgyrtkirk/htk-jenkins/tags) this second could upgrade everything from jenkins version to all plugins - since it wasn't been done for a while it might be helpfull to do that let me know if you need any help with that; I'm also on asf slack if you want to chat > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17754096#comment-17754096 ] Stamatis Zampetakis commented on HIVE-26806: Thanks for the pointer Zoltan. I don't have admin privileges on Jenkins to upgrade the version of the plugin (if necessary) but I will ping some people to see what can be done. > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17752828#comment-17752828 ] Zoltan Haindrich commented on HIVE-26806: - seems like there is a helpfull feature in the parallel-test-executor https://github.com/jenkinsci/parallel-test-executor-plugin/commit/c9145a5f849f01d6e99c2240eb51d9aaf283ef6a upgrade to >380 could make this go away > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17644252#comment-17644252 ] Stamatis Zampetakis commented on HIVE-26806: I plan to keep the the timeout to the same value (12h) for a while longer since I've seen a recent build (http://ci.hive.apache.org/job/hive-precommit/job/master/1519/) taking 5h30. Although the 5h are not spend executing tests, but also allocating executors, I don't want to risk destabilising CI again now. Let's monitor http://ci.hive.apache.org/job/hive-precommit/job/master/buildTimeTrend for the following days and take a decision next week. > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17643751#comment-17643751 ] Alessandro Solimando commented on HIVE-26806: - [~zabetak], I have deleted the green runs but the first time I re-run, the timeout occurred again. I haven't seen timeout from that run onward, so it has probably worked, but the new random split was unfortunate too by coincidence. So, resuming, deleting past green runs seem to work, no need to close and open PR again if not needed. Thanks! > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17643656#comment-17643656 ] Akshat Mathur commented on HIVE-26806: -- [~zabetak] Closing PR-3803 and opening a new one worked thanks. Run for new PR: http://ci.hive.apache.org/blue/organizations/jenkins/hive-precommit/detail/PR-3832/1/pipeline/ > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17643560#comment-17643560 ] Stamatis Zampetakis commented on HIVE-26806: [~asolimando] For documentation purposes can you elaborate what happened after deleting all successful builds? What was the problem that you observed? > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17643470#comment-17643470 ] Alessandro Solimando commented on HIVE-26806: - It looks that deleting all green past runs did not fix for [https://github.com/apache/hive/pull/3137]. That's a big deal since the PR is huge and review is in progress, I don't think I can close and re-open it. Is there a way to tweak timeout for that PR alone [~zabetak]? > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17643247#comment-17643247 ] Stamatis Zampetakis commented on HIVE-26806: *Important note:* The test timings from master *are not used* to split tests in PRs. The master branch and PR branches have separate Jenkins jobs so one does not use the other as a reference. The splitting of tests on the first run of a PR (or a PR without a previous successful build) is more or less random. > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17643222#comment-17643222 ] Alessandro Solimando commented on HIVE-26806: - Thanks [~zabetak], as you say the issue now affects only existing PRs, I am trying 2. to see if it works, otherwise I will go for 1., I will keep you guys posted here. Forgetting the old affected PRs, I am OK with reducing the timeout to the previous value, since it now works. > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17643213#comment-17643213 ] Stamatis Zampetakis commented on HIVE-26806: The recent builds on master (1513, 1514) are now back to normal and each split takes at most ~2h. [~asolimando] [~ayushtkn] I am planning to revert the timeout back to 6h by committing directly to master in a few hours. Please speak up if there is any reason not do to this. [~akshatm] The Jenkins plugin that is used to split the test into buckets uses the last successful build of the job as a guide. Each PR corresponds to a separate Jenkins Job (http://ci.hive.apache.org/job/hive-precommit/view/change-requests/). The last successful build for your PR is http://ci.hive.apache.org/job/hive-precommit/job/PR-3803/8/ so this is what will be used to split the tests. This is not good cause the successful run has 3K less tests than what exists in master so the splitting will be pretty bad. I see three ways to unblock the current situation and overcome the problem: # Close PR-3803 and open a new one. # Manually delete every successful build for JOB PR-3803 and start a new one. # Increase the timeout on the JenkinsFile and try again. None of these is perfect but I have higher hopes for 1 and 2. > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17642948#comment-17642948 ] Akshat Mathur commented on HIVE-26806: -- I tried re-running the tests with 22 splits even with 24 splits. The issue seems to persist. Here is the latest run for my PR: PR-3803 http://ci.hive.apache.org/blue/organizations/jenkins/hive-precommit/detail/PR-3803/15/pipeline > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17642809#comment-17642809 ] Ayush Saxena commented on HIVE-26806: - The build passed : [http://ci.hive.apache.org/blue/organizations/jenkins/hive-precommit/detail/master/1512/pipeline/661] Hopefully the time should reduce for subsequent builds. Have triggered another: [http://ci.hive.apache.org/job/hive-precommit/job/master/1513/] Ideally the splits are planned according to the previous results, so this should plan well, since last build was successful, lets see if it does so > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17642702#comment-17642702 ] Ayush Saxena commented on HIVE-26806: - I changed the build timeout in 1511, without the commit, just while replaying added the timeout. Which you aborted. :P > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17642695#comment-17642695 ] Stamatis Zampetakis commented on HIVE-26806: The last run http://ci.hive.apache.org/job/hive-precommit/job/master/1510/ also failed with timeout exceeded. I will double the timeout temporarily and commit again. > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17642580#comment-17642580 ] Stamatis Zampetakis commented on HIVE-26806: An attemt to fix the problem by increasing the SPLIT size from 20 to 22 was committed to master: https://github.com/apache/hive/commit/74729e0aa82951387dd5449862203a9a32282801. However the initial Jenkins run for the respective master commit (http://ci.hive.apache.org/job/hive-precommit/job/master/1508/parameters/) didn't seem to pick up the changes for this specific parameter. We stopped the run and manually re-launched http://ci.hive.apache.org/job/hive-precommit/job/master/1510/parameters/ and now we are waiting for the results. > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17642558#comment-17642558 ] Alessandro Solimando commented on HIVE-26806: - In case you have an existing open PR suffering form this and you don't want to rebase, if you have permission to run Jenkins' jobs you just change the default split value to 22 and re-run, HTH > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (HIVE-26806) Precommit tests in CI are timing out after HIVE-26796
[ https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17642552#comment-17642552 ] Stamatis Zampetakis commented on HIVE-26806: The fact that the tests are taking too much time to execute could be due to a bad allocation to buckets from [Jenkins Parallel Test Executor Plugin|https://www.jenkins.io/doc/pipeline/steps/parallel-test-executor/#splittests-split-test-runs]. The problem is probably a result of HIVE-26796 that re-enabled ~1K more tests that were not distributed evenly among splits. > Precommit tests in CI are timing out after HIVE-26796 > - > > Key: HIVE-26806 > URL: https://issues.apache.org/jira/browse/HIVE-26806 > Project: Hive > Issue Type: Bug > Components: Testing Infrastructure >Reporter: Stamatis Zampetakis >Assignee: Stamatis Zampetakis >Priority: Major > > http://ci.hive.apache.org/job/hive-precommit/job/master/1506/ > {noformat} > ancelling nested steps due to timeout > 15:22:08 Sending interrupt signal to process > 15:22:08 Killing processes > 15:22:09 kill finished with exit code 0 > 15:22:19 Terminated > 15:22:19 script returned exit code 143 > [Pipeline] } > [Pipeline] // withEnv > [Pipeline] } > 15:22:19 Deleting 1 temporary files > [Pipeline] // configFileProvider > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (PostProcess) > [Pipeline] sh > [Pipeline] sh > [Pipeline] sh > [Pipeline] junit > 15:22:25 Recording test results > 15:22:32 [Checks API] No suitable checks publisher found. > [Pipeline] } > [Pipeline] // stage > [Pipeline] } > [Pipeline] // container > [Pipeline] } > [Pipeline] // node > [Pipeline] } > [Pipeline] // timeout > [Pipeline] } > [Pipeline] // podTemplate > [Pipeline] } > 15:22:32 Failed in branch split-01 > [Pipeline] // parallel > [Pipeline] } > [Pipeline] // stage > [Pipeline] stage > [Pipeline] { (Archive) > [Pipeline] podTemplate > [Pipeline] { > [Pipeline] timeout > 15:22:33 Timeout set to expire in 6 hr 0 min > {noformat} -- This message was sent by Atlassian Jira (v8.20.10#820010)