[jira] [Commented] (HIVE-4808) WebHCat job submission is killed by TaskTracker since it's not sending a heartbeat properly
[ https://issues.apache.org/jira/browse/HIVE-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13721511#comment-13721511 ] Alan Gates commented on HIVE-4808: -- I'm not sure I'm running the tests properly. When I run the new test TestHeartbeat_2 it fails with: {code} ./test_harness.pl::TestDriverCurl::checkResStatusCode INFO Check failed: status_code 200 expected, test returned 400 {code} Do I need to do something to set it up properly? WebHCat job submission is killed by TaskTracker since it's not sending a heartbeat properly --- Key: HIVE-4808 URL: https://issues.apache.org/jira/browse/HIVE-4808 Project: Hive Issue Type: Bug Components: HCatalog Affects Versions: 0.11.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4808.1.patch, HIVE-4808.patch (set mapred.task.timeout=7) curl -i -d user.name=ekoifman \ -d jar=/user/ekoifman/webhcate2e/hexamples.jar \ -d class=sleep \ -d arg=-mt \ -d arg=5 \ -d statusdir=/tmp \ 'http://localhost:50111/templeton/v1/mapreduce/jar' The TempletonControllerJob gets retried 4 times (Thus there are 4 SleepJob invocations) with message that it was killed due to inactivity. hexamples.jar = hadoop-examples-*.jar -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4808) WebHCat job submission is killed by TaskTracker since it's not sending a heartbeat properly
[ https://issues.apache.org/jira/browse/HIVE-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13721517#comment-13721517 ] Alan Gates commented on HIVE-4808: -- Never mind, my mistake. I had my test harness configured incorrectly. Tests pass, I'll check this in shortly. WebHCat job submission is killed by TaskTracker since it's not sending a heartbeat properly --- Key: HIVE-4808 URL: https://issues.apache.org/jira/browse/HIVE-4808 Project: Hive Issue Type: Bug Components: HCatalog Affects Versions: 0.11.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4808.1.patch, HIVE-4808.patch (set mapred.task.timeout=7) curl -i -d user.name=ekoifman \ -d jar=/user/ekoifman/webhcate2e/hexamples.jar \ -d class=sleep \ -d arg=-mt \ -d arg=5 \ -d statusdir=/tmp \ 'http://localhost:50111/templeton/v1/mapreduce/jar' The TempletonControllerJob gets retried 4 times (Thus there are 4 SleepJob invocations) with message that it was killed due to inactivity. hexamples.jar = hadoop-examples-*.jar -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4808) WebHCat job submission is killed by TaskTracker since it's not sending a heartbeat properly
[ https://issues.apache.org/jira/browse/HIVE-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13718401#comment-13718401 ] Hive QA commented on HIVE-4808: --- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12593808/HIVE-4808.1.patch {color:red}ERROR:{color} -1 due to 1 failed/errored test(s), 2649 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_view_cast {noformat} Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/164/testReport Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/164/console Messages: {noformat} Executing org.apache.hive.ptest.execution.CleanupPhase Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests failed with: TestsFailedException: 1 tests failed {noformat} This message is automatically generated. WebHCat job submission is killed by TaskTracker since it's not sending a heartbeat properly --- Key: HIVE-4808 URL: https://issues.apache.org/jira/browse/HIVE-4808 Project: Hive Issue Type: Bug Components: HCatalog Affects Versions: 0.11.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4808.1.patch, HIVE-4808.patch (set mapred.task.timeout=7) curl -i -d user.name=ekoifman \ -d jar=/user/ekoifman/webhcate2e/hexamples.jar \ -d class=sleep \ -d arg=-mt \ -d arg=5 \ -d statusdir=/tmp \ 'http://localhost:50111/templeton/v1/mapreduce/jar' The TempletonControllerJob gets retried 4 times (Thus there are 4 SleepJob invocations) with message that it was killed due to inactivity. hexamples.jar = hadoop-examples-*.jar -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4808) WebHCat job submission is killed by TaskTracker since it's not sending a heartbeat properly
[ https://issues.apache.org/jira/browse/HIVE-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13717970#comment-13717970 ] Hive QA commented on HIVE-4808: --- {color:red}Overall{color}: -1 no tests executed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12593808/HIVE-4808.1.patch Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/159/testReport Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/159/console Messages: {noformat} Executing org.apache.hive.ptest.execution.CleanupPhase Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Tests failed with: IllegalStateException: Too many bad hosts: 1.0% (10 / 10) is greater than threshold of 50% {noformat} This message is automatically generated. WebHCat job submission is killed by TaskTracker since it's not sending a heartbeat properly --- Key: HIVE-4808 URL: https://issues.apache.org/jira/browse/HIVE-4808 Project: Hive Issue Type: Bug Components: HCatalog Affects Versions: 0.11.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4808.1.patch, HIVE-4808.patch (set mapred.task.timeout=7) curl -i -d user.name=ekoifman \ -d jar=/user/ekoifman/webhcate2e/hexamples.jar \ -d class=sleep \ -d arg=-mt \ -d arg=5 \ -d statusdir=/tmp \ 'http://localhost:50111/templeton/v1/mapreduce/jar' The TempletonControllerJob gets retried 4 times (Thus there are 4 SleepJob invocations) with message that it was killed due to inactivity. hexamples.jar = hadoop-examples-*.jar -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4808) WebHCat job submission is killed by TaskTracker since it's not sending a heartbeat properly
[ https://issues.apache.org/jira/browse/HIVE-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715775#comment-13715775 ] Eugene Koifman commented on HIVE-4808: -- 3 things to keep in mind when testing: 1. TempletonControllerJob is hardcoded to send ping every 1 second 2. Set mapred.task.timeout to 70+ seconds, i.e. larger than interval in #1 3. when running test case (in bug description) set -mt 9 (to be larger than # 2) WebHCat job submission is killed by TaskTracker since it's not sending a heartbeat properly --- Key: HIVE-4808 URL: https://issues.apache.org/jira/browse/HIVE-4808 Project: Hive Issue Type: Bug Components: HCatalog Affects Versions: 0.11.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4808.patch (set mapred.task.timeout=7) curl -i -d user.name=ekoifman \ -d jar=/user/ekoifman/webhcate2e/hexamples.jar \ -d class=sleep \ -d arg=-mt \ -d arg=5 \ -d statusdir=/tmp \ 'http://localhost:50111/templeton/v1/mapreduce/jar' The TempletonControllerJob gets retried 4 times (Thus there are 4 SleepJob invocations) with message that it was killed due to inactivity. hexamples.jar = hadoop-examples-*.jar -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4808) WebHCat job submission is killed by TaskTracker since it's not sending a heartbeat properly
[ https://issues.apache.org/jira/browse/HIVE-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13716040#comment-13716040 ] Eugene Koifman commented on HIVE-4808: -- Couple of suggestions from Hadoop user list on setting timeout programmatically: Yes, you can set it into your Job configuration object in code. If your driver uses the Tool framework, then you can also pass a -Dmapred.task.timeout=value CLI argument when invoking your program. AND 'mapred.task.timeout' is deprecated configuration. You can use 'mapreduce.task.timeout' property to do the same. You could set this configuration while submitting the Job using org.apache.hadoop.conf.Configuration.setLong(String name, long value) API from conf or JobConf. WebHCat job submission is killed by TaskTracker since it's not sending a heartbeat properly --- Key: HIVE-4808 URL: https://issues.apache.org/jira/browse/HIVE-4808 Project: Hive Issue Type: Bug Components: HCatalog Affects Versions: 0.11.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4808.patch (set mapred.task.timeout=7) curl -i -d user.name=ekoifman \ -d jar=/user/ekoifman/webhcate2e/hexamples.jar \ -d class=sleep \ -d arg=-mt \ -d arg=5 \ -d statusdir=/tmp \ 'http://localhost:50111/templeton/v1/mapreduce/jar' The TempletonControllerJob gets retried 4 times (Thus there are 4 SleepJob invocations) with message that it was killed due to inactivity. hexamples.jar = hadoop-examples-*.jar -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4808) WebHCat job submission is killed by TaskTracker since it's not sending a heartbeat properly
[ https://issues.apache.org/jira/browse/HIVE-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13699610#comment-13699610 ] Thejas M Nair commented on HIVE-4808: - It would be good to have an automated test for this as well. I understand that this will add 10+ minutes to the system tests (effectively almost doubling the run time). Maybe, we can have a separate ant target that runs the shorter running system tests, for use by developers before committing changes. WebHCat job submission is killed by TaskTracker since it's not sending a heartbeat properly --- Key: HIVE-4808 URL: https://issues.apache.org/jira/browse/HIVE-4808 Project: Hive Issue Type: Bug Components: HCatalog Affects Versions: 0.11.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4808.patch (set mapred.task.timeout=7) curl -i -d user.name=ekoifman \ -d jar=/user/ekoifman/webhcate2e/hexamples.jar \ -d class=sleep \ -d arg=-mt \ -d arg=5 \ -d statusdir=/tmp \ 'http://localhost:50111/templeton/v1/mapreduce/jar' The TempletonControllerJob gets retried 4 times (Thus there are 4 SleepJob invocations) with message that it was killed due to inactivity. hexamples.jar = hadoop-examples-*.jar -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira