[jira] [Commented] (HIVE-14215) Displaying inconsistent CPU usage data with MR execution engine
[ https://issues.apache.org/jira/browse/HIVE-14215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15377227#comment-15377227 ] Sergio Peña commented on HIVE-14215: Good, +1 too > Displaying inconsistent CPU usage data with MR execution engine > --- > > Key: HIVE-14215 > URL: https://issues.apache.org/jira/browse/HIVE-14215 > Project: Hive > Issue Type: Bug >Affects Versions: 2.2.0 >Reporter: Peter Vary >Assignee: Peter Vary >Priority: Minor > Attachments: HIVE-14215.patch > > > If the MR task is finished after printing the cumulative CPU time then there > is the possibility to print inconsistent CPU usage information. > Correct one: > {noformat} > 2016-07-12 11:31:42,961 Stage-3 map = 0%, reduce = 0% > 2016-07-12 11:31:48,237 Stage-3 map = 100%, reduce = 0%, Cumulative CPU 2.5 > sec > MapReduce Total cumulative CPU time: 2 seconds 500 msec > Ended Job = job_1468321038188_0003 > MapReduce Jobs Launched: > Stage-Stage-3: Map: 1 Cumulative CPU: 2.5 sec HDFS Read: 5864 HDFS Write: > 103 SUCCESS > Total MapReduce CPU Time Spent: 2 seconds 500 msec > {noformat} > One type of inconsistent data (easily reproducible one): > {noformat} > 2016-07-12 11:39:00,540 Stage-3 map = 0%, reduce = 0% > Ended Job = job_1468321038188_0004 > MapReduce Jobs Launched: > Stage-Stage-3: Map: 1 Cumulative CPU: 2.51 sec HDFS Read: 5864 HDFS > Write: 103 SUCCESS > Total MapReduce CPU Time Spent: 2 seconds 510 msec > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-14215) Displaying inconsistent CPU usage data with MR execution engine
[ https://issues.apache.org/jira/browse/HIVE-14215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15375604#comment-15375604 ] Aihua Xu commented on HIVE-14215: - I see. It's missing the last reading of the counter after the execution finishes. +1. > Displaying inconsistent CPU usage data with MR execution engine > --- > > Key: HIVE-14215 > URL: https://issues.apache.org/jira/browse/HIVE-14215 > Project: Hive > Issue Type: Bug >Affects Versions: 2.2.0 >Reporter: Peter Vary >Assignee: Peter Vary >Priority: Minor > Attachments: HIVE-14215.patch > > > If the MR task is finished after printing the cumulative CPU time then there > is the possibility to print inconsistent CPU usage information. > Correct one: > {noformat} > 2016-07-12 11:31:42,961 Stage-3 map = 0%, reduce = 0% > 2016-07-12 11:31:48,237 Stage-3 map = 100%, reduce = 0%, Cumulative CPU 2.5 > sec > MapReduce Total cumulative CPU time: 2 seconds 500 msec > Ended Job = job_1468321038188_0003 > MapReduce Jobs Launched: > Stage-Stage-3: Map: 1 Cumulative CPU: 2.5 sec HDFS Read: 5864 HDFS Write: > 103 SUCCESS > Total MapReduce CPU Time Spent: 2 seconds 500 msec > {noformat} > One type of inconsistent data (easily reproducible one): > {noformat} > 2016-07-12 11:39:00,540 Stage-3 map = 0%, reduce = 0% > Ended Job = job_1468321038188_0004 > MapReduce Jobs Launched: > Stage-Stage-3: Map: 1 Cumulative CPU: 2.51 sec HDFS Read: 5864 HDFS > Write: 103 SUCCESS > Total MapReduce CPU Time Spent: 2 seconds 510 msec > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-14215) Displaying inconsistent CPU usage data with MR execution engine
[ https://issues.apache.org/jira/browse/HIVE-14215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15374761#comment-15374761 ] Peter Vary commented on HIVE-14215: --- The test failures are not related > Displaying inconsistent CPU usage data with MR execution engine > --- > > Key: HIVE-14215 > URL: https://issues.apache.org/jira/browse/HIVE-14215 > Project: Hive > Issue Type: Bug >Affects Versions: 2.2.0 >Reporter: Peter Vary >Assignee: Peter Vary >Priority: Minor > Attachments: HIVE-14215.patch > > > If the MR task is finished after printing the cumulative CPU time then there > is the possibility to print inconsistent CPU usage information. > Correct one: > {noformat} > 2016-07-12 11:31:42,961 Stage-3 map = 0%, reduce = 0% > 2016-07-12 11:31:48,237 Stage-3 map = 100%, reduce = 0%, Cumulative CPU 2.5 > sec > MapReduce Total cumulative CPU time: 2 seconds 500 msec > Ended Job = job_1468321038188_0003 > MapReduce Jobs Launched: > Stage-Stage-3: Map: 1 Cumulative CPU: 2.5 sec HDFS Read: 5864 HDFS Write: > 103 SUCCESS > Total MapReduce CPU Time Spent: 2 seconds 500 msec > {noformat} > One type of inconsistent data (easily reproducible one): > {noformat} > 2016-07-12 11:39:00,540 Stage-3 map = 0%, reduce = 0% > Ended Job = job_1468321038188_0004 > MapReduce Jobs Launched: > Stage-Stage-3: Map: 1 Cumulative CPU: 2.51 sec HDFS Read: 5864 HDFS > Write: 103 SUCCESS > Total MapReduce CPU Time Spent: 2 seconds 510 msec > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-14215) Displaying inconsistent CPU usage data with MR execution engine
[ https://issues.apache.org/jira/browse/HIVE-14215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15374502#comment-15374502 ] Hive QA commented on HIVE-14215: Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12817435/HIVE-14215.patch {color:red}ERROR:{color} -1 due to no test(s) being added or modified. {color:red}ERROR:{color} -1 due to 14 failed/errored test(s), 10314 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_acid_globallimit org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_list_bucket_dml_12 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_list_bucket_dml_13 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_masking_8 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_stats_list_bucket org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_subquery_multiinsert org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_vector_interval_arithmetic org.apache.hadoop.hive.cli.TestMiniLlapCliDriver.testCliDriver_vector_complex_all org.apache.hadoop.hive.cli.TestMiniLlapCliDriver.testCliDriver_vector_complex_join org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver_acid_globallimit org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver_vector_interval_arithmetic org.apache.hadoop.hive.cli.TestMinimrCliDriver.org.apache.hadoop.hive.cli.TestMinimrCliDriver org.apache.hadoop.hive.llap.daemon.impl.TestLlapTokenChecker.testCheckPermissions org.apache.hadoop.hive.llap.daemon.impl.TestLlapTokenChecker.testGetToken {noformat} Test results: https://builds.apache.org/job/PreCommit-HIVE-MASTER-Build/490/testReport Console output: https://builds.apache.org/job/PreCommit-HIVE-MASTER-Build/490/console Test logs: http://ec2-204-236-174-241.us-west-1.compute.amazonaws.com/logs/PreCommit-HIVE-MASTER-Build-490/ Messages: {noformat} Executing org.apache.hive.ptest.execution.TestCheckPhase Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 14 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12817435 - PreCommit-HIVE-MASTER-Build > Displaying inconsistent CPU usage data with MR execution engine > --- > > Key: HIVE-14215 > URL: https://issues.apache.org/jira/browse/HIVE-14215 > Project: Hive > Issue Type: Bug >Affects Versions: 2.2.0 >Reporter: Peter Vary >Assignee: Peter Vary >Priority: Minor > Attachments: HIVE-14215.patch > > > If the MR task is finished after printing the cumulative CPU time then there > is the possibility to print inconsistent CPU usage information. > Correct one: > {noformat} > 2016-07-12 11:31:42,961 Stage-3 map = 0%, reduce = 0% > 2016-07-12 11:31:48,237 Stage-3 map = 100%, reduce = 0%, Cumulative CPU 2.5 > sec > MapReduce Total cumulative CPU time: 2 seconds 500 msec > Ended Job = job_1468321038188_0003 > MapReduce Jobs Launched: > Stage-Stage-3: Map: 1 Cumulative CPU: 2.5 sec HDFS Read: 5864 HDFS Write: > 103 SUCCESS > Total MapReduce CPU Time Spent: 2 seconds 500 msec > {noformat} > One type of inconsistent data (easily reproducible one): > {noformat} > 2016-07-12 11:39:00,540 Stage-3 map = 0%, reduce = 0% > Ended Job = job_1468321038188_0004 > MapReduce Jobs Launched: > Stage-Stage-3: Map: 1 Cumulative CPU: 2.51 sec HDFS Read: 5864 HDFS > Write: 103 SUCCESS > Total MapReduce CPU Time Spent: 2 seconds 510 msec > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-14215) Displaying inconsistent CPU usage data with MR execution engine
[ https://issues.apache.org/jira/browse/HIVE-14215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15372954#comment-15372954 ] Peter Vary commented on HIVE-14215: --- To reproduce these consistently, I have to put a Thread.sleep at the end of the while cycle in the method progress(ExecDriverTaskHandle th), after this (line 373): {noformat} console.printInfo(output); task.setStatusMessage(output); reportTime = System.currentTimeMillis(); {noformat} This way I raised the occurrence of the rare situation, where the job is finished after the cpu time generation, but before the check of the while cycle. > Displaying inconsistent CPU usage data with MR execution engine > --- > > Key: HIVE-14215 > URL: https://issues.apache.org/jira/browse/HIVE-14215 > Project: Hive > Issue Type: Bug >Affects Versions: 2.2.0 >Reporter: Peter Vary >Assignee: Peter Vary >Priority: Minor > > If the MR task is finished after printing the cumulative CPU time then there > is the possibility to print inconsistent CPU usage information. > Correct one: > {noformat} > 2016-07-12 11:31:42,961 Stage-3 map = 0%, reduce = 0% > 2016-07-12 11:31:48,237 Stage-3 map = 100%, reduce = 0%, Cumulative CPU 2.5 > sec > MapReduce Total cumulative CPU time: 2 seconds 500 msec > Ended Job = job_1468321038188_0003 > MapReduce Jobs Launched: > Stage-Stage-3: Map: 1 Cumulative CPU: 2.5 sec HDFS Read: 5864 HDFS Write: > 103 SUCCESS > Total MapReduce CPU Time Spent: 2 seconds 500 msec > {noformat} > One type of inconsistent data (easily reproducible one): > {noformat} > 2016-07-12 11:39:00,540 Stage-3 map = 0%, reduce = 0% > Ended Job = job_1468321038188_0004 > MapReduce Jobs Launched: > Stage-Stage-3: Map: 1 Cumulative CPU: 2.51 sec HDFS Read: 5864 HDFS > Write: 103 SUCCESS > Total MapReduce CPU Time Spent: 2 seconds 510 msec > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-14215) Displaying inconsistent CPU usage data with MR execution engine
[ https://issues.apache.org/jira/browse/HIVE-14215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15372929#comment-15372929 ] Peter Vary commented on HIVE-14215: --- And an here it is even worse: {noformat} 2016-07-12 14:00:42,190 Stage-3 map = 0%, reduce = 0% 2016-07-12 14:00:50,465 Stage-3 map = 67%, reduce = 0%, Cumulative CPU 6.56 sec MapReduce Total cumulative CPU time: 6 seconds 560 msec Ended Job = job_1468321038188_0018 MapReduce Jobs Launched: Stage-Stage-3: Map: 3 Cumulative CPU: 10.26 sec HDFS Read: 383060 HDFS Write: 6190612 SUCCESS Total MapReduce CPU Time Spent: 10 seconds 260 msec {noformat} > Displaying inconsistent CPU usage data with MR execution engine > --- > > Key: HIVE-14215 > URL: https://issues.apache.org/jira/browse/HIVE-14215 > Project: Hive > Issue Type: Bug >Affects Versions: 2.2.0 >Reporter: Peter Vary >Assignee: Peter Vary >Priority: Minor > > If the MR task is finished after printing the cumulative CPU time then there > is the possibility to print inconsistent CPU usage information. > Correct one: > {noformat} > 2016-07-12 11:31:42,961 Stage-3 map = 0%, reduce = 0% > 2016-07-12 11:31:48,237 Stage-3 map = 100%, reduce = 0%, Cumulative CPU 2.5 > sec > MapReduce Total cumulative CPU time: 2 seconds 500 msec > Ended Job = job_1468321038188_0003 > MapReduce Jobs Launched: > Stage-Stage-3: Map: 1 Cumulative CPU: 2.5 sec HDFS Read: 5864 HDFS Write: > 103 SUCCESS > Total MapReduce CPU Time Spent: 2 seconds 500 msec > {noformat} > One type of inconsistent data (easily reproducible one): > {noformat} > 2016-07-12 11:39:00,540 Stage-3 map = 0%, reduce = 0% > Ended Job = job_1468321038188_0004 > MapReduce Jobs Launched: > Stage-Stage-3: Map: 1 Cumulative CPU: 2.51 sec HDFS Read: 5864 HDFS > Write: 103 SUCCESS > Total MapReduce CPU Time Spent: 2 seconds 510 msec > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)