[jira] [Commented] (HIVE-17969) Metastore to alter table in batches of partitions when renaming table
[ https://issues.apache.org/jira/browse/HIVE-17969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16245512#comment-16245512 ] Peter Vary commented on HIVE-17969: --- +1 > Metastore to alter table in batches of partitions when renaming table > - > > Key: HIVE-17969 > URL: https://issues.apache.org/jira/browse/HIVE-17969 > Project: Hive > Issue Type: Improvement > Components: Metastore >Reporter: Adam Szita >Assignee: Adam Szita > Attachments: HIVE-17969.0.patch, HIVE-17969.1.patch, > HIVE-17969.2.patch, batched.png, hive9447OptimizationOnly.png, original.png > > > I'm currently trying to speed up the {{alter table rename to}} feature of > HMS. The recently submitted change (HIVE-9447) already helps a lot especially > on Oracle HMS DBs. > This time I intend to gain throughput independently of DB types by enabling > HMS to execute this alter table command on batches of partitions (rather than > 1by1) -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[jira] [Commented] (HIVE-17969) Metastore to alter table in batches of partitions when renaming table
[ https://issues.apache.org/jira/browse/HIVE-17969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16245464#comment-16245464 ] Adam Szita commented on HIVE-17969: --- Above tests results are all good for us, the listed failures are unrelated to this patch. > Metastore to alter table in batches of partitions when renaming table > - > > Key: HIVE-17969 > URL: https://issues.apache.org/jira/browse/HIVE-17969 > Project: Hive > Issue Type: Improvement > Components: Metastore >Reporter: Adam Szita >Assignee: Adam Szita > Attachments: HIVE-17969.0.patch, HIVE-17969.1.patch, > HIVE-17969.2.patch, batched.png, hive9447OptimizationOnly.png, original.png > > > I'm currently trying to speed up the {{alter table rename to}} feature of > HMS. The recently submitted change (HIVE-9447) already helps a lot especially > on Oracle HMS DBs. > This time I intend to gain throughput independently of DB types by enabling > HMS to execute this alter table command on batches of partitions (rather than > 1by1) -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[jira] [Commented] (HIVE-17969) Metastore to alter table in batches of partitions when renaming table
[ https://issues.apache.org/jira/browse/HIVE-17969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16242404#comment-16242404 ] Hive QA commented on HIVE-17969: Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12896379/HIVE-17969.2.patch {color:red}ERROR:{color} -1 due to no test(s) being added or modified. {color:red}ERROR:{color} -1 due to 10 failed/errored test(s), 11366 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[insert_values_orig_table_use_metadata] (batchId=62) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[llap_acid_fast] (batchId=157) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[sysdb] (batchId=156) org.apache.hadoop.hive.cli.TestNegativeMinimrCliDriver.testCliDriver[ct_noperm_loc] (batchId=94) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[subquery_multi] (batchId=111) org.apache.hadoop.hive.cli.TestTezPerfCliDriver.testCliDriver[query14] (batchId=243) org.apache.hadoop.hive.cli.TestTezPerfCliDriver.testCliDriver[query23] (batchId=243) org.apache.hadoop.hive.cli.control.TestDanglingQOuts.checkDanglingQOut (batchId=206) org.apache.hadoop.hive.ql.exec.tez.TestWorkloadManager.testDestroyAndReturn (batchId=281) org.apache.hadoop.hive.ql.parse.TestReplicationScenarios.testConstraints (batchId=223) {noformat} Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/7685/testReport Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/7685/console Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-7685/ Messages: {noformat} Executing org.apache.hive.ptest.execution.TestCheckPhase Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 10 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12896379 - PreCommit-HIVE-Build > Metastore to alter table in batches of partitions when renaming table > - > > Key: HIVE-17969 > URL: https://issues.apache.org/jira/browse/HIVE-17969 > Project: Hive > Issue Type: Improvement > Components: Metastore >Reporter: Adam Szita >Assignee: Adam Szita > Attachments: HIVE-17969.0.patch, HIVE-17969.1.patch, > HIVE-17969.2.patch, batched.png, hive9447OptimizationOnly.png, original.png > > > I'm currently trying to speed up the {{alter table rename to}} feature of > HMS. The recently submitted change (HIVE-9447) already helps a lot especially > on Oracle HMS DBs. > This time I intend to gain throughput independently of DB types by enabling > HMS to execute this alter table command on batches of partitions (rather than > 1by1) -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[jira] [Commented] (HIVE-17969) Metastore to alter table in batches of partitions when renaming table
[ https://issues.apache.org/jira/browse/HIVE-17969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16241790#comment-16241790 ] Adam Szita commented on HIVE-17969: --- Test results shows some related failures: *.testAlterViewPartition. It is due to a missing nullcheck. This and another small issue is now fixed in [^HIVE-17969.2.patch] > Metastore to alter table in batches of partitions when renaming table > - > > Key: HIVE-17969 > URL: https://issues.apache.org/jira/browse/HIVE-17969 > Project: Hive > Issue Type: Improvement > Components: Metastore >Reporter: Adam Szita >Assignee: Adam Szita > Attachments: HIVE-17969.0.patch, HIVE-17969.1.patch, > HIVE-17969.2.patch, batched.png, hive9447OptimizationOnly.png, original.png > > > I'm currently trying to speed up the {{alter table rename to}} feature of > HMS. The recently submitted change (HIVE-9447) already helps a lot especially > on Oracle HMS DBs. > This time I intend to gain throughput independently of DB types by enabling > HMS to execute this alter table command on batches of partitions (rather than > 1by1) -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[jira] [Commented] (HIVE-17969) Metastore to alter table in batches of partitions when renaming table
[ https://issues.apache.org/jira/browse/HIVE-17969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16240953#comment-16240953 ] Hive QA commented on HIVE-17969: Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12896208/HIVE-17969.1.patch {color:red}ERROR:{color} -1 due to no test(s) being added or modified. {color:red}ERROR:{color} -1 due to 12 failed/errored test(s), 11355 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[insert_values_orig_table_use_metadata] (batchId=62) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[sysdb] (batchId=156) org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver[explainanalyze_2] (batchId=102) org.apache.hadoop.hive.cli.TestNegativeMinimrCliDriver.testCliDriver[ct_noperm_loc] (batchId=94) org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[subquery_multi] (batchId=111) org.apache.hadoop.hive.cli.control.TestDanglingQOuts.checkDanglingQOut (batchId=206) org.apache.hadoop.hive.metastore.TestEmbeddedHiveMetaStore.testAlterViewParititon (batchId=211) org.apache.hadoop.hive.metastore.TestRemoteHiveMetaStore.testAlterViewParititon (batchId=213) org.apache.hadoop.hive.metastore.TestSetUGIOnBothClientServer.testAlterViewParititon (batchId=209) org.apache.hadoop.hive.metastore.TestSetUGIOnOnlyClient.testAlterViewParititon (batchId=208) org.apache.hadoop.hive.metastore.TestSetUGIOnOnlyServer.testAlterViewParititon (batchId=218) org.apache.hadoop.hive.ql.parse.TestReplicationScenarios.testConstraints (batchId=223) {noformat} Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/7664/testReport Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/7664/console Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-7664/ Messages: {noformat} Executing org.apache.hive.ptest.execution.TestCheckPhase Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 12 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12896208 - PreCommit-HIVE-Build > Metastore to alter table in batches of partitions when renaming table > - > > Key: HIVE-17969 > URL: https://issues.apache.org/jira/browse/HIVE-17969 > Project: Hive > Issue Type: Improvement > Components: Metastore >Reporter: Adam Szita >Assignee: Adam Szita > Attachments: HIVE-17969.0.patch, HIVE-17969.1.patch, batched.png, > hive9447OptimizationOnly.png, original.png > > > I'm currently trying to speed up the {{alter table rename to}} feature of > HMS. The recently submitted change (HIVE-9447) already helps a lot especially > on Oracle HMS DBs. > This time I intend to gain throughput independently of DB types by enabling > HMS to execute this alter table command on batches of partitions (rather than > 1by1) -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[jira] [Commented] (HIVE-17969) Metastore to alter table in batches of partitions when renaming table
[ https://issues.apache.org/jira/browse/HIVE-17969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16240084#comment-16240084 ] Peter Vary commented on HIVE-17969: --- +1 pending tests > Metastore to alter table in batches of partitions when renaming table > - > > Key: HIVE-17969 > URL: https://issues.apache.org/jira/browse/HIVE-17969 > Project: Hive > Issue Type: Improvement > Components: Metastore >Reporter: Adam Szita >Assignee: Adam Szita > Attachments: HIVE-17969.0.patch, batched.png, > hive9447OptimizationOnly.png, original.png > > > I'm currently trying to speed up the {{alter table rename to}} feature of > HMS. The recently submitted change (HIVE-9447) already helps a lot especially > on Oracle HMS DBs. > This time I intend to gain throughput independently of DB types by enabling > HMS to execute this alter table command on batches of partitions (rather than > 1by1) -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[jira] [Commented] (HIVE-17969) Metastore to alter table in batches of partitions when renaming table
[ https://issues.apache.org/jira/browse/HIVE-17969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238012#comment-16238012 ] Hive QA commented on HIVE-17969: Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12895511/batched.png {color:red}ERROR:{color} -1 due to build exiting with an error Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/7616/testReport Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/7616/console Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-7616/ Messages: {noformat} Executing org.apache.hive.ptest.execution.TestCheckPhase Executing org.apache.hive.ptest.execution.PrepPhase Tests exited with: NonZeroExitCodeException Command 'bash /data/hiveptest/working/scratch/source-prep.sh' failed with exit status 1 and output '+ date '+%Y-%m-%d %T.%3N' 2017-11-03 17:11:43.872 + [[ -n /usr/lib/jvm/java-8-openjdk-amd64 ]] + export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64 + JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64 + export PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games + PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games + export 'ANT_OPTS=-Xmx1g -XX:MaxPermSize=256m ' + ANT_OPTS='-Xmx1g -XX:MaxPermSize=256m ' + export 'MAVEN_OPTS=-Xmx1g ' + MAVEN_OPTS='-Xmx1g ' + cd /data/hiveptest/working/ + tee /data/hiveptest/logs/PreCommit-HIVE-Build-7616/source-prep.txt + [[ false == \t\r\u\e ]] + mkdir -p maven ivy + [[ git = \s\v\n ]] + [[ git = \g\i\t ]] + [[ -z master ]] + [[ -d apache-github-source-source ]] + [[ ! -d apache-github-source-source/.git ]] + [[ ! -d apache-github-source-source ]] + date '+%Y-%m-%d %T.%3N' 2017-11-03 17:11:43.875 + cd apache-github-source-source + git fetch origin + git reset --hard HEAD HEAD is now at c68b25f HIVE-17936 : Dynamic Semijoin Reduction : markSemiJoinForDPP marks unwanted semijoin branches (Deepak Jaiswal via Jason Dere) + git clean -f -d + git checkout master Already on 'master' Your branch is up-to-date with 'origin/master'. + git reset --hard origin/master HEAD is now at c68b25f HIVE-17936 : Dynamic Semijoin Reduction : markSemiJoinForDPP marks unwanted semijoin branches (Deepak Jaiswal via Jason Dere) + git merge --ff-only origin/master Already up-to-date. + date '+%Y-%m-%d %T.%3N' 2017-11-03 17:11:44.402 + patchCommandPath=/data/hiveptest/working/scratch/smart-apply-patch.sh + patchFilePath=/data/hiveptest/working/scratch/build.patch + [[ -f /data/hiveptest/working/scratch/build.patch ]] + chmod +x /data/hiveptest/working/scratch/smart-apply-patch.sh + /data/hiveptest/working/scratch/smart-apply-patch.sh /data/hiveptest/working/scratch/build.patch patch: Only garbage was found in the patch input. patch: Only garbage was found in the patch input. patch: Only garbage was found in the patch input. fatal: unrecognized input The patch does not appear to apply with p0, p1, or p2 + exit 1 ' {noformat} This message is automatically generated. ATTACHMENT ID: 12895511 - PreCommit-HIVE-Build > Metastore to alter table in batches of partitions when renaming table > - > > Key: HIVE-17969 > URL: https://issues.apache.org/jira/browse/HIVE-17969 > Project: Hive > Issue Type: Improvement > Components: Metastore >Reporter: Adam Szita >Assignee: Adam Szita >Priority: Major > Attachments: HIVE-17969.0.patch, batched.png, > hive9447OptimizationOnly.png, original.png > > > I'm currently trying to speed up the {{alter table rename to}} feature of > HMS. The recently submitted change (HIVE-9447) already helps a lot especially > on Oracle HMS DBs. > This time I intend to gain throughput independently of DB types by enabling > HMS to execute this alter table command on batches of partitions (rather than > 1by1) -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[jira] [Commented] (HIVE-17969) Metastore to alter table in batches of partitions when renaming table
[ https://issues.apache.org/jira/browse/HIVE-17969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238010#comment-16238010 ] Hive QA commented on HIVE-17969: Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12895511/batched.png {color:red}ERROR:{color} -1 due to build exiting with an error Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/7615/testReport Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/7615/console Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-7615/ Messages: {noformat} Executing org.apache.hive.ptest.execution.TestCheckPhase Executing org.apache.hive.ptest.execution.PrepPhase Tests exited with: NonZeroExitCodeException Command 'bash /data/hiveptest/working/scratch/source-prep.sh' failed with exit status 1 and output '+ date '+%Y-%m-%d %T.%3N' 2017-11-03 17:11:07.275 + [[ -n /usr/lib/jvm/java-8-openjdk-amd64 ]] + export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64 + JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64 + export PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games + PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games + export 'ANT_OPTS=-Xmx1g -XX:MaxPermSize=256m ' + ANT_OPTS='-Xmx1g -XX:MaxPermSize=256m ' + export 'MAVEN_OPTS=-Xmx1g ' + MAVEN_OPTS='-Xmx1g ' + cd /data/hiveptest/working/ + tee /data/hiveptest/logs/PreCommit-HIVE-Build-7615/source-prep.txt + [[ false == \t\r\u\e ]] + mkdir -p maven ivy + [[ git = \s\v\n ]] + [[ git = \g\i\t ]] + [[ -z master ]] + [[ -d apache-github-source-source ]] + [[ ! -d apache-github-source-source/.git ]] + [[ ! -d apache-github-source-source ]] + date '+%Y-%m-%d %T.%3N' 2017-11-03 17:11:07.279 + cd apache-github-source-source + git fetch origin + git reset --hard HEAD HEAD is now at c68b25f HIVE-17936 : Dynamic Semijoin Reduction : markSemiJoinForDPP marks unwanted semijoin branches (Deepak Jaiswal via Jason Dere) + git clean -f -d Removing common/src/java/org/apache/hadoop/hive/conf/HiveConf.java.orig Removing standalone-metastore/src/gen/org/ + git checkout master Already on 'master' Your branch is up-to-date with 'origin/master'. + git reset --hard origin/master HEAD is now at c68b25f HIVE-17936 : Dynamic Semijoin Reduction : markSemiJoinForDPP marks unwanted semijoin branches (Deepak Jaiswal via Jason Dere) + git merge --ff-only origin/master Already up-to-date. + date '+%Y-%m-%d %T.%3N' 2017-11-03 17:11:12.117 + patchCommandPath=/data/hiveptest/working/scratch/smart-apply-patch.sh + patchFilePath=/data/hiveptest/working/scratch/build.patch + [[ -f /data/hiveptest/working/scratch/build.patch ]] + chmod +x /data/hiveptest/working/scratch/smart-apply-patch.sh + /data/hiveptest/working/scratch/smart-apply-patch.sh /data/hiveptest/working/scratch/build.patch patch: Only garbage was found in the patch input. patch: Only garbage was found in the patch input. patch: Only garbage was found in the patch input. fatal: unrecognized input The patch does not appear to apply with p0, p1, or p2 + exit 1 ' {noformat} This message is automatically generated. ATTACHMENT ID: 12895511 - PreCommit-HIVE-Build > Metastore to alter table in batches of partitions when renaming table > - > > Key: HIVE-17969 > URL: https://issues.apache.org/jira/browse/HIVE-17969 > Project: Hive > Issue Type: Improvement > Components: Metastore >Reporter: Adam Szita >Assignee: Adam Szita >Priority: Major > Attachments: HIVE-17969.0.patch, batched.png, > hive9447OptimizationOnly.png, original.png > > > I'm currently trying to speed up the {{alter table rename to}} feature of > HMS. The recently submitted change (HIVE-9447) already helps a lot especially > on Oracle HMS DBs. > This time I intend to gain throughput independently of DB types by enabling > HMS to execute this alter table command on batches of partitions (rather than > 1by1) -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[jira] [Commented] (HIVE-17969) Metastore to alter table in batches of partitions when renaming table
[ https://issues.apache.org/jira/browse/HIVE-17969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16236683#comment-16236683 ] Adam Szita commented on HIVE-17969: --- The optimization in [^HIVE-17969.0.patch] aims at reducing the time the query takes during the alterPartition() call. Inside this method we call removeUnusedColumnDescriptor(). In case we have many (if not all..) partitions with the same schema we make a lot of redundant calls when checking removability of the old column descriptor. Instead of this we should take a batch of partitions, see what old CDs there are (in case there is no schema change in 30k partitions than it will be one for the 30k parts..) and then run the check on this distinct set of CDs only - thereby saving a lot of DB query time. [~pvary] can you take a look please? > Metastore to alter table in batches of partitions when renaming table > - > > Key: HIVE-17969 > URL: https://issues.apache.org/jira/browse/HIVE-17969 > Project: Hive > Issue Type: Improvement > Components: Metastore >Reporter: Adam Szita >Assignee: Adam Szita >Priority: Major > Attachments: HIVE-17969.0.patch > > > I'm currently trying to speed up the {{alter table rename to}} feature of > HMS. The recently submitted change (HIVE-9447) already helps a lot especially > on Oracle HMS DBs. > This time I intend to gain throughput independently of DB types by enabling > HMS to execute this alter table command on batches of partitions (rather than > 1by1) -- This message was sent by Atlassian JIRA (v6.4.14#64029)