[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16941056#comment-16941056 ]

L. C. Hsieh commented on HIVE-18702:

Hi [~rajesh.balamohan], I'm tracing this issue down from Spark. Could you let me know the number of the Jira ticket you created? Thanks.

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Ivan Suller
> Priority: Major
> Fix For: 4.0.0
> Attachments: HIVE-18702.1.patch, HIVE-18702.2.patch, HIVE-18702.3.patch, HIVE-18702.3.patch, HIVE-18702.4.patch, HIVE-18702.5.patch
>
>
> Enable Hive on TEZ. (MR works fine).
> *STEP 1. Create test data*
> {code}
> nano /home/test/users.txt
> {code}
> Add to file:
> {code}
> Peter,34
> John,25
> Mary,28
> {code}
> {code}
> hadoop fs -mkdir /bug
> hadoop fs -copyFromLocal /home/test/users.txt /bug
> hadoop fs -ls /bug
> {code}
> *EXPECTED RESULT:*
> {code}
> Found 2 items
>
> -rwxr-xr-x 3 root root 25 2015-10-15 16:11 /bug/users.txt
> {code}
> *STEP 2. Upload data to hive*
> {code}
> create external table bug(name string, age int) ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' LINES TERMINATED BY '\n' LOCATION '/bug';
> select * from bug;
> {code}
> *EXPECTED RESULT:*
> {code}
> OK
> Peter 34
> John 25
> Mary 28
> {code}
> {code}
> create external table bug1(name string, age int) ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' LINES TERMINATED BY '\n' LOCATION '/bug1';
> insert overwrite table bug select * from bug1;
> select * from bug;
> {code}
> *EXPECTED RESULT:*
> {code}
> OK
> Time taken: 0.097 seconds
> {code}
> *ACTUAL RESULT:*
> {code}
> hive> select * from bug;
> OK
> Peter 34
> John 25
> Mary 28
> Time taken: 0.198 seconds, Fetched: 3 row(s)
> {code}

--
This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16862664#comment-16862664 ]

Rajesh Balamohan commented on HIVE-18702:

Thanks for creating and fixing this ticket. There is another corner case where this would still show a wrong result.

1. Table `A` is created with partition `y`.
2. Data is added by an external system (say y="1"), but is not yet registered in table `A`.
3. Run `insert overwrite` on table `A`.
4. This would still show the old contents, because in this case `oldPartPath` would be null, so the existing data wouldn't be deleted.

I will create a separate ticket to track this issue with a small sample test case.

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Ivan Suller
> Priority: Major
> Fix For: 4.0.0
> Attachments: HIVE-18702.1.patch, HIVE-18702.2.patch, HIVE-18702.3.patch, HIVE-18702.3.patch, HIVE-18702.4.patch, HIVE-18702.5.patch
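The corner case in the comment above can be illustrated with a toy model on the local filesystem. This is only a sketch under stated assumptions, not Hive code: a temporary directory stands in for the warehouse, a text file stands in for the metastore, and `insert_overwrite`, `registry`, and the file names are hypothetical. It assumes, as the comment suggests, that old files are removed only when the partition is already registered.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Toy model only: a temp directory stands in for the warehouse and a text
# file stands in for the metastore. None of these names come from Hive.
warehouse="$(mktemp -d)"
registry="$warehouse/registered_partitions.txt"
: > "$registry"

# Step 2 of the comment: an external system writes y=1 without registering it.
mkdir -p "$warehouse/A/y=1"
echo "old,1" > "$warehouse/A/y=1/old_file"

# Hypothetical stand-in for an overwrite that cleans up only the partitions
# the registry already knows about (the "old partition path is null" case).
insert_overwrite() {
  local part="$1"
  local part_dir="$warehouse/A/$part"
  if grep -qxF "$part" "$registry"; then
    rm -rf "$part_dir"          # registered partition: old files are removed
  fi                            # unregistered partition: nothing is deleted
  mkdir -p "$part_dir"
  echo "new,2" > "$part_dir/new_file"
  echo "$part" >> "$registry"
}

insert_overwrite "y=1"
ls "$warehouse/A/y=1"    # lists both new_file and old_file

insert_overwrite "y=1"
ls "$warehouse/A/y=1"    # lists only new_file
```

In this model the first overwrite leaves the externally written file in place because the partition was unknown at that point; only the second overwrite, once the partition is registered, replaces the directory contents.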
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16818806#comment-16818806 ]

Hive QA commented on HIVE-18702:

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12966034/HIVE-18702.5.patch

{color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified.

{color:green}SUCCESS:{color} +1 due to 15942 tests passed

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/16967/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/16967/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-16967/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.YetusPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12966034 - PreCommit-HIVE-Build

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Ivan Suller
> Priority: Major
> Fix For: 2.4.0, 3.2.0
> Attachments: HIVE-18702.1.patch, HIVE-18702.2.patch, HIVE-18702.3.patch, HIVE-18702.3.patch, HIVE-18702.4.patch, HIVE-18702.5.patch
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16818768#comment-16818768 ]

Hive QA commented on HIVE-18702:

| (/) *{color:green}+1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
|| || || || {color:brown} Prechecks {color} ||
| {color:green}+1{color} | {color:green} @author {color} | {color:green} 0m 0s{color} | {color:green} The patch does not contain any @author tags. {color} |
|| || || || {color:brown} master Compile Tests {color} ||
| {color:blue}0{color} | {color:blue} mvndep {color} | {color:blue} 1m 56s{color} | {color:blue} Maven dependency ordering for branch {color} |
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 7m 13s{color} | {color:green} master passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green} 1m 7s{color} | {color:green} master passed {color} |
| {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 41s{color} | {color:green} master passed {color} |
| {color:blue}0{color} | {color:blue} findbugs {color} | {color:blue} 4m 1s{color} | {color:blue} ql in master has 2265 extant Findbugs warnings. {color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 59s{color} | {color:green} master passed {color} |
|| || || || {color:brown} Patch Compile Tests {color} ||
| {color:blue}0{color} | {color:blue} mvndep {color} | {color:blue} 0m 28s{color} | {color:blue} Maven dependency ordering for patch {color} |
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 1m 31s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green} 1m 7s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} javac {color} | {color:green} 1m 7s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 41s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} whitespace {color} | {color:green} 0m 0s{color} | {color:green} The patch has no whitespace issues. {color} |
| {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 4m 14s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 1m 0s{color} | {color:green} the patch passed {color} |
|| || || || {color:brown} Other Tests {color} ||
| {color:green}+1{color} | {color:green} asflicense {color} | {color:green} 0m 14s{color} | {color:green} The patch does not generate ASF License warnings. {color} |
| {color:black}{color} | {color:black} {color} | {color:black} 25m 42s{color} | {color:black} {color} |
\\
\\
|| Subsystem || Report/Notes ||
| Optional Tests | asflicense javac javadoc findbugs checkstyle compile |
| uname | Linux hiveptest-server-upstream 3.16.0-4-amd64 #1 SMP Debian 3.16.43-2+deb8u5 (2017-09-19) x86_64 GNU/Linux |
| Build tool | maven |
| Personality | /data/hiveptest/working/yetus_PreCommit-HIVE-Build-16967/dev-support/hive-personality.sh |
| git revision | master / 5759778 |
| Default Java | 1.8.0_111 |
| findbugs | v3.0.0 |
| modules | C: ql itests U: . |
| Console output | http://104.198.109.242/logs//PreCommit-HIVE-Build-16967/yetus.txt |
| Powered by | Apache Yetus http://yetus.apache.org |

This message was automatically generated.

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Ivan Suller
> Priority: Major
> Fix For: 2.4.0, 3.2.0
> Attachments: HIVE-18702.1.patch, HIVE-18702.2.patch, HIVE-18702.3.patch, HIVE-18702.3.patch, HIVE-18702.4.patch, HIVE-18702.5.patch
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16818654#comment-16818654 ]

Zoltan Haindrich commented on HIVE-18702:

+1 pending tests

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Ivan Suller
> Priority: Major
> Fix For: 2.4.0, 3.2.0
> Attachments: HIVE-18702.1.patch, HIVE-18702.2.patch, HIVE-18702.3.patch, HIVE-18702.3.patch, HIVE-18702.4.patch, HIVE-18702.5.patch
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16817973#comment-16817973 ]

Hive QA commented on HIVE-18702:

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12965935/HIVE-18702.4.patch

{color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 1 failed/errored test(s), 15940 tests executed

*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[insert_overwrite] (batchId=21)
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/16954/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/16954/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-16954/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.YetusPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 1 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12965935 - PreCommit-HIVE-Build

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Ivan Suller
> Priority: Major
> Fix For: 2.4.0, 3.2.0
> Attachments: HIVE-18702.1.patch, HIVE-18702.2.patch, HIVE-18702.3.patch, HIVE-18702.3.patch, HIVE-18702.4.patch
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16817934#comment-16817934 ]

Hive QA commented on HIVE-18702:

| (/) *{color:green}+1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
|| || || || {color:brown} Prechecks {color} ||
| {color:green}+1{color} | {color:green} @author {color} | {color:green} 0m 0s{color} | {color:green} The patch does not contain any @author tags. {color} |
|| || || || {color:brown} master Compile Tests {color} ||
| {color:blue}0{color} | {color:blue} mvndep {color} | {color:blue} 1m 10s{color} | {color:blue} Maven dependency ordering for branch {color} |
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 7m 39s{color} | {color:green} master passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green} 1m 7s{color} | {color:green} master passed {color} |
| {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 42s{color} | {color:green} master passed {color} |
| {color:blue}0{color} | {color:blue} findbugs {color} | {color:blue} 4m 12s{color} | {color:blue} ql in master has 2265 extant Findbugs warnings. {color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 59s{color} | {color:green} master passed {color} |
|| || || || {color:brown} Patch Compile Tests {color} ||
| {color:blue}0{color} | {color:blue} mvndep {color} | {color:blue} 0m 29s{color} | {color:blue} Maven dependency ordering for patch {color} |
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 1m 26s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green} 1m 5s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} javac {color} | {color:green} 1m 5s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 42s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} whitespace {color} | {color:green} 0m 0s{color} | {color:green} The patch has no whitespace issues. {color} |
| {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 4m 14s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 59s{color} | {color:green} the patch passed {color} |
|| || || || {color:brown} Other Tests {color} ||
| {color:green}+1{color} | {color:green} asflicense {color} | {color:green} 0m 14s{color} | {color:green} The patch does not generate ASF License warnings. {color} |
| {color:black}{color} | {color:black} {color} | {color:black} 25m 24s{color} | {color:black} {color} |
\\
\\
|| Subsystem || Report/Notes ||
| Optional Tests | asflicense javac javadoc findbugs checkstyle compile |
| uname | Linux hiveptest-server-upstream 3.16.0-4-amd64 #1 SMP Debian 3.16.43-2+deb8u5 (2017-09-19) x86_64 GNU/Linux |
| Build tool | maven |
| Personality | /data/hiveptest/working/yetus_PreCommit-HIVE-Build-16954/dev-support/hive-personality.sh |
| git revision | master / 079a720 |
| Default Java | 1.8.0_111 |
| findbugs | v3.0.0 |
| modules | C: ql itests U: . |
| Console output | http://104.198.109.242/logs//PreCommit-HIVE-Build-16954/yetus.txt |
| Powered by | Apache Yetus http://yetus.apache.org |

This message was automatically generated.

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Ivan Suller
> Priority: Major
> Fix For: 2.4.0, 3.2.0
> Attachments: HIVE-18702.1.patch, HIVE-18702.2.patch, HIVE-18702.3.patch, HIVE-18702.3.patch, HIVE-18702.4.patch
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16816440#comment-16816440 ]

Hive QA commented on HIVE-18702:

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12965725/HIVE-18702.3.patch

{color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 7 failed/errored test(s), 15940 tests executed

*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[insert_overwrite] (batchId=21)
org.apache.hadoop.hive.ql.parse.TestReplTableMigrationWithJsonFormat.testBootstrapLoadMigrationManagedToAcid (batchId=254)
org.apache.hadoop.hive.ql.parse.TestReplWithJsonMessageFormat.testConcatenatePartitionedTable (batchId=246)
org.apache.hadoop.hive.ql.parse.TestReplWithJsonMessageFormat.testConcatenateTable (batchId=246)
org.apache.hadoop.hive.ql.parse.TestReplicationScenarios.testConcatenatePartitionedTable (batchId=252)
org.apache.hadoop.hive.ql.parse.TestReplicationScenarios.testConcatenateTable (batchId=252)
org.apache.hadoop.hive.ql.parse.TestReplicationWithTableMigration.testBootstrapLoadMigrationManagedToAcid (batchId=249)
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/16934/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/16934/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-16934/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.YetusPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 7 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12965725 - PreCommit-HIVE-Build

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Ivan Suller
> Priority: Major
> Fix For: 2.4.0, 3.2.0
> Attachments: HIVE-18702.1.patch, HIVE-18702.2.patch, HIVE-18702.3.patch, HIVE-18702.3.patch
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16816388#comment-16816388 ]

Hive QA commented on HIVE-18702:

| (/) *{color:green}+1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
|| || || || {color:brown} Prechecks {color} ||
| {color:green}+1{color} | {color:green} @author {color} | {color:green} 0m 0s{color} | {color:green} The patch does not contain any @author tags. {color} |
|| || || || {color:brown} master Compile Tests {color} ||
| {color:blue}0{color} | {color:blue} mvndep {color} | {color:blue} 1m 49s{color} | {color:blue} Maven dependency ordering for branch {color} |
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 7m 10s{color} | {color:green} master passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green} 1m 9s{color} | {color:green} master passed {color} |
| {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 42s{color} | {color:green} master passed {color} |
| {color:blue}0{color} | {color:blue} findbugs {color} | {color:blue} 4m 6s{color} | {color:blue} ql in master has 2265 extant Findbugs warnings. {color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 57s{color} | {color:green} master passed {color} |
|| || || || {color:brown} Patch Compile Tests {color} ||
| {color:blue}0{color} | {color:blue} mvndep {color} | {color:blue} 0m 30s{color} | {color:blue} Maven dependency ordering for patch {color} |
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 1m 25s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green} 1m 4s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} javac {color} | {color:green} 1m 4s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 41s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} whitespace {color} | {color:green} 0m 0s{color} | {color:green} The patch has no whitespace issues. {color} |
| {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 4m 13s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 58s{color} | {color:green} the patch passed {color} |
|| || || || {color:brown} Other Tests {color} ||
| {color:green}+1{color} | {color:green} asflicense {color} | {color:green} 0m 14s{color} | {color:green} The patch does not generate ASF License warnings. {color} |
| {color:black}{color} | {color:black} {color} | {color:black} 25m 28s{color} | {color:black} {color} |
\\
\\
|| Subsystem || Report/Notes ||
| Optional Tests | asflicense javac javadoc findbugs checkstyle compile |
| uname | Linux hiveptest-server-upstream 3.16.0-4-amd64 #1 SMP Debian 3.16.43-2+deb8u5 (2017-09-19) x86_64 GNU/Linux |
| Build tool | maven |
| Personality | /data/hiveptest/working/yetus_PreCommit-HIVE-Build-16934/dev-support/hive-personality.sh |
| git revision | master / ec6af1b |
| Default Java | 1.8.0_111 |
| findbugs | v3.0.0 |
| modules | C: ql itests U: . |
| Console output | http://104.198.109.242/logs//PreCommit-HIVE-Build-16934/yetus.txt |
| Powered by | Apache Yetus http://yetus.apache.org |

This message was automatically generated.

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Ivan Suller
> Priority: Major
> Fix For: 2.4.0, 3.2.0
> Attachments: HIVE-18702.1.patch, HIVE-18702.2.patch, HIVE-18702.3.patch, HIVE-18702.3.patch
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16816278#comment-16816278 ]

Hive QA commented on HIVE-18702:

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12911562/HIVE-18702.2.patch

{color:red}ERROR:{color} -1 due to build exiting with an error

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/16932/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/16932/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-16932/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Tests exited with: NonZeroExitCodeException
Command 'bash /data/hiveptest/working/scratch/source-prep.sh' failed with exit status 1 and output '+ date '+%Y-%m-%d %T.%3N'
2019-04-12 13:47:50.577
+ [[ -n /usr/lib/jvm/java-8-openjdk-amd64 ]]
+ export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
+ JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
+ export PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games
+ PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games
+ export 'ANT_OPTS=-Xmx1g -XX:MaxPermSize=256m '
+ ANT_OPTS='-Xmx1g -XX:MaxPermSize=256m '
+ export 'MAVEN_OPTS=-Xmx1g '
+ MAVEN_OPTS='-Xmx1g '
+ cd /data/hiveptest/working/
+ tee /data/hiveptest/logs/PreCommit-HIVE-Build-16932/source-prep.txt
+ [[ false == \t\r\u\e ]]
+ mkdir -p maven ivy
+ [[ git = \s\v\n ]]
+ [[ git = \g\i\t ]]
+ [[ -z master ]]
+ [[ -d apache-github-source-source ]]
+ [[ ! -d apache-github-source-source/.git ]]
+ [[ ! -d apache-github-source-source ]]
+ date '+%Y-%m-%d %T.%3N'
2019-04-12 13:47:50.581
+ cd apache-github-source-source
+ git fetch origin
From https://github.com/apache/hive
 dfa1fc9..ec6af1b master -> origin/master
+ git reset --hard HEAD
HEAD is now at dfa1fc9 HIVE-21602: Dropping an external table created by migration case should delete the data directory (Sankar Hariappan, reviewed by Anishek Agarwal)
+ git clean -f -d
Removing standalone-metastore/metastore-server/src/gen/
+ git checkout master
Already on 'master'
Your branch is behind 'origin/master' by 1 commit, and can be fast-forwarded.
 (use "git pull" to update your local branch)
+ git reset --hard origin/master
HEAD is now at ec6af1b HIVE-21109: Support stats replication for ACID tables (Ashutosh Bapat, reviewed by Sankar Hariappan)
+ git merge --ff-only origin/master
Already up-to-date.
+ date '+%Y-%m-%d %T.%3N'
2019-04-12 13:47:51.755
+ rm -rf ../yetus_PreCommit-HIVE-Build-16932
+ mkdir ../yetus_PreCommit-HIVE-Build-16932
+ git gc
+ cp -R . ../yetus_PreCommit-HIVE-Build-16932
+ mkdir /data/hiveptest/logs/PreCommit-HIVE-Build-16932/yetus
+ patchCommandPath=/data/hiveptest/working/scratch/smart-apply-patch.sh
+ patchFilePath=/data/hiveptest/working/scratch/build.patch
+ [[ -f /data/hiveptest/working/scratch/build.patch ]]
+ chmod +x /data/hiveptest/working/scratch/smart-apply-patch.sh
+ /data/hiveptest/working/scratch/smart-apply-patch.sh /data/hiveptest/working/scratch/build.patch
error: a/ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java: does not exist in index
error: patch failed: ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java:1484
Falling back to three-way merge...
Applied patch to 'ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java' with conflicts.
Going to apply patch with: git apply -p1
error: patch failed: ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java:1484
Falling back to three-way merge...
Applied patch to 'ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java' with conflicts.
U ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java
+ result=1
+ '[' 1 -ne 0 ']'
+ rm -rf yetus_PreCommit-HIVE-Build-16932
+ exit 1
'
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12911562 - PreCommit-HIVE-Build

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Ivan Suller
> Priority: Major
> Fix For: 2.4.0, 3.2.0
>
> Attachments: HIVE-18702.1.patch, HIVE-18702.2.patch, HIVE-18702.3.patch
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16527012#comment-16527012 ]

Hive QA commented on HIVE-18702:

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12911562/HIVE-18702.2.patch

{color:red}ERROR:{color} -1 due to build exiting with an error

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/12228/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/12228/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-12228/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Tests exited with: NonZeroExitCodeException
Command 'bash /data/hiveptest/working/scratch/source-prep.sh' failed with exit status 1 and output '+ date '+%Y-%m-%d %T.%3N'
2018-06-29 01:25:16.429
+ [[ -n /usr/lib/jvm/java-8-openjdk-amd64 ]]
+ export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
+ JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
+ export PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games
+ PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games
+ export 'ANT_OPTS=-Xmx1g -XX:MaxPermSize=256m '
+ ANT_OPTS='-Xmx1g -XX:MaxPermSize=256m '
+ export 'MAVEN_OPTS=-Xmx1g '
+ MAVEN_OPTS='-Xmx1g '
+ cd /data/hiveptest/working/
+ tee /data/hiveptest/logs/PreCommit-HIVE-Build-12228/source-prep.txt
+ [[ false == \t\r\u\e ]]
+ mkdir -p maven ivy
+ [[ git = \s\v\n ]]
+ [[ git = \g\i\t ]]
+ [[ -z master ]]
+ [[ -d apache-github-source-source ]]
+ [[ ! -d apache-github-source-source/.git ]]
+ [[ ! -d apache-github-source-source ]]
+ date '+%Y-%m-%d %T.%3N'
2018-06-29 01:25:16.433
+ cd apache-github-source-source
+ git fetch origin
+ git reset --hard HEAD
HEAD is now at 1b3ac73 HIVE-20010: Fix create view over literals (Zoltan Haindrich, reviewed by Ashutosh Chauhan, Daniel Dai)
+ git clean -f -d
+ git checkout master
Already on 'master'
Your branch is up-to-date with 'origin/master'.
+ git reset --hard origin/master
HEAD is now at 1b3ac73 HIVE-20010: Fix create view over literals (Zoltan Haindrich, reviewed by Ashutosh Chauhan, Daniel Dai)
+ git merge --ff-only origin/master
Already up-to-date.
+ date '+%Y-%m-%d %T.%3N'
2018-06-29 01:25:17.012
+ rm -rf ../yetus_PreCommit-HIVE-Build-12228
+ mkdir ../yetus_PreCommit-HIVE-Build-12228
+ git gc
+ cp -R . ../yetus_PreCommit-HIVE-Build-12228
+ mkdir /data/hiveptest/logs/PreCommit-HIVE-Build-12228/yetus
+ patchCommandPath=/data/hiveptest/working/scratch/smart-apply-patch.sh
+ patchFilePath=/data/hiveptest/working/scratch/build.patch
+ [[ -f /data/hiveptest/working/scratch/build.patch ]]
+ chmod +x /data/hiveptest/working/scratch/smart-apply-patch.sh
+ /data/hiveptest/working/scratch/smart-apply-patch.sh /data/hiveptest/working/scratch/build.patch
error: a/ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java: does not exist in index
error: patch failed: ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java:1493
Falling back to three-way merge...
Applied patch to 'ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java' with conflicts.
Going to apply patch with: git apply -p1
error: patch failed: ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java:1493
Falling back to three-way merge...
Applied patch to 'ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java' with conflicts.
U ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java
+ result=1
+ '[' 1 -ne 0 ']'
+ rm -rf yetus_PreCommit-HIVE-Build-12228
+ exit 1
'
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12911562 - PreCommit-HIVE-Build

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Oleksiy Sayankin
> Priority: Major
> Fix For: 2.4.0, 3.2.0
>
> Attachments: HIVE-18702.1.patch, HIVE-18702.2.patch
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16431271#comment-16431271 ]

Vineet Garg commented on HIVE-18702:

Deferring this to 3.1.0 since the branch for 3.0.0 has been cut off. Please update the JIRA if you would like to get your patch in 3.0.0.

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Oleksiy Sayankin
> Priority: Major
> Fix For: 2.4.0, 3.1.0
>
> Attachments: HIVE-18702.1.patch, HIVE-18702.2.patch
>
>
> Enable Hive on TEZ. (MR works fine).
> *STEP 1. Create test data*
> {code}
> nano /home/test/users.txt
> {code}
> Add to file:
> {code}
> Peter,34
> John,25
> Mary,28
> {code}
> {code}
> hadoop fs -mkdir /bug
> hadoop fs -copyFromLocal /home/test/users.txt /bug
> hadoop fs -ls /bug
> {code}
> *EXPECTED RESULT:*
> {code}
> Found 2 items
>
> -rwxr-xr-x 3 root root 25 2015-10-15 16:11 /bug/users.txt
> {code}
> *STEP 2. Upload data to hive*
> {code}
> create external table bug(name string, age int) ROW FORMAT DELIMITED FIELDS
> TERMINATED BY ',' LINES TERMINATED BY '\n' LOCATION '/bug';
> select * from bug;
> {code}
> *EXPECTED RESULT:*
> {code}
> OK
> Peter 34
> John 25
> Mary 28
> {code}
> {code}
> create external table bug1(name string, age int) ROW FORMAT DELIMITED FIELDS
> TERMINATED BY ',' LINES TERMINATED BY '\n' LOCATION '/bug1';
> insert overwrite table bug select * from bug1;
> select * from bug;
> {code}
> *EXPECTED RESULT:*
> {code}
> OK
> Time taken: 0.097 seconds
> {code}
> *ACTUAL RESULT:*
> {code}
> hive> select * from bug;
> OK
> Peter 34
> John 25
> Mary 28
> Time taken: 0.198 seconds, Fetched: 3 row(s)
> {code}

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16416291#comment-16416291 ]

Ashutosh Chauhan commented on HIVE-18702:

[~osayankin] Can you reupload the patch and mark it Patch Available so that Hive QA can run on it?

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Oleksiy Sayankin
> Priority: Major
> Fix For: 3.0.0, 2.3.3
>
> Attachments: HIVE-18702.1.patch, HIVE-18702.2.patch
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16416191#comment-16416191 ]

Daniel Dai commented on HIVE-18702:

[~ashutoshc], do you plan to commit it to 2.3.3 branch?

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Oleksiy Sayankin
> Priority: Major
> Fix For: 3.0.0, 2.3.3
>
> Attachments: HIVE-18702.1.patch, HIVE-18702.2.patch
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16372882#comment-16372882 ]

Oleksiy Sayankin commented on HIVE-18702:

Done.

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Oleksiy Sayankin
> Priority: Major
> Fix For: 3.0.0, 2.3.3
>
> Attachments: HIVE-18702.1.patch, HIVE-18702.2.patch
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16370838#comment-16370838 ]

Ashutosh Chauhan commented on HIVE-18702:

+1 [~osayankin] Can you rebase your patch and reupload so that tests can run?

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Oleksiy Sayankin
> Priority: Major
> Fix For: 3.0.0, 2.3.3
>
> Attachments: HIVE-18702.1.patch
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16363229#comment-16363229 ]

Hive QA commented on HIVE-18702:

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12910402/HIVE-18702.1.patch

{color:red}ERROR:{color} -1 due to build exiting with an error

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/9198/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/9198/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-9198/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Tests exited with: NonZeroExitCodeException
Command 'bash /data/hiveptest/working/scratch/source-prep.sh' failed with exit status 1 and output '+ date '+%Y-%m-%d %T.%3N'
2018-02-13 23:34:41.053
+ [[ -n /usr/lib/jvm/java-8-openjdk-amd64 ]]
+ export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
+ JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
+ export PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games
+ PATH=/usr/lib/jvm/java-8-openjdk-amd64/bin/:/usr/local/bin:/usr/bin:/bin:/usr/local/games:/usr/games
+ export 'ANT_OPTS=-Xmx1g -XX:MaxPermSize=256m '
+ ANT_OPTS='-Xmx1g -XX:MaxPermSize=256m '
+ export 'MAVEN_OPTS=-Xmx1g '
+ MAVEN_OPTS='-Xmx1g '
+ cd /data/hiveptest/working/
+ tee /data/hiveptest/logs/PreCommit-HIVE-Build-9198/source-prep.txt
+ [[ false == \t\r\u\e ]]
+ mkdir -p maven ivy
+ [[ git = \s\v\n ]]
+ [[ git = \g\i\t ]]
+ [[ -z master ]]
+ [[ -d apache-github-source-source ]]
+ [[ ! -d apache-github-source-source/.git ]]
+ [[ ! -d apache-github-source-source ]]
+ date '+%Y-%m-%d %T.%3N'
2018-02-13 23:34:41.058
+ cd apache-github-source-source
+ git fetch origin
+ git reset --hard HEAD
HEAD is now at cf4114e HIVE-17627 : Use druid scan query instead of the select query. (Nishant Bangarwa via Slim B, Ashutosh Chauhan)
+ git clean -f -d
+ git checkout master
Already on 'master'
Your branch is up-to-date with 'origin/master'.
+ git reset --hard origin/master
HEAD is now at cf4114e HIVE-17627 : Use druid scan query instead of the select query. (Nishant Bangarwa via Slim B, Ashutosh Chauhan)
+ git merge --ff-only origin/master
Already up-to-date.
+ date '+%Y-%m-%d %T.%3N'
2018-02-13 23:34:44.680
+ rm -rf ../yetus
rm: cannot remove '../yetus/ql/target': Directory not empty
'
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12910402 - PreCommit-HIVE-Build

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Oleksiy Sayankin
> Priority: Major
> Fix For: 3.0.0, 2.3.3
>
> Attachments: HIVE-18702.1.patch
[jira] [Commented] (HIVE-18702) INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
[ https://issues.apache.org/jira/browse/HIVE-18702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16362512#comment-16362512 ]

Oleksiy Sayankin commented on HIVE-18702:

*FIXED*

*ROOT-CAUSE*

The condition in this {{if}} statement evaluates to false
{code}
FileStatus[] statuses = HiveStatsUtils.getFileStatusRecurse(
    tmpPath, ((dpCtx == null) ? 1 : dpCtx.getNumDPCols()), fs);
if(statuses != null && statuses.length > 0) {
{code}
when there are no files in {{/bug/.hive-staging_hive_2018-02-13_14-14-39_529_3325659916929491937-1/_task_tmp.-ext-1}}, so the folder {{-ext-1}} is not created. As a result, this section of code
{code}
protected void replaceFiles(Path tablePath, Path srcf, Path destf, Path oldPath, HiveConf conf,
    boolean isSrcLocal, boolean purge) throws HiveException {
  try {
    FileSystem destFs = destf.getFileSystem(conf);
    // check if srcf contains nested sub-directories
    FileStatus[] srcs;
    FileSystem srcFs;
    try {
      srcFs = srcf.getFileSystem(conf);
      srcs = srcFs.globStatus(srcf);
    } catch (IOException e) {
      throw new HiveException("Getting globStatus " + srcf.toString(), e);
    }
    if (srcs == null) {
      LOG.info("No sources specified to move: " + srcf);
      return;
    }
{code}
logs {{"No sources specified to move: " + srcf}} and returns early, so the existing values in the table are not overwritten.

*SOLUTION*

Use {{fs.exists(tmpPath)}} instead of checking {{FileStatus[] statuses}}.

> INSERT OVERWRITE TABLE doesn't clean the table directory before overwriting
> ---
>
> Key: HIVE-18702
> URL: https://issues.apache.org/jira/browse/HIVE-18702
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.2
> Reporter: Oleksiy Sayankin
> Assignee: Oleksiy Sayankin
> Priority: Major
> Fix For: 3.0.0, 2.3.3
>
> Attachments: HIVE-18702.1.patch
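
The difference between the two guards discussed in the root-cause comment can be illustrated with a small stand-alone sketch. It is not Hive code: it uses {{java.io.File}} as a stand-in for Hadoop's {{FileSystem}} so that it runs without a cluster, and the class name {{StagingCheckDemo}} and its methods are hypothetical names chosen for this illustration. It only shows why a "listing is non-empty" check and an "exists" check disagree on an empty temporary directory.

```java
import java.io.File;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;

// Hypothetical demo class; java.io.File stands in for Hadoop's FileSystem (assumption).
class StagingCheckDemo {

    // Same shape as the original guard: proceed only if listing the
    // temporary directory yields at least one entry.
    static boolean shouldCommitByListing(File tmpDir) {
        File[] statuses = tmpDir.listFiles(); // null if tmpDir is not a directory
        return statuses != null && statuses.length > 0;
    }

    // Same shape as the proposed guard: proceed whenever the temporary
    // directory itself exists, even if it holds no files.
    static boolean shouldCommitByExistence(File tmpDir) {
        return tmpDir.exists();
    }

    // Helper: creates an empty temporary directory, analogous to a task
    // directory for a query that produced no rows.
    static File newEmptyDir() {
        try {
            return Files.createTempDirectory("task_tmp").toFile();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        File emptyTmp = newEmptyDir();
        System.out.println("listing check:   " + shouldCommitByListing(emptyTmp));
        System.out.println("existence check: " + shouldCommitByExistence(emptyTmp));
        emptyTmp.delete();
    }
}
```

For an empty directory the listing-based check is false while the existence-based check is true, which corresponds to the case in this issue where the source query returns no rows and the overwrite step is skipped.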