[jira] [Commented] (HIVE-16132) DataSize stats don't seem correct in semijoin opt branch

2017-03-11 Thread Hive QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-16132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15906269#comment-15906269
 ] 

Hive QA commented on HIVE-16132:




Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12857484/HIVE-16132.6.patch

{color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 1 failed/errored test(s), 10339 tests 
executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[columnstats_part_coltype]
 (batchId=153)
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/4089/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/4089/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-4089/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 1 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12857484 - PreCommit-HIVE-Build

> DataSize stats don't seem correct in semijoin opt branch
> 
>
> Key: HIVE-16132
> URL: https://issues.apache.org/jira/browse/HIVE-16132
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Reporter: Deepak Jaiswal
>Assignee: Deepak Jaiswal
> Attachments: HIVE-16132.1.patch, HIVE-16132.2.patch, 
> HIVE-16132.3.patch, HIVE-16132.4.patch, HIVE-16132.5.patch, HIVE-16132.6.patch
>
>
> For the following operator tree snippet, the second Select is the start of a 
> semijoin optimization branch. Take a look at the Data size - it is the same 
> as the data size for its parent Select, even though the second select has 
> only a single bigint column in its projection (the parent has 2 columns). I 
> would expect the size to be 533328 (16 bytes * 3).
> Fixing this estimate may become important if we need to estimate the cost of 
> generating the min/max/bloomfilter.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Commented] (HIVE-16132) DataSize stats don't seem correct in semijoin opt branch

2017-03-11 Thread Hive QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-16132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15906131#comment-15906131
 ] 

Hive QA commented on HIVE-16132:




Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12857449/HIVE-16132.5.patch

{color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 1 failed/errored test(s), 10339 tests 
executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[dynamic_semijoin_reduction]
 (batchId=148)
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/4087/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/4087/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-4087/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 1 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12857449 - PreCommit-HIVE-Build

> DataSize stats don't seem correct in semijoin opt branch
> 
>
> Key: HIVE-16132
> URL: https://issues.apache.org/jira/browse/HIVE-16132
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Reporter: Deepak Jaiswal
>Assignee: Deepak Jaiswal
> Attachments: HIVE-16132.1.patch, HIVE-16132.2.patch, 
> HIVE-16132.3.patch, HIVE-16132.4.patch, HIVE-16132.5.patch
>
>
> For the following operator tree snippet, the second Select is the start of a 
> semijoin optimization branch. Take a look at the Data size - it is the same 
> as the data size for its parent Select, even though the second select has 
> only a single bigint column in its projection (the parent has 2 columns). I 
> would expect the size to be 533328 (16 bytes * 3).
> Fixing this estimate may become important if we need to estimate the cost of 
> generating the min/max/bloomfilter.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Commented] (HIVE-16132) DataSize stats don't seem correct in semijoin opt branch

2017-03-10 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-16132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15905955#comment-15905955
 ] 

Gunther Hagleitner commented on HIVE-16132:
---

+1

> DataSize stats don't seem correct in semijoin opt branch
> 
>
> Key: HIVE-16132
> URL: https://issues.apache.org/jira/browse/HIVE-16132
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Reporter: Deepak Jaiswal
>Assignee: Deepak Jaiswal
> Attachments: HIVE-16132.1.patch, HIVE-16132.2.patch, 
> HIVE-16132.3.patch, HIVE-16132.4.patch
>
>
> For the following operator tree snippet, the second Select is the start of a 
> semijoin optimization branch. Take a look at the Data size - it is the same 
> as the data size for its parent Select, even though the second select has 
> only a single bigint column in its projection (the parent has 2 columns). I 
> would expect the size to be 533328 (16 bytes * 3).
> Fixing this estimate may become important if we need to estimate the cost of 
> generating the min/max/bloomfilter.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Commented] (HIVE-16132) DataSize stats don't seem correct in semijoin opt branch

2017-03-10 Thread Deepak Jaiswal (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-16132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15905772#comment-15905772
 ] 

Deepak Jaiswal commented on HIVE-16132:
---

[~hagleitn] Can you please review?

https://reviews.apache.org/r/57391/

> DataSize stats don't seem correct in semijoin opt branch
> 
>
> Key: HIVE-16132
> URL: https://issues.apache.org/jira/browse/HIVE-16132
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Reporter: Deepak Jaiswal
>Assignee: Deepak Jaiswal
> Attachments: HIVE-16132.1.patch, HIVE-16132.2.patch, 
> HIVE-16132.3.patch, HIVE-16132.4.patch
>
>
> For the following operator tree snippet, the second Select is the start of a 
> semijoin optimization branch. Take a look at the Data size - it is the same 
> as the data size for its parent Select, even though the second select has 
> only a single bigint column in its projection (the parent has 2 columns). I 
> would expect the size to be 533328 (16 bytes * 3).
> Fixing this estimate may become important if we need to estimate the cost of 
> generating the min/max/bloomfilter.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Commented] (HIVE-16132) DataSize stats don't seem correct in semijoin opt branch

2017-03-10 Thread Hive QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-16132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15905496#comment-15905496
 ] 

Hive QA commented on HIVE-16132:




Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12857348/HIVE-16132.4.patch

{color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 1 failed/errored test(s), 10336 tests 
executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[dynamic_semijoin_reduction]
 (batchId=148)
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/4072/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/4072/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-4072/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 1 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12857348 - PreCommit-HIVE-Build

> DataSize stats don't seem correct in semijoin opt branch
> 
>
> Key: HIVE-16132
> URL: https://issues.apache.org/jira/browse/HIVE-16132
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Reporter: Deepak Jaiswal
>Assignee: Deepak Jaiswal
> Attachments: HIVE-16132.1.patch, HIVE-16132.2.patch, 
> HIVE-16132.3.patch, HIVE-16132.4.patch
>
>
> For the following operator tree snippet, the second Select is the start of a 
> semijoin optimization branch. Take a look at the Data size - it is the same 
> as the data size for its parent Select, even though the second select has 
> only a single bigint column in its projection (the parent has 2 columns). I 
> would expect the size to be 533328 (16 bytes * 3).
> Fixing this estimate may become important if we need to estimate the cost of 
> generating the min/max/bloomfilter.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Commented] (HIVE-16132) DataSize stats don't seem correct in semijoin opt branch

2017-03-09 Thread Hive QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-16132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15904266#comment-15904266
 ] 

Hive QA commented on HIVE-16132:




Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12857117/HIVE-16132.2.patch

{color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 5 failed/errored test(s), 10331 tests 
executed
*Failed tests:*
{noformat}
TestHs2Hooks - did not produce a TEST-*.xml file (likely timed out) 
(batchId=210)
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[dynamic_semijoin_reduction]
 (batchId=148)
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[mergejoin] 
(batchId=151)
org.apache.hadoop.hive.cli.TestSparkNegativeCliDriver.org.apache.hadoop.hive.cli.TestSparkNegativeCliDriver
 (batchId=232)
org.apache.hive.service.server.TestHS2HttpServer.testContextRootUrlRewrite 
(batchId=187)
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/4058/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/4058/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-4058/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 5 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12857117 - PreCommit-HIVE-Build

> DataSize stats don't seem correct in semijoin opt branch
> 
>
> Key: HIVE-16132
> URL: https://issues.apache.org/jira/browse/HIVE-16132
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Reporter: Deepak Jaiswal
>Assignee: Deepak Jaiswal
> Attachments: HIVE-16132.1.patch, HIVE-16132.2.patch
>
>
> For the following operator tree snippet, the second Select is the start of a 
> semijoin optimization branch. Take a look at the Data size - it is the same 
> as the data size for its parent Select, even though the second select has 
> only a single bigint column in its projection (the parent has 2 columns). I 
> would expect the size to be 533328 (16 bytes * 3).
> Fixing this estimate may become important if we need to estimate the cost of 
> generating the min/max/bloomfilter.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Commented] (HIVE-16132) DataSize stats don't seem correct in semijoin opt branch

2017-03-07 Thread Deepak Jaiswal (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-16132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15899915#comment-15899915
 ] 

Deepak Jaiswal commented on HIVE-16132:
---

https://reviews.apache.org/r/57391/

> DataSize stats don't seem correct in semijoin opt branch
> 
>
> Key: HIVE-16132
> URL: https://issues.apache.org/jira/browse/HIVE-16132
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Reporter: Deepak Jaiswal
>Assignee: Deepak Jaiswal
> Attachments: HIVE-16132.1.patch
>
>
> For the following operator tree snippet, the second Select is the start of a 
> semijoin optimization branch. Take a look at the Data size - it is the same 
> as the data size for its parent Select, even though the second select has 
> only a single bigint column in its projection (the parent has 2 columns). I 
> would expect the size to be 533328 (16 bytes * 3).
> Fixing this estimate may become important if we need to estimate the cost of 
> generating the min/max/bloomfilter.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Commented] (HIVE-16132) DataSize stats don't seem correct in semijoin opt branch

2017-03-07 Thread Hive QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-16132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15898994#comment-15898994
 ] 

Hive QA commented on HIVE-16132:




Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12856399/HIVE-16132.1.patch

{color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 6 failed/errored test(s), 10328 tests 
executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestBeeLineDriver.testCliDriver[escape_comments] 
(batchId=229)
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[dynamic_semijoin_reduction_3]
 (batchId=152)
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[mergejoin] 
(batchId=150)
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[schema_evol_text_vec_table]
 (batchId=147)
org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[vector_between_in] 
(batchId=119)
org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[vector_count_distinct]
 (batchId=106)
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/3985/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/3985/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-3985/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 6 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12856399 - PreCommit-HIVE-Build

> DataSize stats don't seem correct in semijoin opt branch
> 
>
> Key: HIVE-16132
> URL: https://issues.apache.org/jira/browse/HIVE-16132
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Reporter: Deepak Jaiswal
>Assignee: Deepak Jaiswal
> Attachments: HIVE-16132.1.patch
>
>
> For the following operator tree snippet, the second Select is the start of a 
> semijoin optimization branch. Take a look at the Data size - it is the same 
> as the data size for its parent Select, even though the second select has 
> only a single bigint column in its projection (the parent has 2 columns). I 
> would expect the size to be 533328 (16 bytes * 3).
> Fixing this estimate may become important if we need to estimate the cost of 
> generating the min/max/bloomfilter.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)