[jira] [Commented] (HIVE-2457) Files in Avro-backed Hive tables do not have a .avro extension

2011-09-23 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13113579#comment-13113579
 ] 

jirapos...@reviews.apache.org commented on HIVE-2457:
-



bq.  On 2011-09-21 00:26:28, Carl Steinbach wrote:
bq.   trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java, line 892
bq.   https://reviews.apache.org/r/1989/diff/1/?file=2#file2line892
bq.  
bq.   Please add this configuration property to HiveConf and hive-default.xml
bq.  
bq.  Tom White wrote:
bq.  Does Hive have the concept of private configuration properties? This is one that would be set by SerDes, not by users, which is why I didn't add it to HiveConf/hive-default.xml.
bq.  
bq.  Carl Steinbach wrote:
bq.  No, it doesn't, but it should. I'll file a JIRA. In the meantime this property should still be included in hive-default and HiveConf.

OK, I added the property to hive-default and HiveConf in the latest patch.


- Tom


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1989/#review1982
---


On 2011-09-20 22:28:53, Carl Steinbach wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1989/
bq.  ---
bq.  
bq.  (Updated 2011-09-20 22:28:53)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Review for HIVE-2457
bq.  
bq.  
bq.  This addresses bug HIVE-2457.
bq.  https://issues.apache.org/jira/browse/HIVE-2457
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.    trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FileSinkOperator.java 1173340 
bq.    trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java 1173340 
bq.    trunk/ql/src/java/org/apache/hadoop/hive/ql/io/HiveFileFormatUtils.java 1173340 
bq.    trunk/ql/src/test/org/apache/hadoop/hive/ql/exec/TestUtilities.java PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1989/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Carl
bq.  
bq.



 Files in Avro-backed Hive tables do not have a .avro extension
 

 Key: HIVE-2457
 URL: https://issues.apache.org/jira/browse/HIVE-2457
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor, Serializers/Deserializers
Reporter: Tom White
Assignee: Tom White
 Attachments: HIVE-2457.patch, HIVE-2457.patch


 When using the Avro SerDe (see HIVE-895, https://github.com/jghoman/haivvreo) 
 the files created for an Avro table do not have a .avro extension, which 
 causes problems for tools like Avro MapReduce or Sqoop which expect the 
 extension. 
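
 As a rough illustration of the behaviour being asked for, here is a minimal
 sketch that appends a configured extension to an output file name. It is not
 the code from the attached patch: the class OutputFileNames, the property key
 example.output.file.extension, and the use of java.util.Properties in place of
 Hive's configuration object are all assumptions made to keep the example
 self-contained.

```java
import java.util.Properties;

/** Hypothetical helper; not the code from the HIVE-2457 patch. */
final class OutputFileNames {
    // Assumed property name, used for illustration only.
    static final String EXTENSION_KEY = "example.output.file.extension";

    private OutputFileNames() {}

    /** Appends the configured extension to an output file name, if one is set. */
    static String withExtension(String baseName, Properties conf) {
        String ext = conf.getProperty(EXTENSION_KEY);
        if (ext == null || ext.isEmpty()) {
            return baseName; // no extension configured: keep the name unchanged
        }
        // Accept both "avro" and ".avro" as configuration values.
        String suffix = ext.startsWith(".") ? ext : "." + ext;
        // Avoid doubling the suffix if the name already carries it.
        return baseName.endsWith(suffix) ? baseName : baseName + suffix;
    }
}
```

 In this sketch a SerDe (or a user) would set the property to ".avro", and a
 writer would pass each task's file name through withExtension before creating
 the file, so downstream tools that look for the extension can find it.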

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2450) move lock retry logic into ZooKeeperHiveLockManager

2011-09-23 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13113769#comment-13113769
 ] 

jirapos...@reviews.apache.org commented on HIVE-2450:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1950/
---

(Updated 2011-09-23 21:10:26.466632)


Review request for hive and Ning Zhang.


Changes
---

update comment in hive-default.xml


Summary
---

move lock retry logic into ZooKeeperHiveLockManager


This addresses bug HIVE-2450.
https://issues.apache.org/jira/browse/HIVE-2450


Diffs (updated)
-

  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1171255 
  trunk/conf/hive-default.xml 1171255 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java 1171255 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/lockmgr/HiveLockObject.java 1171255 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/lockmgr/zookeeper/ZooKeeperHiveLockManager.java 1171255 

Diff: https://reviews.apache.org/r/1950/diff


Testing
---

will run tests locally


Thanks,

Yongqiang



 move lock retry logic into ZooKeeperHiveLockManager
 ---

 Key: HIVE-2450
 URL: https://issues.apache.org/jira/browse/HIVE-2450
 Project: Hive
  Issue Type: Improvement
Reporter: He Yongqiang
Assignee: He Yongqiang
 Attachments: HIVE-2450.1.patch, HIVE-2450.2.patch








[jira] [Commented] (HIVE-2450) move lock retry logic into ZooKeeperHiveLockManager

2011-09-23 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13113857#comment-13113857
 ] 

jirapos...@reviews.apache.org commented on HIVE-2450:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1950/#review2061
---


Could you explain why we want the retry logic down inside of the 
zookeeper-specific implementation?  It seems to me that having it outside is 
better, since then it doesn't have to be reimplemented in other lock manager 
implementations as they are added.


trunk/ql/src/java/org/apache/hadoop/hive/ql/lockmgr/zookeeper/ZooKeeperHiveLockManager.java
https://reviews.apache.org/r/1950/#comment4643

quorumServers is not used by this method...why has it been added here?



trunk/ql/src/java/org/apache/hadoop/hive/ql/lockmgr/zookeeper/ZooKeeperHiveLockManager.java
https://reviews.apache.org/r/1950/#comment4642

But don't we still want to rethrow eventually out of this method?  Here you 
are squelching the exception completely.
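
To make the two points above concrete, here is a minimal sketch of retry logic
that lives outside any particular lock manager and rethrows the last failure
once the attempts are exhausted. The class LockRetry and its method are
hypothetical names, not part of the patch under review, and the Callable stands
in for whatever lock call a given manager exposes.

```java
import java.util.concurrent.Callable;

/** Hypothetical, lock-manager-agnostic retry helper; names are illustrative only. */
final class LockRetry {
    private LockRetry() {}

    /**
     * Runs the attempt up to maxAttempts times, sleeping between failures.
     * If every attempt fails, the last exception is rethrown rather than swallowed.
     */
    static <T> T withRetries(Callable<T> attempt, int maxAttempts, long sleepMillis)
            throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        Exception last = null;
        for (int i = 1; i <= maxAttempts; i++) {
            try {
                return attempt.call();
            } catch (Exception e) {
                last = e; // remember the failure so it can be surfaced later
                if (i < maxAttempts) {
                    Thread.sleep(sleepMillis);
                }
            }
        }
        throw last; // all attempts failed: propagate instead of squelching
    }
}
```

Because the helper only depends on Callable, a caller could wrap the lock call
of any lock manager implementation with it, which is the reuse argument made in
the first comment.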


- John





 move lock retry logic into ZooKeeperHiveLockManager
 ---

 Key: HIVE-2450
 URL: https://issues.apache.org/jira/browse/HIVE-2450
 Project: Hive
  Issue Type: Improvement
Reporter: He Yongqiang
Assignee: He Yongqiang
 Attachments: HIVE-2450.1.patch, HIVE-2450.2.patch








[jira] [Commented] (HIVE-2181) Clean up the scratch.dir (tmp/hive-root) while restarting Hive server.

2011-09-22 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13112796#comment-13112796
 ] 

jirapos...@reviews.apache.org commented on HIVE-2181:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1481/
---

(Updated 2011-09-22 18:26:19.812602)


Review request for hive.


Changes
---

Removed server startup from the test cases and added unit tests for 
cleanUpScratchDir().


Summary
---

Currently, queries leave their map outputs under scratch.dir after execution. 
If the Hive server is stopped, there is no need to keep the stopped server's 
map outputs, so the scratch.dir can be cleared while starting the server. This 
helps reduce disk usage.

Implemented a cleanup method in HiveServer; it is triggered based on the value 
of the property hive.start.cleanup.scrachdir
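
For readers who want a picture of what such a startup hook could look like,
here is a minimal sketch. It is not the method from this patch: the class
ScratchDirCleaner is a hypothetical name, the boolean argument stands in for
the property lookup, and it uses java.nio.file on a local directory instead of
the Hadoop FileSystem API purely to stay self-contained.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

/** Hypothetical startup hook; not the HiveServer code from the patch. */
final class ScratchDirCleaner {
    private ScratchDirCleaner() {}

    /**
     * Deletes everything under scratchDir when cleanup is enabled, leaving the
     * directory itself in place. Returns the number of entries removed.
     */
    static int cleanUp(Path scratchDir, boolean cleanupEnabled) {
        if (!cleanupEnabled || !Files.isDirectory(scratchDir)) {
            return 0; // feature switched off, or nothing to clean
        }
        try (Stream<Path> walk = Files.walk(scratchDir)) {
            // Deepest paths first, so children are deleted before their parents.
            List<Path> entries = walk
                    .filter(p -> !p.equals(scratchDir))
                    .sorted(Comparator.reverseOrder())
                    .collect(Collectors.toList());
            for (Path p : entries) {
                Files.delete(p);
            }
            return entries.size();
        } catch (IOException e) {
            throw new UncheckedIOException("Failed to clean scratch dir " + scratchDir, e);
        }
    }
}
```

Gating the cleanup on a flag keeps the default behaviour unchanged, which
matters if several servers share one scratch directory.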


This addresses bug HIVE-2181.
https://issues.apache.org/jira/browse/HIVE-2181


Diffs (updated)
-

  trunk/common/src/java/org/apache/hadoop/hive/common/ServerUtils.java PRE-CREATION 
  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1174277 
  trunk/conf/hive-default.xml 1174277 
  trunk/service/src/java/org/apache/hadoop/hive/service/HiveServer.java 1174277 
  trunk/service/src/test/org/apache/hadoop/hive/service/TestHiveServer.java 1174277 

Diff: https://reviews.apache.org/r/1481/diff


Testing
---

Added test case for this scenario.


Thanks,

chinna



  Clean up the scratch.dir (tmp/hive-root) while restarting Hive server. 
 

 Key: HIVE-2181
 URL: https://issues.apache.org/jira/browse/HIVE-2181
 Project: Hive
  Issue Type: Bug
  Components: Server Infrastructure
Affects Versions: 0.8.0
 Environment: Suse linux, Hadoop 20.1, Hive 0.8
Reporter: sanoj mathew
Assignee: Chinna Rao Lalam
Priority: Minor
 Attachments: HIVE-2181.1.patch, HIVE-2181.2.patch, HIVE-2181.3.patch, 
 HIVE-2181.4.patch, HIVE-2181.5.patch, HIVE-2181.6.patch, HIVE-2181.patch

   Original Estimate: 48h
  Remaining Estimate: 48h

 Currently, queries leave their map outputs under scratch.dir after execution. 
 If the Hive server is stopped, there is no need to keep the stopped server's 
 map outputs, so the scratch.dir can be cleared while starting the server. This 
 helps reduce disk usage.





[jira] [Commented] (HIVE-2457) Files in Avro-backed Hive tables do not have a .avro extension

2011-09-22 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13112826#comment-13112826
 ] 

jirapos...@reviews.apache.org commented on HIVE-2457:
-



bq.  On 2011-09-21 00:26:28, Carl Steinbach wrote:
bq.   trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java, line 892
bq.   https://reviews.apache.org/r/1989/diff/1/?file=2#file2line892
bq.  
bq.   Please add this configuration property to HiveConf and hive-default.xml
bq.  
bq.  Tom White wrote:
bq.  Does Hive have the concept of private configuration properties? This is one that would be set by SerDes, not by users, which is why I didn't add it to HiveConf/hive-default.xml.

No, it doesn't, but it should. I'll file a JIRA. In the meantime this property 
should still be included in hive-default and HiveConf.


- Carl


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1989/#review1982
---





 Files in Avro-backed Hive tables do not have a .avro extension
 

 Key: HIVE-2457
 URL: https://issues.apache.org/jira/browse/HIVE-2457
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor, Serializers/Deserializers
Reporter: Tom White
Assignee: Tom White
 Attachments: HIVE-2457.patch, HIVE-2457.patch


 When using the Avro SerDe (see HIVE-895, https://github.com/jghoman/haivvreo) 
 the files created for an Avro table do not have a .avro extension, which 
 causes problems for tools like Avro MapReduce or Sqoop which expect the 
 extension. 





[jira] [Commented] (HIVE-2461) Add method to PerfLogger to perform cleanup/final steps.

2011-09-22 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13112850#comment-13112850
 ] 

jirapos...@reviews.apache.org commented on HIVE-2461:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/2023/
---

Review request for hive, Yongqiang He and Ning Zhang.


Summary
---

I added a method called finishPerfLogger, which is called at the end of the run 
method in Driver to allow PerfLogger subclasses to perform any cleanup tasks or 
logging that requires all the perf stats to have been calculated.  It takes as 
parameters a Log, to be consistent with the other methods in PerfLogger, as 
well as the QueryPlan from Driver in order to allow information such as the 
query ID to be logged.
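
To illustrate the shape of the hook, here is a minimal sketch with simplified
stand-ins. SimplePerfLogger, BufferingPerfLogger and SimpleDriver are
hypothetical classes, and the hook takes a plain query ID string rather than
the Log and QueryPlan parameters described above, so this is not the code in
the patch.

```java
import java.util.ArrayList;
import java.util.List;

/** Simplified stand-in for PerfLogger; the real class and signatures differ. */
class SimplePerfLogger {
    /** Hook called once at the end of a run; the default does nothing. */
    void finish(String queryId) {
        // no-op by default so existing subclasses keep working
    }
}

/** Hypothetical subclass that emits its collected timings when the run ends. */
class BufferingPerfLogger extends SimplePerfLogger {
    private final List<String> pending = new ArrayList<>();
    final List<String> flushed = new ArrayList<>();

    void record(String method, long millis) {
        pending.add(method + "=" + millis + "ms");
    }

    @Override
    void finish(String queryId) {
        // All stats are known at this point, so emit them in one batch.
        for (String entry : pending) {
            flushed.add(queryId + " " + entry);
        }
        pending.clear();
    }
}

/** Stand-in for the driver: runs the query, then invokes the hook. */
final class SimpleDriver {
    private SimpleDriver() {}

    static void run(String queryId, SimplePerfLogger perfLogger, Runnable query) {
        query.run();
        perfLogger.finish(queryId); // last step of the run
    }
}
```

A subclass that holds a database connection could close it in the same hook,
which is the other use case mentioned in the issue description.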


This addresses bug HIVE-2461.
https://issues.apache.org/jira/browse/HIVE-2461


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java 1173881 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/log/PerfLogger.java 1173881 

Diff: https://reviews.apache.org/r/2023/diff


Testing
---

I created a subclass of PerfLogger which implemented the finishPerfLogger 
method, and verified it was successfully called and executed.

I also ran some of the unit tests to verify nothing was broken.


Thanks,

Kevin



 Add method to PerfLogger to perform cleanup/final steps.
 

 Key: HIVE-2461
 URL: https://issues.apache.org/jira/browse/HIVE-2461
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2461.1.patch.txt


 I think a method added to PerfLogger to perform cleanup/final steps would be 
 very useful.  For example, it could be used to close any database connections 
 created as part of a PerfLogger subclass, or to perform logging that requires 
 all perf values to first be calculated.





[jira] [Commented] (HIVE-2462) make INNER a non-reserved keyword

2011-09-22 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13112860#comment-13112860
 ] 

jirapos...@reviews.apache.org commented on HIVE-2462:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/2024/
---

Review request for hive.


Summary
---

HIVE-2462


This addresses bug HIVE-2462.
https://issues.apache.org/jira/browse/HIVE-2462


Diffs
-

  
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g 1174316 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/innerjoin.q 1174316 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/innerjoin.q.out 1174316 

Diff: https://reviews.apache.org/r/2024/diff


Testing
---


Thanks,

John



 make INNER a non-reserved keyword
 -

 Key: HIVE-2462
 URL: https://issues.apache.org/jira/browse/HIVE-2462
 Project: Hive
  Issue Type: Improvement
Reporter: John Sichi
Assignee: John Sichi
 Fix For: 0.9.0

 Attachments: HIVE-2462.1.patch


 HIVE-2191 introduced the INNER keyword as reserved, which breaks backwards 
 compatibility for queries which were using it as an identifier.  This patch 
 addresses that.





[jira] [Commented] (HIVE-2442) Metastore upgrade script and schema DDL for Hive 0.8.0

2011-09-22 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13112981#comment-13112981
 ] 

jirapos...@reviews.apache.org commented on HIVE-2442:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/2026/
---

Review request for hive.


Summary
---

Metastore upgrade scripts and schema DDL for Hive 0.8.0 and 0.9.0-dev.
The 0.8.0 version of this patch does not include the 0.9.0 schema DDL
and 0.8.0-to-0.9.0 upgrade script.


This addresses bug HIVE-2442.
https://issues.apache.org/jira/browse/HIVE-2442


Diffs
-

  metastore/scripts/upgrade/derby/hive-schema-0.8.0.derby.sql PRE-CREATION 
  metastore/scripts/upgrade/derby/hive-schema-0.9.0.derby.sql PRE-CREATION 
  metastore/scripts/upgrade/derby/upgrade-0.7.0-to-0.8.0.derby.sql PRE-CREATION 
  metastore/scripts/upgrade/derby/upgrade-0.8.0-to-0.9.0.derby.sql PRE-CREATION 
  metastore/scripts/upgrade/mysql/008-HIVE-2246.mysql.sql 2e3821d 
  metastore/scripts/upgrade/mysql/hive-schema-0.8.0.mysql.sql PRE-CREATION 
  metastore/scripts/upgrade/mysql/hive-schema-0.9.0.mysql.sql PRE-CREATION 
  metastore/scripts/upgrade/mysql/upgrade-0.7.0-to-0.8.0.mysql.sql PRE-CREATION 
  metastore/scripts/upgrade/mysql/upgrade-0.8.0-to-0.9.0.mysql.sql PRE-CREATION 

Diff: https://reviews.apache.org/r/2026/diff


Testing
---


Thanks,

Carl



 Metastore upgrade script and schema DDL for Hive 0.8.0
 --

 Key: HIVE-2442
 URL: https://issues.apache.org/jira/browse/HIVE-2442
 Project: Hive
  Issue Type: Task
  Components: Metastore
Reporter: Carl Steinbach
Assignee: Carl Steinbach
Priority: Blocker
 Fix For: 0.8.0

 Attachments: HIVE-2442-trunk.1.patch.txt








[jira] [Commented] (HIVE-2461) Add method to PerfLogger to perform cleanup/final steps.

2011-09-22 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13113042#comment-13113042
 ] 

jirapos...@reviews.apache.org commented on HIVE-2461:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/2023/
---

(Updated 2011-09-22 23:54:37.554390)


Review request for hive, Yongqiang He and Ning Zhang.


Summary (updated)
---

I added a method called finishPerfLogger, which is called at the end of the run 
method in Driver to allow PerfLogger subclasses to perform any cleanup tasks or 
logging that requires all the perf stats to have been calculated.  It takes as 
parameters a Log, to be consistent with the other methods in PerfLogger, as 
well as the QueryPlan from Driver in order to allow information such as the 
query ID to be logged.

I should note that finishPerfLogger is not guaranteed to run.  This was my 
intention.
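
One way to read "not guaranteed to run", assuming the hook is simply the last
statement of the run method, is that an exception thrown earlier skips it. The
sketch below contrasts that placement with a try/finally placement. The class
HookPlacement and its methods are hypothetical and only illustrate the general
Java behaviour, not the Driver code.

```java
/** Illustrative only: contrasts a trailing hook call with a try/finally call. */
final class HookPlacement {
    private HookPlacement() {}

    /** Hook runs only when the work completes normally. */
    static void runWithTrailingHook(Runnable work, Runnable hook) {
        work.run();
        hook.run(); // skipped if work.run() throws
    }

    /** Hook runs whether or not the work throws. */
    static void runWithFinallyHook(Runnable work, Runnable hook) {
        try {
            work.run();
        } finally {
            hook.run();
        }
    }
}
```

Subclasses that must release resources unconditionally would need the second
form, or a separate cleanup path, under this reading.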


This addresses bug HIVE-2461.
https://issues.apache.org/jira/browse/HIVE-2461


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java 1173881 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/log/PerfLogger.java 1173881 

Diff: https://reviews.apache.org/r/2023/diff


Testing
---

I created a subclass of PerfLogger which implemented the finishPerfLogger 
method, and verified it was successfully called and executed.

I also ran some of the unit tests to verify nothing was broken.


Thanks,

Kevin



 Add method to PerfLogger to perform cleanup/final steps.
 

 Key: HIVE-2461
 URL: https://issues.apache.org/jira/browse/HIVE-2461
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2461.1.patch.txt


 I think a method added to PerfLogger to perform cleanup/final steps would be 
 very useful.  For example, it could be used to close any database connections 
 created as part of a PerfLogger subclass, or to perform logging that requires 
 all perf values to first be calculated.





[jira] [Commented] (HIVE-2461) Add method to PerfLogger to perform cleanup/final steps.

2011-09-22 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13113047#comment-13113047
 ] 

jirapos...@reviews.apache.org commented on HIVE-2461:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/2023/#review2030
---



trunk/ql/src/java/org/apache/hadoop/hive/ql/log/PerfLogger.java
https://reviews.apache.org/r/2023/#comment4579

One nitpick: do you think calling this function 'close' would be clearer? We 
already have a PerfLogEnd().


- Yongqiang





 Add method to PerfLogger to perform cleanup/final steps.
 

 Key: HIVE-2461
 URL: https://issues.apache.org/jira/browse/HIVE-2461
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2461.1.patch.txt


 I think a method added to PerfLogger to perform cleanup/final steps would be 
 very useful.  For example, it could be used to close any database connections 
 created as part of a PerfLogger subclass, or to perform logging that requires 
 all perf values to first be calculated.





[jira] [Commented] (HIVE-2461) Add method to PerfLogger to perform cleanup/final steps.

2011-09-22 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13113048#comment-13113048
 ] 

jirapos...@reviews.apache.org commented on HIVE-2461:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/2023/
---

(Updated 2011-09-23 00:08:56.319090)


Review request for hive, Yongqiang He and Ning Zhang.


Changes
---

Thanks Yongqiang.  I didn't like the name either, but the only other one I 
could come up with was finalize which I didn't want to use for obvious reasons.


Summary
---

I added a method called finishPerfLogger, which is called at the end of the run 
method in Driver to allow PerfLogger subclasses to perform any cleanup tasks or 
logging that requires all the perf stats to have been calculated.  It takes as 
parameters a Log, to be consistent with the other methods in PerfLogger, as 
well as the QueryPlan from Driver in order to allow information such as the 
query ID to be logged.

I should note that finishPerfLogger is not guaranteed to run.  This was my 
intention.


This addresses bug HIVE-2461.
https://issues.apache.org/jira/browse/HIVE-2461


Diffs (updated)
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java 1173881 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/log/PerfLogger.java 1173881 

Diff: https://reviews.apache.org/r/2023/diff


Testing
---

I created a subclass of PerfLogger which implemented the finishPerfLogger 
method, and verified it was successfully called and executed.

I also ran some of the unit tests to verify nothing was broken.


Thanks,

Kevin



 Add method to PerfLogger to perform cleanup/final steps.
 

 Key: HIVE-2461
 URL: https://issues.apache.org/jira/browse/HIVE-2461
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2461.1.patch.txt, HIVE-2461.2.patch.txt


 I think a method added to PerfLogger to perform cleanup/final steps would be 
 very useful.  For example, it could be used to close any database connections 
 created as part of a PerfLogger subclass, or to perform logging that requires 
 all perf values to first be calculated.





[jira] [Commented] (HIVE-2181) Clean up the scratch.dir (tmp/hive-root) while restarting Hive server.

2011-09-21 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13109494#comment-13109494
 ] 

jirapos...@reviews.apache.org commented on HIVE-2181:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1481/
---

(Updated 2011-09-21 13:30:22.283181)


Review request for hive.


Changes
---

spelling mistake corrected


Summary
---

Currently, queries leave their map outputs under scratch.dir after execution. 
If the Hive server is stopped, there is no need to keep the stopped server's 
map outputs, so the scratch.dir can be cleared while starting the server. This 
helps reduce disk usage.

Implemented a cleanup method in HiveServer; it is triggered based on the value 
of the property hive.start.cleanup.scrachdir


This addresses bug HIVE-2181.
https://issues.apache.org/jira/browse/HIVE-2181


Diffs (updated)
-

  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1173539 
  trunk/conf/hive-default.xml 1173539 
  trunk/service/src/java/org/apache/hadoop/hive/service/HiveServer.java 1173539 
  trunk/service/src/test/org/apache/hadoop/hive/service/TestHiveServer.java 1173539 

Diff: https://reviews.apache.org/r/1481/diff


Testing
---

Added test case for this scenario.


Thanks,

chinna



  Clean up the scratch.dir (tmp/hive-root) while restarting Hive server. 
 

 Key: HIVE-2181
 URL: https://issues.apache.org/jira/browse/HIVE-2181
 Project: Hive
  Issue Type: Bug
  Components: Server Infrastructure
Affects Versions: 0.8.0
 Environment: Suse linux, Hadoop 20.1, Hive 0.8
Reporter: sanoj mathew
Assignee: Chinna Rao Lalam
Priority: Minor
 Attachments: HIVE-2181.1.patch, HIVE-2181.2.patch, HIVE-2181.3.patch, 
 HIVE-2181.4.patch, HIVE-2181.5.patch, HIVE-2181.patch

   Original Estimate: 48h
  Remaining Estimate: 48h

 Currently, queries leave their map outputs under scratch.dir after execution. 
 If the Hive server is stopped, there is no need to keep the stopped server's 
 map outputs, so the scratch.dir can be cleared while starting the server. This 
 helps reduce disk usage.





[jira] [Commented] (HIVE-2178) Log related Check style Comments fixes

2011-09-21 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13109498#comment-13109498
 ] 

jirapos...@reviews.apache.org commented on HIVE-2178:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1978/
---

(Updated 2011-09-21 13:36:23.594198)


Review request for hive, Carl Steinbach and John Sichi.


Changes
---

Thrift-generated code changes are excluded from the patch. I will follow up on 
the Thrift-generated code changes separately.


Summary
---

Fixed log-related Checkstyle comments, such as including the root exception and 
adding messages.
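
As an illustration of the kind of fix described here, the sketch below logs a
failure with both a message and the exception object, and chains the cause when
rethrowing. The class ExceptionLogging and the message text are hypothetical,
and java.util.logging is used only to keep the example self-contained; it is
not taken from the patch.

```java
import java.io.IOException;
import java.util.logging.Level;
import java.util.logging.Logger;

/** Illustrative only; uses java.util.logging to stay self-contained. */
final class ExceptionLogging {
    private static final Logger LOG = Logger.getLogger(ExceptionLogging.class.getName());

    private ExceptionLogging() {}

    /** Wraps a low-level failure, keeping both a message and the root cause. */
    static RuntimeException wrap(String tableName, IOException cause) {
        String message = "Failed to describe table " + tableName;
        // Pass the exception itself so the stack trace is logged, not just getMessage().
        LOG.log(Level.SEVERE, message, cause);
        // Chain the cause so callers can still reach the original failure.
        return new RuntimeException(message, cause);
    }
}
```

The contrast is with code that logs only e.getMessage() or throws a new
exception without a cause, both of which lose the original stack trace.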


This addresses bug HIVE-2178.
https://issues.apache.org/jira/browse/HIVE-2178


Diffs (updated)
-

  trunk/jdbc/src/java/org/apache/hadoop/hive/jdbc/HiveConnection.java 1173507 
  trunk/jdbc/src/java/org/apache/hadoop/hive/jdbc/HiveDataSource.java 1173507 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java 1173507 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Partition.java 1173507 
  trunk/ql/src/test/results/clientnegative/describe_xpath1.q.out 1173507 
  trunk/ql/src/test/results/clientnegative/describe_xpath2.q.out 1173507 
  trunk/ql/src/test/results/clientnegative/describe_xpath3.q.out 1173507 
  trunk/ql/src/test/results/clientnegative/describe_xpath4.q.out 1173507 
  trunk/ql/src/test/results/clientnegative/lockneg4.q.out 1173507 
  trunk/ql/src/test/results/clientnegative/lockneg5.q.out 1173507 
  
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/dynamic_type/SimpleCharStream.java 1173507 
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/thrift/TBinarySortableProtocol.java 1173507 

Diff: https://reviews.apache.org/r/1978/diff


Testing
---

Ran all unit tests


Thanks,

chinna



 Log related Check style Comments fixes
 --

 Key: HIVE-2178
 URL: https://issues.apache.org/jira/browse/HIVE-2178
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.5.0, 0.8.0
 Environment: Hadoop 0.20.1, Hive0.8.0 and SUSE Linux Enterprise 
 Server 10 SP2 (i586) - Kernel 2.6.16.60-0.21-smp (5)
Reporter: Chinna Rao Lalam
Assignee: Chinna Rao Lalam
 Attachments: HIVE-2178.1.patch, HIVE-2178.2.patch, HIVE-2178.3.patch, 
 HIVE-2178.patch


 Fix Log related Check style Comments





[jira] [Commented] (HIVE-2178) Log related Check style Comments fixes

2011-09-21 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13109501#comment-13109501
 ] 

jirapos...@reviews.apache.org commented on HIVE-2178:
-



bq.  On 2011-09-20 18:12:16, Ashutosh Chauhan wrote:
bq.   trunk/metastore/src/gen/thrift/gen-javabean/org/apache/hadoop/hive/metastore/api/MetaException.java, lines 118-121
bq.   https://reviews.apache.org/r/1978/diff/1/?file=43792#file43792line118
bq.  
bq.   Same as above.

Thrift-generated code changes are excluded from the patch. I will follow up on 
the Thrift-generated code changes separately.


- chinna


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1978/#review1975
---


On 2011-09-21 13:36:23, chinna wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1978/
bq.  ---
bq.  
bq.  (Updated 2011-09-21 13:36:23)
bq.  
bq.  
bq.  Review request for hive, Carl Steinbach and John Sichi.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Fixed log-related Checkstyle comments, such as including the root exception
bq.  and adding messages.
bq.  
bq.  
bq.  This addresses bug HIVE-2178.
bq.  https://issues.apache.org/jira/browse/HIVE-2178
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/jdbc/src/java/org/apache/hadoop/hive/jdbc/HiveConnection.java 
1173507 
bq.trunk/jdbc/src/java/org/apache/hadoop/hive/jdbc/HiveDataSource.java 
1173507 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java 1173507 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Partition.java 
1173507 
bq.trunk/ql/src/test/results/clientnegative/describe_xpath1.q.out 1173507 
bq.trunk/ql/src/test/results/clientnegative/describe_xpath2.q.out 1173507 
bq.trunk/ql/src/test/results/clientnegative/describe_xpath3.q.out 1173507 
bq.trunk/ql/src/test/results/clientnegative/describe_xpath4.q.out 1173507 
bq.trunk/ql/src/test/results/clientnegative/lockneg4.q.out 1173507 
bq.trunk/ql/src/test/results/clientnegative/lockneg5.q.out 1173507 
bq.
trunk/serde/src/java/org/apache/hadoop/hive/serde2/dynamic_type/SimpleCharStream.java
 1173507 
bq.
trunk/serde/src/java/org/apache/hadoop/hive/serde2/thrift/TBinarySortableProtocol.java
 1173507 
bq.  
bq.  Diff: https://reviews.apache.org/r/1978/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Ran all unit tests
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  chinna
bq.  
bq.







[jira] [Commented] (HIVE-2206) add a new optimizer for query correlation discovery and optimization

2011-09-21 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13109753#comment-13109753
 ] 

jirapos...@reviews.apache.org commented on HIVE-2206:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/2001/
---

Review request for hive.


Summary
---

This optimizer exploits intra-query correlations and merges multiple correlated 
MapReduce jobs into one job.


This addresses bug HIVE-2206.
https://issues.apache.org/jira/browse/HIVE-2206


Diffs
-

  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1173271 
  
trunk/ql/src/gen/thrift/gen-javabean/org/apache/hadoop/hive/ql/plan/api/OperatorType.java
 1173271 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/CorrelationCompositeOperator.java
 PRE-CREATION 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/CorrelationDispatchOperator.java
 PRE-CREATION 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/CorrelationFakeReduceSinkOperator.java
 PRE-CREATION 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/CorrelationManualForwardOperator.java
 PRE-CREATION 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/CorrelationReduceSinkOperator.java
 PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/ExecReducer.java 1173271 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FunctionRegistry.java 
1173271 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Operator.java 1173271 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/OperatorFactory.java 1173271 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/CorrelationGenMRRedSink1.java
 PRE-CREATION 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/CorrelationOptimizer.java 
PRE-CREATION 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/CorrelationOptimizerUtils.java
 PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/GenMapRedUtils.java 
1173271 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/Optimizer.java 1173271 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ParseContext.java 1173271 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 
1173271 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CorrelationCompositeDesc.java 
PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CorrelationDispatchDesc.java 
PRE-CREATION 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CorrelationFakeReduceSinkDesc.java
 PRE-CREATION 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CorrelationManualForwardDesc.java
 PRE-CREATION 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CorrelationReduceSinkDesc.java 
PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/MapredWork.java 1173271 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFCountDistinct.java
 PRE-CREATION 
  trunk/ql/src/test/org/apache/hadoop/hive/ql/exec/TestExecDriver.java 1173271 
  trunk/ql/src/test/results/clientpositive/show_functions.q.out 1173271 
  trunk/ql/src/test/results/compiler/plan/groupby1.q.xml 1173271 
  trunk/ql/src/test/results/compiler/plan/groupby2.q.xml 1173271 
  trunk/ql/src/test/results/compiler/plan/groupby3.q.xml 1173271 
  trunk/ql/src/test/results/compiler/plan/groupby5.q.xml 1173271 

Diff: https://reviews.apache.org/r/2001/diff


Testing
---

Ran all unit tests


Thanks,

Yin



 add a new optimizer for query correlation discovery and optimization
 

 Key: HIVE-2206
 URL: https://issues.apache.org/jira/browse/HIVE-2206
 Project: Hive
  Issue Type: New Feature
Reporter: He Yongqiang
Assignee: Yin Huai
 Attachments: HIVE-2206.1.patch.txt, HIVE-2206.2.patch.txt, 
 HIVE-2206.3.patch.txt, HIVE-2206.4.patch.txt, HIVE-2206.5.patch.txt, Queries, 
 YSmartPatchForHive.patch, testQueries.q


 reference:
 http://www.cse.ohio-state.edu/hpcs/WWW/HTML/publications/papers/TR-11-7.pdf





[jira] [Commented] (HIVE-2178) Log related Check style Comments fixes

2011-09-20 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13108714#comment-13108714
 ] 

jirapos...@reviews.apache.org commented on HIVE-2178:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1978/
---

Review request for hive, Carl Steinbach and John Sichi.


Summary
---

Fixed log-related Checkstyle comments, such as including the root exception and 
adding messages.
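
As an illustration of the kind of change the summary describes (logging the root exception and adding a descriptive message), here is a minimal sketch. It uses java.util.logging only so that it is self-contained; the class and method names are hypothetical and are not taken from the HIVE-2178 patch.

```java
import java.io.IOException;
import java.util.logging.Level;
import java.util.logging.Logger;

// Hypothetical example class; not part of the HIVE-2178 patch.
final class LogWithCauseExample {
  private static final Logger LOG =
      Logger.getLogger(LogWithCauseExample.class.getName());

  // Wraps a low-level failure, keeping the root exception as the cause
  // and adding a message that says what was being attempted.
  static RuntimeException wrap(String action, Exception root) {
    String message = "Failed to " + action + ": " + root.getMessage();
    // Pass the exception itself so the stack trace is logged, not just its text.
    LOG.log(Level.SEVERE, message, root);
    return new RuntimeException(message, root);
  }

  public static void main(String[] args) {
    RuntimeException e =
        wrap("open connection", new IOException("connection refused"));
    System.out.println(e.getMessage());
  }
}
```

The point of the pattern is that the original exception travels with both the log record and the rethrown exception, so the stack trace of the underlying failure is not lost.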


This addresses bug HIVE-2178.
https://issues.apache.org/jira/browse/HIVE-2178


Diffs
-

  trunk/jdbc/src/java/org/apache/hadoop/hive/jdbc/HiveConnection.java 1172979 
  trunk/jdbc/src/java/org/apache/hadoop/hive/jdbc/HiveDataSource.java 1172979 
  
trunk/metastore/src/gen/thrift/gen-javabean/org/apache/hadoop/hive/metastore/api/InvalidObjectException.java
 1172979 
  
trunk/metastore/src/gen/thrift/gen-javabean/org/apache/hadoop/hive/metastore/api/MetaException.java
 1172979 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 
1172979 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java 
1172979 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java 1172979 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Partition.java 1172979 
  trunk/ql/src/test/results/clientnegative/describe_xpath1.q.out 1172979 
  trunk/ql/src/test/results/clientnegative/describe_xpath2.q.out 1172979 
  trunk/ql/src/test/results/clientnegative/describe_xpath3.q.out 1172979 
  trunk/ql/src/test/results/clientnegative/describe_xpath4.q.out 1172979 
  trunk/ql/src/test/results/clientnegative/lockneg4.q.out 1172979 
  trunk/ql/src/test/results/clientnegative/lockneg5.q.out 1172979 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/dynamic_type/SimpleCharStream.java
 1172979 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/thrift/TBinarySortableProtocol.java
 1172979 

Diff: https://reviews.apache.org/r/1978/diff


Testing
---

Ran all unit tests


Thanks,

chinna



 Log related Check style Comments fixes
 --

 Key: HIVE-2178
 URL: https://issues.apache.org/jira/browse/HIVE-2178
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.5.0, 0.8.0
 Environment: Hadoop 0.20.1, Hive0.8.0 and SUSE Linux Enterprise 
 Server 10 SP2 (i586) - Kernel 2.6.16.60-0.21-smp (5)
Reporter: Chinna Rao Lalam
Assignee: Chinna Rao Lalam
 Attachments: HIVE-2178.1.patch, HIVE-2178.2.patch, HIVE-2178.patch


 Fix Log related Check style Comments





[jira] [Commented] (HIVE-2178) Log related Check style Comments fixes

2011-09-20 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13108879#comment-13108879
 ] 

jirapos...@reviews.apache.org commented on HIVE-2178:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1978/#review1975
---



trunk/metastore/src/gen/thrift/gen-javabean/org/apache/hadoop/hive/metastore/api/InvalidObjectException.java
https://reviews.apache.org/r/1978/#comment4465

This is generated code. You should not hand-edit it. If you find a need to 
edit it, it's probably a bug in Thrift; you can open a JIRA on that 
project.



trunk/metastore/src/gen/thrift/gen-javabean/org/apache/hadoop/hive/metastore/api/MetaException.java
https://reviews.apache.org/r/1978/#comment4466

Same as above.


- Ashutosh


On 2011-09-20 13:48:02, chinna wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1978/
bq.  ---
bq.  
bq.  (Updated 2011-09-20 13:48:02)
bq.  
bq.  
bq.  Review request for hive, Carl Steinbach and John Sichi.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Fixed log-related Checkstyle comments, such as including the root exception
bq.  and adding messages.
bq.  
bq.  
bq.  This addresses bug HIVE-2178.
bq.  https://issues.apache.org/jira/browse/HIVE-2178
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/jdbc/src/java/org/apache/hadoop/hive/jdbc/HiveConnection.java 
1172979 
bq.trunk/jdbc/src/java/org/apache/hadoop/hive/jdbc/HiveDataSource.java 
1172979 
bq.
trunk/metastore/src/gen/thrift/gen-javabean/org/apache/hadoop/hive/metastore/api/InvalidObjectException.java
 1172979 
bq.
trunk/metastore/src/gen/thrift/gen-javabean/org/apache/hadoop/hive/metastore/api/MetaException.java
 1172979 
bq.
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 
1172979 
bq.
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java 
1172979 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java 1172979 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Partition.java 
1172979 
bq.trunk/ql/src/test/results/clientnegative/describe_xpath1.q.out 1172979 
bq.trunk/ql/src/test/results/clientnegative/describe_xpath2.q.out 1172979 
bq.trunk/ql/src/test/results/clientnegative/describe_xpath3.q.out 1172979 
bq.trunk/ql/src/test/results/clientnegative/describe_xpath4.q.out 1172979 
bq.trunk/ql/src/test/results/clientnegative/lockneg4.q.out 1172979 
bq.trunk/ql/src/test/results/clientnegative/lockneg5.q.out 1172979 
bq.
trunk/serde/src/java/org/apache/hadoop/hive/serde2/dynamic_type/SimpleCharStream.java
 1172979 
bq.
trunk/serde/src/java/org/apache/hadoop/hive/serde2/thrift/TBinarySortableProtocol.java
 1172979 
bq.  
bq.  Diff: https://reviews.apache.org/r/1978/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Ran all unit tests
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  chinna
bq.  
bq.







[jira] [Commented] (HIVE-2181) Clean up the scratch.dir (tmp/hive-root) while restarting Hive server.

2011-09-20 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13108956#comment-13108956
 ] 

jirapos...@reviews.apache.org commented on HIVE-2181:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1481/#review1977
---



trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
https://reviews.apache.org/r/1481/#comment4467

SCRATCH needs a T in it (everywhere).


- John


On 2011-09-16 17:30:40, chinna wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1481/
bq.  ---
bq.  
bq.  (Updated 2011-09-16 17:30:40)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Currently, queries leave their map outputs under scratch.dir after execution.
bq.  If the Hive server is stopped, there is no need to keep the stopped server's
bq.  map outputs, so the scratch.dir can be cleared when the server starts. This
bq.  can improve disk usage.
bq.  
bq.  Implemented a cleanup method in HiveServer and it will trigger based on 
the property value hive.start.cleanup.scrachdir
bq.  
bq.  
bq.  This addresses bug HIVE-2181.
bq.  https://issues.apache.org/jira/browse/HIVE-2181
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/conf/hive-default.xml 1171488 
bq.trunk/service/src/java/org/apache/hadoop/hive/service/HiveServer.java 
1171488 
bq.
trunk/service/src/test/org/apache/hadoop/hive/service/TestHiveServer.java 
1171488 
bq.trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1171488 
bq.  
bq.  Diff: https://reviews.apache.org/r/1481/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Added test case for this scenario.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  chinna
bq.  
bq.



  Clean up the scratch.dir (tmp/hive-root) while restarting Hive server. 
 

 Key: HIVE-2181
 URL: https://issues.apache.org/jira/browse/HIVE-2181
 Project: Hive
  Issue Type: Bug
  Components: Server Infrastructure
Affects Versions: 0.8.0
 Environment: Suse linux, Hadoop 20.1, Hive 0.8
Reporter: sanoj mathew
Assignee: Chinna Rao Lalam
Priority: Minor
 Attachments: HIVE-2181.1.patch, HIVE-2181.2.patch, HIVE-2181.3.patch, 
 HIVE-2181.4.patch, HIVE-2181.patch

   Original Estimate: 48h
  Remaining Estimate: 48h

 Currently, queries leave their map outputs under scratch.dir after execution. 
 If the Hive server is stopped, there is no need to keep the stopped server's 
 map outputs, so the scratch.dir can be cleared when the server starts. This 
 can improve disk usage.





[jira] [Commented] (HIVE-2457) Files in Avro-backed Hive tables do not have a .avro extension

2011-09-20 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13109036#comment-13109036
 ] 

jirapos...@reviews.apache.org commented on HIVE-2457:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1989/
---

Review request for hive.


Summary
---

Review for HIVE-2457


This addresses bug HIVE-2457.
https://issues.apache.org/jira/browse/HIVE-2457


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FileSinkOperator.java 
1173340 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java 1173340 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/io/HiveFileFormatUtils.java 
1173340 
  trunk/ql/src/test/org/apache/hadoop/hive/ql/exec/TestUtilities.java 
PRE-CREATION 

Diff: https://reviews.apache.org/r/1989/diff


Testing
---


Thanks,

Carl



 Files in Avro-backed Hive tables do not have a .avro extension
 

 Key: HIVE-2457
 URL: https://issues.apache.org/jira/browse/HIVE-2457
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor, Serializers/Deserializers
Reporter: Tom White
 Attachments: HIVE-2457.patch


 When using the Avro SerDe (see HIVE-895, https://github.com/jghoman/haivvreo) 
 the files created for an Avro table do not have a .avro extension, which 
 causes problems for tools like Avro MapReduce or Sqoop which expect the 
 extension. 





[jira] [Commented] (HIVE-2457) Files in Avro-backed Hive tables do not have a .avro extension

2011-09-20 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13109130#comment-13109130
 ] 

jirapos...@reviews.apache.org commented on HIVE-2457:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1989/#review1982
---



trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java
https://reviews.apache.org/r/1989/#comment4468

Please add this configuration property to HiveConf and hive-default.xml



trunk/ql/src/test/org/apache/hadoop/hive/ql/exec/TestUtilities.java
https://reviews.apache.org/r/1989/#comment4479

In addition to the unit test it would also be nice to test this via 
TestCliDriver.

It should be possible to verify this from the CLI by doing something like 
this:


-- Set the filename suffix property. Then create a new table and stream
-- data into it. Then use the dfs cat command to dump the contents of
-- the raw files in the warehouse to stdout
hive> dfs -cat ${hiveconf:hive.metastore.warehouse.dir}/tablename/*.avro;


- Carl


On 2011-09-20 22:28:53, Carl Steinbach wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1989/
bq.  ---
bq.  
bq.  (Updated 2011-09-20 22:28:53)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Review for HIVE-2457
bq.  
bq.  
bq.  This addresses bug HIVE-2457.
bq.  https://issues.apache.org/jira/browse/HIVE-2457
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FileSinkOperator.java 
1173340 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java 1173340 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/io/HiveFileFormatUtils.java 
1173340 
bq.trunk/ql/src/test/org/apache/hadoop/hive/ql/exec/TestUtilities.java 
PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1989/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Carl
bq.  
bq.



 Files in Avro-backed Hive tables do not have a .avro extension
 

 Key: HIVE-2457
 URL: https://issues.apache.org/jira/browse/HIVE-2457
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor, Serializers/Deserializers
Reporter: Tom White
Assignee: Tom White
 Attachments: HIVE-2457.patch


 When using the Avro SerDe (see HIVE-895, https://github.com/jghoman/haivvreo) 
 the files created for an Avro table do not have a .avro extension, which 
 causes problems for tools like Avro MapReduce or Sqoop which expect the 
 extension. 





[jira] [Commented] (HIVE-2453) Need a way to categorize queries in hooks for improved logging

2011-09-19 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13107989#comment-13107989
 ] 

jirapos...@reviews.apache.org commented on HIVE-2453:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1933/
---

(Updated 2011-09-19 17:09:57.838587)


Review request for hive and Ning Zhang.


Changes
---

QueryProperties now captures DISTRIBUTE BY, as Ning requested, and CLUSTER BY, 
as it seemed like a logical addition.

I added test cases for these as well.


Summary
---

The information that would be useful for categorizing queries is clearest in 
the Semantic Analyzer, when the data from the Parser is interpreted.  I added a 
new class which is designed to collect that data here, and place it ultimately 
in the QueryPlan where it will be available to hooks.

The information I collect is whether or not the query has the following clauses:
  Join
  Group By
  Order By
  Sort By
  Group By after a Join clause

Also, I store whether or not a script is used for mapping or reducing.
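
The flags listed above could be held in a simple value class along the following lines. This is only a sketch: the class, field, and method names are assumptions made for illustration and may not match the QueryProperties class in the patch.

```java
// Illustrative sketch only; names are assumed, not copied from the patch.
final class QueryPropertiesSketch {
  boolean hasJoin;
  boolean hasGroupBy;
  boolean hasOrderBy;
  boolean hasSortBy;
  boolean hasJoinFollowedByGroupBy;
  boolean usesScript;

  // One-line summary that a hook could write to a log.
  String summary() {
    return "join=" + hasJoin
        + " groupBy=" + hasGroupBy
        + " orderBy=" + hasOrderBy
        + " sortBy=" + hasSortBy
        + " joinThenGroupBy=" + hasJoinFollowedByGroupBy
        + " script=" + usesScript;
  }

  public static void main(String[] args) {
    QueryPropertiesSketch p = new QueryPropertiesSketch();
    p.hasJoin = true;
    p.hasGroupBy = true;
    p.hasJoinFollowedByGroupBy = true;
    System.out.println(p.summary());
  }
}
```

Keeping the flags as plain booleans means a hook can categorize a query without inspecting the operator tree, which is the motivation given in the issue description.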


This addresses bug HIVE-2453.
https://issues.apache.org/jira/browse/HIVE-2453


Diffs (updated)
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/QueryPlan.java 1170719 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/QueryProperties.java PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java 
1170719 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 
1170719 
  
trunk/ql/src/test/org/apache/hadoop/hive/ql/hooks/CheckQueryPropertiesHook.java 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/query_properties.q PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/query_properties.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1933/diff


Testing
---

I added a new test, which runs a variety of queries, such that each of the 
flags in QueryProperties is set by at least one query, and also some are set in 
combinations.
I also added a hook which prints the contents of QueryProperties to error on 
the console.

I checked the output in the results file and verified it matched what I 
expected.


Thanks,

Kevin



 Need a way to categorize queries in hooks for improved logging
 --

 Key: HIVE-2453
 URL: https://issues.apache.org/jira/browse/HIVE-2453
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2453.1.patch.txt, HIVE-2453.2.patch.txt


 We need a way to categorize queries, such as whether or not they include a 
 join clause, a group by clause, etc., in the hooks.  This will allow for 
 better performance logging.
 Currently the only way I can find is to go through the operators in the 
 tasks, but which operators are used for the different types of queries may 
 change over time.





[jira] [Commented] (HIVE-2456) JDBCStatsAggregator DELETE STATEMENT should escape _ and %

2011-09-19 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13108001#comment-13108001
 ] 

jirapos...@reviews.apache.org commented on HIVE-2456:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1954/
---

(Updated 2011-09-19 17:18:39.180498)


Review request for hive.


Changes
---

escape '%' and '_' in DELETE statement


Summary
---

JDBCStatsAggregator first aggregates stats from all publishers and then deletes 
these intermediate results. The delete uses the LIKE operator, so it needs to 
escape '_' and '%'.
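
To see why the escaping matters: in a SQL LIKE pattern, '_' matches any single character and '%' matches any run of characters, so an unescaped key prefix can match (and delete) rows that belong to other keys. A minimal sketch of such escaping follows. The helper name and the choice of escape character are assumptions for illustration and are not taken from the patch.

```java
// Illustrative helper; the name and signature are hypothetical.
final class LikeEscapeSketch {
  // Prefixes '_', '%' and the escape character itself with the escape
  // character, so the result matches only the literal input under LIKE.
  static String escapeLike(String raw, char esc) {
    StringBuilder sb = new StringBuilder(raw.length() + 8);
    for (int i = 0; i < raw.length(); i++) {
      char c = raw.charAt(i);
      if (c == '_' || c == '%' || c == esc) {
        sb.append(esc);
      }
      sb.append(c);
    }
    return sb.toString();
  }

  public static void main(String[] args) {
    System.out.println(escapeLike("db_tbl/ds=100%", '\\'));
  }
}
```

The escaped value, with a trailing '%' appended afterwards, would then be bound to a statement that names the same escape character in an ESCAPE clause. How the escape character is written as a SQL literal varies by database.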


This addresses bug HIVE-2456.
https://issues.apache.org/jira/browse/HIVE-2456


Diffs
-


Diff: https://reviews.apache.org/r/1954/diff


Testing
---

running unit tests. 


Thanks,

Ning



 JDBCStatsAggregator DELETE STATEMENT should escape _ and %
 --

 Key: HIVE-2456
 URL: https://issues.apache.org/jira/browse/HIVE-2456
 Project: Hive
  Issue Type: Improvement
Reporter: Ning Zhang
Assignee: Ning Zhang
 Attachments: HIVE-2456.patch


 JDBCStatsAggregator first aggregates stats from all publishers and then 
 deletes these intermediate results. The delete uses the LIKE operator, so it 
 needs to escape '_' and '%'.





[jira] [Commented] (HIVE-2456) JDBCStatsAggregator DELETE STATEMENT should escape _ and %

2011-09-19 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13108011#comment-13108011
 ] 

jirapos...@reviews.apache.org commented on HIVE-2456:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1957/
---

Review request for hive.


Summary
---

JDBCStatsAggregator first aggregates stats from all publishers and then deletes 
these intermediate results. The delete uses the LIKE operator, so it needs to 
escape '_' and '%'.


This addresses bug HIVE-2456.
https://issues.apache.org/jira/browse/HIVE-2456


Diffs
-

  
trunk/ql/src/java/org/apache/hadoop/hive/ql/stats/jdbc/JDBCStatsAggregator.java 
1172720 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/stats/jdbc/JDBCStatsUtils.java 
1172720 

Diff: https://reviews.apache.org/r/1957/diff


Testing
---

Running unit tests. 


Thanks,

Ning







[jira] [Commented] (HIVE-2450) move lock retry logic into ZooKeeperHiveLockManager

2011-09-18 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13107636#comment-13107636
 ] 

jirapos...@reviews.apache.org commented on HIVE-2450:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1950/
---

Review request for hive and Ning Zhang.


Summary
---

move lock retry logic into ZooKeeperHiveLockManager


This addresses bug HIVE-2450.
https://issues.apache.org/jira/browse/HIVE-2450


Diffs
-

  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1171255 
  trunk/conf/hive-default.xml 1171255 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java 1171255 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/lockmgr/HiveLockObject.java 
1171255 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/lockmgr/zookeeper/ZooKeeperHiveLockManager.java
 1171255 

Diff: https://reviews.apache.org/r/1950/diff


Testing
---

will run tests locally


Thanks,

Yongqiang



 move lock retry logic into ZooKeeperHiveLockManager
 ---

 Key: HIVE-2450
 URL: https://issues.apache.org/jira/browse/HIVE-2450
 Project: Hive
  Issue Type: Improvement
Reporter: He Yongqiang
Assignee: He Yongqiang
 Attachments: HIVE-2450.1.patch








[jira] [Commented] (HIVE-2380) Add ByteArray Datatype

2011-09-17 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13107357#comment-13107357
 ] 

jirapos...@reviews.apache.org commented on HIVE-2380:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1586/
---

(Updated 2011-09-18 05:52:41.197110)


Review request for hive.


Changes
---

Updated patch incorporating feedback


Summary
---

This patch adds bytearray as a new datatype in Hive.


This addresses bug HIVE-2380.
https://issues.apache.org/jira/browse/HIVE-2380


Diffs (updated)
-

  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/MetaStoreUtils.java 
1172168 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FunctionRegistry.java 
1172168 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java 
1172168 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g 1172168 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ParseDriver.java 1172168 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/TypeCheckProcFactory.java 
1172168 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFConcat.java 1172168 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFLength.java 1172168 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFSubstr.java 1172168 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToString.java 1172168 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFToBinary.java 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/ba_table1.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/ba_table2.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/ba_table_udfs.q PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/ba_table1.q.out PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/ba_table2.q.out PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/ba_table_udfs.q.out PRE-CREATION 
  trunk/serde/if/serde.thrift 1172168 
  trunk/serde/src/gen/thrift/gen-cpp/serde_constants.h 1172168 
  trunk/serde/src/gen/thrift/gen-cpp/serde_constants.cpp 1172168 
  
trunk/serde/src/gen/thrift/gen-javabean/org/apache/hadoop/hive/serde/Constants.java
 1172168 
  trunk/serde/src/gen/thrift/gen-php/serde/serde_constants.php 1172168 
  trunk/serde/src/gen/thrift/gen-py/org_apache_hadoop_hive_serde/constants.py 
1172168 
  trunk/serde/src/gen/thrift/gen-rb/serde_constants.rb 1172168 
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/SerDeUtils.java 1172168 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/binarysortable/BinarySortableSerDe.java
 1172168 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/columnar/LazyBinaryColumnarSerDe.java
 1172168 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/columnar/LazyBinaryColumnarStruct.java
 1172168 
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazy/LazyBinary.java 
PRE-CREATION 
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazy/LazyFactory.java 
1172168 
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazy/LazyUtils.java 
1172168 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazy/objectinspector/primitive/LazyBinaryObjectInspector.java
 PRE-CREATION 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazy/objectinspector/primitive/LazyPrimitiveObjectInspectorFactory.java
 1172168 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazybinary/LazyBinaryBinary.java
 PRE-CREATION 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazybinary/LazyBinaryFactory.java
 1172168 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazybinary/LazyBinarySerDe.java
 1172168 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazybinary/LazyBinaryUtils.java
 1172168 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ObjectInspectorConverters.java
 1172168 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ObjectInspectorUtils.java
 1172168 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/PrimitiveObjectInspector.java
 1172168 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/BinaryObjectInspector.java
 PRE-CREATION 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/JavaBinaryObjectInspector.java
 PRE-CREATION 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/PrimitiveObjectInspectorConverter.java
 1172168 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/PrimitiveObjectInspectorFactory.java
 1172168 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/PrimitiveObjectInspectorUtils.java
 1172168 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/SettableBinaryObjectInspector.java
 

[jira] [Commented] (HIVE-2181) Clean up the scratch.dir (tmp/hive-root) while restarting Hive server.

2011-09-16 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13106615#comment-13106615
 ] 

jirapos...@reviews.apache.org commented on HIVE-2181:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1481/
---

(Updated 2011-09-16 17:30:40.980271)


Review request for hive.


Changes
---

Rebased the patch on latest trunk


Summary
---

Currently, queries leave their map outputs under scratch.dir after execution. 
If the Hive server is stopped, there is no need to keep the stopped server's 
map outputs, so the scratch.dir can be cleared when the server starts. This 
reduces disk usage.

Implemented a cleanup method in HiveServer and it will trigger based on the 
property value hive.start.cleanup.scrachdir
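
As a rough illustration of the cleanup described above, here is a minimal sketch, not the HiveServer code from the patch. The class name ScratchDirCleaner, the method cleanOnStartup, and the boolean flag (standing in for the configuration property) are all hypothetical, and the sketch works on a local directory rather than a Hadoop FileSystem.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

/** Hypothetical helper; not the HiveServer code from the patch. */
final class ScratchDirCleaner {

  /**
   * Deletes everything below scratchDir when cleanupEnabled is true and
   * returns the number of entries removed. The directory itself is kept.
   */
  static int cleanOnStartup(Path scratchDir, boolean cleanupEnabled) throws IOException {
    if (!cleanupEnabled || !Files.isDirectory(scratchDir)) {
      return 0;
    }
    List<Path> entries;
    try (Stream<Path> walk = Files.walk(scratchDir)) {
      // Reverse order lists children before their parent directories.
      entries = walk.filter(p -> !p.equals(scratchDir))
          .sorted(Comparator.reverseOrder())
          .collect(Collectors.toList());
    }
    for (Path entry : entries) {
      Files.delete(entry);
    }
    return entries.size();
  }
}
```

Gating the cleanup on a flag keeps the default behaviour unchanged for deployments where several servers share one scratch directory.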


This addresses bug HIVE-2181.
https://issues.apache.org/jira/browse/HIVE-2181


Diffs (updated)
-

  trunk/conf/hive-default.xml 1171488 
  trunk/service/src/java/org/apache/hadoop/hive/service/HiveServer.java 1171488 
  trunk/service/src/test/org/apache/hadoop/hive/service/TestHiveServer.java 
1171488 
  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1171488 

Diff: https://reviews.apache.org/r/1481/diff


Testing
---

Added test case for this scenario.


Thanks,

chinna



  Clean up the scratch.dir (tmp/hive-root) while restarting Hive server. 
 

 Key: HIVE-2181
 URL: https://issues.apache.org/jira/browse/HIVE-2181
 Project: Hive
  Issue Type: Bug
  Components: Server Infrastructure
Affects Versions: 0.8.0
 Environment: Suse linux, Hadoop 20.1, Hive 0.8
Reporter: sanoj mathew
Assignee: Chinna Rao Lalam
Priority: Minor
 Attachments: HIVE-2181.1.patch, HIVE-2181.2.patch, HIVE-2181.3.patch, 
 HIVE-2181.patch

   Original Estimate: 48h
  Remaining Estimate: 48h

 Currently, queries leave their map outputs under scratch.dir after execution. 
 If the Hive server is stopped, there is no need to keep the stopped server's 
 map outputs, so the scratch.dir can be cleared when the server starts. This 
 reduces disk usage.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2017) Driver.execute() should maintaining SessionState in case of runtime errors

2011-09-16 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13106619#comment-13106619
 ] 

jirapos...@reviews.apache.org commented on HIVE-2017:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1609/
---

(Updated 2011-09-16 17:33:41.236186)


Review request for hive, John Sichi and Ning Zhang.


Changes
---

Rebased the patch on the latest trunk


Summary
---

While processing two parallel tasks, if one of the tasks fails, 
Driver.taskCleanup() calls System.exit(), which shuts down the JVM. That call 
is replaced with logic that stops the remaining tasks. Two kinds of tasks need 
to be cleaned up: non-MR tasks and MR tasks. Non-MR tasks are stopped with 
thread.interrupt(), because every non-MR task is executed as a thread. For MR 
tasks, a variable called jobKillUri's tracks the kill URI of each spawned job, 
and taskCleanup() uses this variable to kill the job. 
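
The two cleanup paths described above can be sketched as follows. This is only an illustration under stated assumptions: RunningTaskTracker and its methods are hypothetical names, and the actual job kill (an HTTP request to the kill URI in a real deployment) is abstracted behind a caller-supplied Consumer.

```java
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.function.Consumer;

/** Hypothetical tracker; names do not come from the patch. */
final class RunningTaskTracker {
  private final List<Thread> nonMrThreads = new CopyOnWriteArrayList<>();
  private final List<String> jobKillUris = new CopyOnWriteArrayList<>();

  void registerThread(Thread t) {
    nonMrThreads.add(t);
  }

  void registerKillUri(String uri) {
    jobKillUris.add(uri);
  }

  /**
   * Stops remaining work without exiting the JVM: interrupts live local
   * threads and hands each kill URI to the supplied killer.
   * Returns how many tasks were signalled.
   */
  int taskCleanup(Consumer<String> jobKiller) {
    int signalled = 0;
    for (Thread t : nonMrThreads) {
      if (t.isAlive()) {
        t.interrupt();
        signalled++;
      }
    }
    for (String uri : jobKillUris) {
      jobKiller.accept(uri);
      signalled++;
    }
    nonMrThreads.clear();
    jobKillUris.clear();
    return signalled;
  }
}
```

Note that interrupt() only requests a stop; a task thread has to check its interrupt status or block in an interruptible call for the request to take effect.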


This addresses bug HIVE-2017.
https://issues.apache.org/jira/browse/HIVE-2017


Diffs (updated)
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/HadoopJobExecHelper.java 
1170977 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Task.java 1170977 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/io/rcfile/merge/BlockMergeTask.java 
1170977 
  trunk/ql/src/test/queries/clientnegative/alter_exit.q PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/alter_exit.q.out PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java 1170977 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/ExecDriver.java 1170977 

Diff: https://reviews.apache.org/r/1609/diff


Testing
---

Added test case for this scenario.  Ran existing test cases


Thanks,

chinna



 Driver.execute() should maintaining SessionState in case of runtime errors
 --

 Key: HIVE-2017
 URL: https://issues.apache.org/jira/browse/HIVE-2017
 Project: Hive
  Issue Type: Bug
Reporter: Ning Zhang
Assignee: Chinna Rao Lalam
 Attachments: HIVE-2017.1.patch, HIVE-2017.2.patch


 Here's a snippet from Driver.execute():
 {code}
 // TODO: This error messaging is not very informative. Fix that.
 errorMessage = "FAILED: Execution Error, return code " + exitVal 
 + " from " 
 + tsk.getClass().getName();
 SQLState = "08S01";
 console.printError(errorMessage);
 if (running.size() != 0) {
   taskCleanup();
 }
 return 9;
 {code}
 It simply returns in case of runtime errors without maintaining SessionState. 
 This could cause the resource leak mentioned in HIVE-1959. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2446) Introduction of client statistics publishers possibility

2011-09-16 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13106620#comment-13106620
 ] 

jirapos...@reviews.apache.org commented on HIVE-2446:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1909/
---

(Updated 2011-09-16 17:35:58.909415)


Review request for hive and Ning Zhang.


Changes
---

Changed interface of ClientStatsPublisher, made instantiation of Client Stats 
Publishers more efficient, this diff no longer removes some unused imports


Summary (updated)
---

The purpose of this change is to allow publication or storage of counters while 
the job is running.

Introduced two new variables in hive-default.xml and HiveConf.java: 
hive.client.stats.publishers and hive.client.stats.counters. The first 
specifies the names of classes whose instances will be executed by 
HadoopJobExecHelper.java (similarly to hooks) in its method 
progress(ExecDriverTaskHandle): MapRedStats. The second specifies the list of 
counters that client stats publishers should publish or store. The format of 
this list is up to the specific deployment (it is up to the client stats 
publishers to parse it), but it must use the display names of counter groups 
and counters.

Added interface ClientStatsPublisher in the org.apache.hadoop.hive.ql.stats 
package, which must be implemented by all client stats publishers.

Added code to progress(ExecDriverTaskHandle): MapRedStats in 
HadoopJobExecHelper.java that puts the counters' values into a Java map and 
then executes the registered client stats publishers, passing them that map 
and the running job ID. Added two new methods to HadoopJobExecHelper, 
extractAllCounterValues(Counters) and getClientStatsPublishers(), which are 
used by that code.
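
To make the flow above concrete, here is a minimal sketch of a publisher interface plus the load-and-run step. The method signature run(Map, String), the helper class PublisherLoader, and the sample RecordingPublisher are assumptions for illustration; the real interface is defined in ClientStatsPublisher.java in the patch and may differ.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

/** Assumed shape of the interface; the real signature is in the patch. */
interface ClientStatsPublisher {
  void run(Map<String, Double> counterValues, String jobId);
}

/** Hypothetical sample publisher that just records what it was given. */
final class RecordingPublisher implements ClientStatsPublisher {
  static final List<String> SEEN = new ArrayList<>();

  @Override
  public void run(Map<String, Double> counterValues, String jobId) {
    SEEN.add(jobId + ":" + counterValues.size());
  }
}

/** Hypothetical helper mirroring how a class-name list could be consumed. */
final class PublisherLoader {

  /** Instantiates each class named in a comma-separated list, skipping blanks. */
  static List<ClientStatsPublisher> load(String classNames)
      throws ReflectiveOperationException {
    List<ClientStatsPublisher> publishers = new ArrayList<>();
    for (String name : classNames.split(",")) {
      String trimmed = name.trim();
      if (trimmed.isEmpty()) {
        continue;
      }
      Class<?> cls = Class.forName(trimmed);
      publishers.add((ClientStatsPublisher) cls.getDeclaredConstructor().newInstance());
    }
    return publishers;
  }

  /** Passes the counter map and the job ID to every registered publisher. */
  static void publish(List<ClientStatsPublisher> publishers,
                      Map<String, Double> counters, String jobId) {
    for (ClientStatsPublisher p : publishers) {
      p.run(counters, jobId);
    }
  }
}
```

Loading the publishers once and reusing the instances on every progress poll avoids repeated reflection while the job is running.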

Made cosmetic changes in one other class


This addresses bug HIVE-2446.
https://issues.apache.org/jira/browse/HIVE-2446


Diffs (updated)
-

  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1171283 
  trunk/conf/hive-default.xml 1171283 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/HadoopJobExecHelper.java 
1171283 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/history/HiveHistory.java 1171283 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/stats/ClientStatsPublisher.java 
PRE-CREATION 

Diff: https://reviews.apache.org/r/1909/diff


Testing
---

Ran some random tests; the unit tests are still running.


Thanks,

Robert



 Introduction of client statistics publishers possibility
 

 Key: HIVE-2446
 URL: https://issues.apache.org/jira/browse/HIVE-2446
 Project: Hive
  Issue Type: Improvement
  Components: Clients, Statistics
Reporter: Robert Surówka
Assignee: Robert Surówka
Priority: Minor
 Attachments: HIVE-2446.1.patch, HIVE-2446.1.patch, HIVE-2446.2.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 The purpose of this change is to allow publication or storage of counters 
 while the job is running.
 Introduced two new variables in hive-default.xml and HiveConf.java: 
 hive.client.stats.publishers and hive.client.stats.counters. The first 
 specifies the names of classes whose instances will be executed by 
 HadoopJobExecHelper.java (similarly to hooks) in its method 
 progress(ExecDriverTaskHandle): MapRedStats. The second specifies the list of 
 counters that client stats publishers should publish or store. The format of 
 this list is up to the specific deployment (it is up to the client stats 
 publishers to parse it), but it must use the display names of counter groups 
 and counters.
 Added interface ClientStatsPublisher in the org.apache.hadoop.hive.ql.stats 
 package, which must be implemented by all client stats publishers.
 Added code to progress(ExecDriverTaskHandle): MapRedStats in 
 HadoopJobExecHelper.java that puts the counters' values into a Java map and 
 then executes the registered client stats publishers, passing them that map 
 and the running job ID. Added two new methods to HadoopJobExecHelper, 
 extractAllCounterValues(Counters) and getClientStatsPublishers(), which are 
 used by that code.
 Made cosmetic changes in two other classes
  

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2446) Introduction of client statistics publishers possibility

2011-09-16 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13106625#comment-13106625
 ] 

jirapos...@reviews.apache.org commented on HIVE-2446:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1909/#review1935
---

Ship it!


- Ning


On 2011-09-16 17:35:58, Robert Surówka wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1909/
bq.  ---
bq.  
bq.  (Updated 2011-09-16 17:35:58)
bq.  
bq.  
bq.  Review request for hive and Ning Zhang.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  The purpose of this change is to allow publication or storage of counters 
while the job is running.
bq.  
bq.  Introduced two new variables to hive-default.xml and HiveConf.java: 
hive.client.stats.publishers and hive.client.stats.counters. First one 
specifies classes names, whose instances will be executed by 
HadoopJobExecHelper.java (similarly as hooks are) in its method 
progress(ExecDriverTaskHandle): MapRedStats. Second one specifies list of 
counters that any client stat publishers should publish or stored. Details 
regarding format of this list is up to a specific deployment (it is up to 
client stats publishers to parse it), yet it is required to use display names 
of counter groups and counters.
bq.  
bq.  Added interface ClientStatsPublishers in org.apache.hadoop.hive.ql.stats 
package, that must be implemented by all stats publishers.
bq.  
bq.  Added code to progress(ExecDriverTaskHandle): MapRedStats from 
HadoopJobExecHelper.java that puts counters' values to a Java map and then 
executes registered client stats publishers giving them that map and running 
job id. Added two new methods to HadoopJobExecHelper: 
extractAllCounterValues(Counters) and getClientStatsPublishers() that are used 
by code from previous sentence.
bq.  
bq.  Made cosmetic changes in one other class
bq.  
bq.  
bq.  This addresses bug HIVE-2446.
bq.  https://issues.apache.org/jira/browse/HIVE-2446
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1171283 
bq.trunk/conf/hive-default.xml 1171283 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/HadoopJobExecHelper.java 
1171283 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/history/HiveHistory.java 
1171283 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/stats/ClientStatsPublisher.java 
PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1909/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Run some random tests, and still running the unit tests.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Robert
bq.  
bq.



 Introduction of client statistics publishers possibility
 

 Key: HIVE-2446
 URL: https://issues.apache.org/jira/browse/HIVE-2446
 Project: Hive
  Issue Type: Improvement
  Components: Clients, Statistics
Reporter: Robert Surówka
Assignee: Robert Surówka
Priority: Minor
 Attachments: HIVE-2446.1.patch, HIVE-2446.1.patch, HIVE-2446.2.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 The purpose of this change is to allow publication or storage of counters 
 while the job is running.
 Introduced two new variables in hive-default.xml and HiveConf.java: 
 hive.client.stats.publishers and hive.client.stats.counters. The first 
 specifies the names of classes whose instances will be executed by 
 HadoopJobExecHelper.java (similarly to hooks) in its method 
 progress(ExecDriverTaskHandle): MapRedStats. The second specifies the list of 
 counters that client stats publishers should publish or store. The format of 
 this list is up to the specific deployment (it is up to the client stats 
 publishers to parse it), but it must use the display names of counter groups 
 and counters.
 Added interface ClientStatsPublisher in the org.apache.hadoop.hive.ql.stats 
 package, which must be implemented by all client stats publishers.
 Added code to progress(ExecDriverTaskHandle): MapRedStats in 
 HadoopJobExecHelper.java that puts the counters' values into a Java map and 
 then executes the registered client stats publishers, passing them that map 
 and the running job ID. Added two new methods to HadoopJobExecHelper, 
 extractAllCounterValues(Counters) and getClientStatsPublishers(), which are 
 used by that code.
 Made cosmetic changes in two other classes
  

--
This message is automatically generated by JIRA.
For more information on JIRA, see: 

[jira] [Commented] (HIVE-2453) Need a way to categorize queries in hooks for improved logging

2011-09-16 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13106720#comment-13106720
 ] 

jirapos...@reviews.apache.org commented on HIVE-2453:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1933/
---

Review request for hive and Ning Zhang.


Summary
---

The information that would be useful for categorizing queries is clearest in 
the Semantic Analyzer, when the data from the Parser is interpreted.  I added a 
new class which is designed to collect that data here, and place it ultimately 
in the QueryPlan where it will be available to hooks.

The information I collect is whether or not the query has the following clauses:
  Join
  Group By
  Order By
  Sort By
  Group By after a Join clause

Also, I store whether or not a script is used for mapping or reducing.
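
A flag holder along the lines described above might look like the sketch below. It is not the QueryProperties class from the patch: the class name QueryFlags, its method names, and the simplified rule for "Group By after a Join" are assumptions made for illustration.

```java
/** Hypothetical flag holder; field and method names are illustrative only. */
final class QueryFlags {
  private boolean join;
  private boolean groupBy;
  private boolean orderBy;
  private boolean sortBy;
  private boolean groupByAfterJoin;
  private boolean script;

  void sawJoin() {
    join = true;
  }

  void sawGroupBy() {
    groupBy = true;
    // Simplification: a Group By seen once a Join is recorded counts as "after a Join".
    if (join) {
      groupByAfterJoin = true;
    }
  }

  void sawOrderBy() {
    orderBy = true;
  }

  void sawSortBy() {
    sortBy = true;
  }

  void sawScript() {
    script = true;
  }

  boolean hasJoin() {
    return join;
  }

  boolean hasGroupByAfterJoin() {
    return groupByAfterJoin;
  }

  /** One-line summary that a logging hook could print. */
  String describe() {
    return "join=" + join + " groupBy=" + groupBy + " orderBy=" + orderBy
        + " sortBy=" + sortBy + " groupByAfterJoin=" + groupByAfterJoin
        + " script=" + script;
  }
}
```

Recording plain booleans during semantic analysis keeps hooks independent of which operators a given query type happens to compile to.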


This addresses bug HIVE-2453.
https://issues.apache.org/jira/browse/HIVE-2453


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/QueryPlan.java 1170719 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/QueryProperties.java PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java 
1170719 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 
1170719 
  
trunk/ql/src/test/org/apache/hadoop/hive/ql/hooks/CheckQueryPropertiesHook.java 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/query_properties.q PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/query_properties.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1933/diff


Testing
---

I added a new test, which runs a variety of queries, such that each of the 
flags in QueryProperties is set by at least one query, and also some are set in 
combinations.
I also added a hook which prints the contents of QueryProperties to error on 
the console.

I checked the output in the results file and verified it matched what I 
expected.


Thanks,

Kevin



 Need a way to categorize queries in hooks for improved logging
 --

 Key: HIVE-2453
 URL: https://issues.apache.org/jira/browse/HIVE-2453
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2453.1.patch.txt


 We need a way to categorize queries, such as whether or not they include a 
 join clause, a group by clause, etc., in the hooks.  This will allow for 
 better performance logging.
 Currently the only way I can find is to go through the operators in the 
 tasks, but which operators are used for the different types of queries may 
 change over time.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2453) Need a way to categorize queries in hooks for improved logging

2011-09-16 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13106830#comment-13106830
 ] 

jirapos...@reviews.apache.org commented on HIVE-2453:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1933/#review1946
---



trunk/ql/src/java/org/apache/hadoop/hive/ql/QueryProperties.java
https://reviews.apache.org/r/1933/#comment4427

can you split it into 2 parts: useScriptInMapper and useScriptInReducer? 


- Ning


On 2011-09-16 19:04:32, Kevin Wilfong wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1933/
bq.  ---
bq.  
bq.  (Updated 2011-09-16 19:04:32)
bq.  
bq.  
bq.  Review request for hive and Ning Zhang.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  The information that would be useful for categorizing queries is clearest 
in the Semantic Analyzer, when the data from the Parser is interpreted.  I 
added a new class which is designed to collect that data here, and place it 
ultimately in the QueryPlan where it will be available to hooks.
bq.  
bq.  The information I collect is whether or not the query has the following 
clauses:
bq.Join
bq.Group By
bq.Order By
bq.Sort By
bq.Group By after a Join clause
bq.  
bq.  Also, I store whether or not a script is used for mapping or reducing.
bq.  
bq.  
bq.  This addresses bug HIVE-2453.
bq.  https://issues.apache.org/jira/browse/HIVE-2453
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/QueryPlan.java 1170719 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/QueryProperties.java 
PRE-CREATION 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java 
1170719 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 
1170719 
bq.
trunk/ql/src/test/org/apache/hadoop/hive/ql/hooks/CheckQueryPropertiesHook.java 
PRE-CREATION 
bq.trunk/ql/src/test/queries/clientpositive/query_properties.q PRE-CREATION 
bq.trunk/ql/src/test/results/clientpositive/query_properties.q.out 
PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1933/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  I added a new test, which runs a variety of queries, such that each of the 
flags in QueryProperties is set by at least one query, and also some are set in 
combinations.
bq.  I also added a hook which prints the contents of QueryProperties to error 
on the console.
bq.  
bq.  I checked the output in the results file and verified it matched what I 
expected.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Kevin
bq.  
bq.



 Need a way to categorize queries in hooks for improved logging
 --

 Key: HIVE-2453
 URL: https://issues.apache.org/jira/browse/HIVE-2453
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2453.1.patch.txt


 We need a way to categorize queries, such as whether or not they include a 
 join clause, a group by clause, etc., in the hooks.  This will allow for 
 better performance logging.
 Currently the only way I can find is to go through the operators in the 
 tasks, but which operators are used for the different types of queries may 
 change over time.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2453) Need a way to categorize queries in hooks for improved logging

2011-09-16 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13106941#comment-13106941
 ] 

jirapos...@reviews.apache.org commented on HIVE-2453:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1933/
---

(Updated 2011-09-17 00:14:50.529819)


Review request for hive and Ning Zhang.


Summary
---

The information that would be useful for categorizing queries is clearest in 
the Semantic Analyzer, when the data from the Parser is interpreted.  I added a 
new class which is designed to collect that data here, and place it ultimately 
in the QueryPlan where it will be available to hooks.

The information I collect is whether or not the query has the following clauses:
  Join
  Group By
  Order By
  Sort By
  Group By after a Join clause

Also, I store whether or not a script is used for mapping or reducing.


This addresses bug HIVE-2453.
https://issues.apache.org/jira/browse/HIVE-2453


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/QueryPlan.java 1170719 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/QueryProperties.java PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java 
1170719 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 
1170719 
  
trunk/ql/src/test/org/apache/hadoop/hive/ql/hooks/CheckQueryPropertiesHook.java 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/query_properties.q PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/query_properties.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1933/diff


Testing
---

I added a new test, which runs a variety of queries, such that each of the 
flags in QueryProperties is set by at least one query, and also some are set in 
combinations.
I also added a hook which prints the contents of QueryProperties to error on 
the console.

I checked the output in the results file and verified it matched what I 
expected.


Thanks,

Kevin



 Need a way to categorize queries in hooks for improved logging
 --

 Key: HIVE-2453
 URL: https://issues.apache.org/jira/browse/HIVE-2453
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2453.1.patch.txt


 We need a way to categorize queries, such as whether or not they include a 
 join clause, a group by clause, etc., in the hooks.  This will allow for 
 better performance logging.
 Currently the only way I can find is to go through the operators in the 
 tasks, but which operators are used for the different types of queries may 
 change over time.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2453) Need a way to categorize queries in hooks for improved logging

2011-09-16 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13107017#comment-13107017
 ] 

jirapos...@reviews.apache.org commented on HIVE-2453:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1933/#review1956
---

Ship it!


- Ning


On 2011-09-17 00:14:50, Kevin Wilfong wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1933/
bq.  ---
bq.  
bq.  (Updated 2011-09-17 00:14:50)
bq.  
bq.  
bq.  Review request for hive and Ning Zhang.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  The information that would be useful for categorizing queries is clearest 
in the Semantic Analyzer, when the data from the Parser is interpreted.  I 
added a new class which is designed to collect that data here, and place it 
ultimately in the QueryPlan where it will be available to hooks.
bq.  
bq.  The information I collect is whether or not the query has the following 
clauses:
bq.Join
bq.Group By
bq.Order By
bq.Sort By
bq.Group By after a Join clause
bq.  
bq.  Also, I store whether or not a script is used for mapping or reducing.
bq.  
bq.  
bq.  This addresses bug HIVE-2453.
bq.  https://issues.apache.org/jira/browse/HIVE-2453
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/QueryPlan.java 1170719 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/QueryProperties.java 
PRE-CREATION 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java 
1170719 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 
1170719 
bq.
trunk/ql/src/test/org/apache/hadoop/hive/ql/hooks/CheckQueryPropertiesHook.java 
PRE-CREATION 
bq.trunk/ql/src/test/queries/clientpositive/query_properties.q PRE-CREATION 
bq.trunk/ql/src/test/results/clientpositive/query_properties.q.out 
PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1933/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  I added a new test, which runs a variety of queries, such that each of the 
flags in QueryProperties is set by at least one query, and also some are set in 
combinations.
bq.  I also added a hook which prints the contents of QueryProperties to error 
on the console.
bq.  
bq.  I checked the output in the results file and verified it matched what I 
expected.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Kevin
bq.  
bq.



 Need a way to categorize queries in hooks for improved logging
 --

 Key: HIVE-2453
 URL: https://issues.apache.org/jira/browse/HIVE-2453
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2453.1.patch.txt


 We need a way to categorize queries, such as whether or not they include a 
 join clause, a group by clause, etc., in the hooks.  This will allow for 
 better performance logging.
 Currently the only way I can find is to go through the operators in the 
 tasks, but which operators are used for the different types of queries may 
 change over time.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-198) Parse errors report incorrectly.

2011-09-15 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13105662#comment-13105662
 ] 

jirapos...@reviews.apache.org commented on HIVE-198:



---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1818/
---

(Updated 2011-09-15 20:25:10.729894)


Review request for hive.


Changes
---

Fixed missing tests create_or_replace_view6, TestJdbcDriver.java.


Summary
---

Modified the ParseException message to include the actual token it fails on 
(when available).
This should make it easier to grok the actual problem from the error.
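
The idea can be sketched as below. This is a hypothetical formatter, not the ParseDriver change itself: the class ParseErrorFormatter, its parameters, and the exact "near" wording are assumptions.

```java
/** Hypothetical formatter; the real change lives in ParseDriver.java. */
final class ParseErrorFormatter {

  /**
   * Builds "line L:C message" and appends the offending token when one is
   * available (non-null and non-empty).
   */
  static String format(int line, int column, String baseMessage, String offendingToken) {
    StringBuilder sb = new StringBuilder();
    sb.append("line ").append(line).append(':').append(column)
        .append(' ').append(baseMessage);
    if (offendingToken != null && !offendingToken.isEmpty()) {
      sb.append(" near '").append(offendingToken).append('\'');
    }
    return sb.toString();
  }
}
```

For example, format(1, 16, "mismatched input", "TABLE") yields a message ending in near 'TABLE', while a null token leaves the base message unchanged.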


This addresses bug HIVE-198.
https://issues.apache.org/jira/browse/HIVE-198


Diffs (updated)
-

  trunk/jdbc/src/test/org/apache/hadoop/hive/jdbc/TestJdbcDriver.java 1164307 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ParseDriver.java 1164307 
  trunk/ql/src/test/results/clientnegative/column_rename3.q.out 1164307 
  trunk/ql/src/test/results/clientnegative/create_or_replace_view6.q.out 
1164307 
  trunk/ql/src/test/results/clientnegative/invalid_select_expression.q.out 
1164307 
  trunk/ql/src/test/results/clientnegative/invalid_tbl_name.q.out 1164307 
  trunk/ql/src/test/results/compiler/errors/garbage.q.out 1164307 
  trunk/ql/src/test/results/compiler/errors/missing_overwrite.q.out 1164307 
  trunk/ql/src/test/results/compiler/errors/wrong_distinct2.q.out 1164307 

Diff: https://reviews.apache.org/r/1818/diff


Testing
---

ant test -Dtestcase=TestParseNegative


Thanks,

avivey



 Parse errors report incorrectly.
 

 Key: HIVE-198
 URL: https://issues.apache.org/jira/browse/HIVE-198
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: S. Alex Smith
Assignee: Aviv Eyal
  Labels: parse
 Attachments: HIVE-198.2.patch.txt, PraseErrorMessage.patch


 The following two queries fail:
 CREATE TABLE output_table(userid, bigint);
 CREATE TABLE output_table(userid bigint, age int, sex string, location 
 string);
 each giving the error message "FAILED: Parse Error: line 1:16 mismatched 
 input 'TABLE' expecting KW_TEMPORARY".
 Although one might not catch it from the error message, the problem with the 
 first is that there is a comma between userid and bigint, and the problem 
 with the second is that location is a reserved keyword.  Reported errors 
 should more accurately describe the nature of the error, such as "no type 
 given for column 'userid'" or "'location' is not a valid column name".

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2447) Add job ID to MapRedStats

2011-09-15 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13105681#comment-13105681
 ] 

jirapos...@reviews.apache.org commented on HIVE-2447:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1918/#review1917
---

Ship it!


- Ning


On 2011-09-15 17:39:49, Kevin Wilfong wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1918/
bq.  ---
bq.  
bq.  (Updated 2011-09-15 17:39:49)
bq.  
bq.  
bq.  Review request for hive and Ning Zhang.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  I added the job ID to the MapRedStats class.  It is populated as part of 
the constructor.
bq.  
bq.  
bq.  This addresses bug HIVE-2447.
bq.  https://issues.apache.org/jira/browse/HIVE-2447
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/MapRedStats.java 1171172 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/HadoopJobExecHelper.java 
1171172 
bq.  
bq.  Diff: https://reviews.apache.org/r/1918/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Verified the job ID was being populated.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Kevin
bq.  
bq.



 Add job ID to MapRedStats
 -

 Key: HIVE-2447
 URL: https://issues.apache.org/jira/browse/HIVE-2447
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2447.1.patch.txt


 Currently, we collect a lot of stats per job in MapRedStats, but there is no 
 way to associate these stats with a specific job.  Adding the job ID to the 
 class would help to make the association clear.
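
As a rough illustration of the idea (not the actual Hive class: the class name, fields, and output format below are assumptions made for this sketch), a per-job stats holder that receives the job ID in its constructor might look like this:

```java
// Hypothetical, simplified stand-in for a per-job stats holder. The real
// MapRedStats class in Hive differs; every name here is illustrative.
final class JobStatsSketch {
    private final String jobId;   // ties these stats to one specific job
    private final int numMap;
    private final int numReduce;
    private final long cpuMsec;

    // The job ID is supplied once, at construction time, alongside the stats.
    JobStatsSketch(int numMap, int numReduce, long cpuMsec, String jobId) {
        this.numMap = numMap;
        this.numReduce = numReduce;
        this.cpuMsec = cpuMsec;
        this.jobId = jobId;
    }

    String getJobId() {
        return jobId;
    }

    @Override
    public String toString() {
        return "Job " + jobId + ": Map: " + numMap + "  Reduce: " + numReduce
            + "  CPU msec: " + cpuMsec;
    }
}
```

With the ID stored next to the numbers, any consumer of the stats object can report which job they belong to without a separate lookup.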





[jira] [Commented] (HIVE-2446) Introduction of client statistics publishers possibility

2011-09-15 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13105710#comment-13105710
 ] 

jirapos...@reviews.apache.org commented on HIVE-2446:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1909/#review1918
---



trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/HadoopJobExecHelper.java
https://reviews.apache.org/r/1909/#comment4399

Should -1 be replaced by 0 here? 



trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/HadoopJobExecHelper.java
https://reviews.apache.org/r/1909/#comment4400

I've thought it over again and feel it's better to use the counter name 
here rather than the display name. The display name has the benefit that users 
see the same name on the JT page and the metrics page, but it could be too 
long, may sometimes contain special characters, and could change over time. 
I think the name is a better ID here. The only caveat is that the person 
looking at the metrics needs to know how it translates to the display name 
on the JT page. 



trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/HadoopJobExecHelper.java
https://reviews.apache.org/r/1909/#comment4398

Is this line too long? Hive's line length shouldn't exceed 100 chars. 



trunk/ql/src/java/org/apache/hadoop/hive/ql/stats/ClientStatsPublisher.java
https://reviews.apache.org/r/1909/#comment4401

Rather than passing a JobID, it would be more flexible to pass a String 
converted from the JobID. This is the approach that the StatsPublisher 
interface uses. 


- Ning


On 2011-09-14 22:19:21, Robert Surówka wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1909/
bq.  ---
bq.  
bq.  (Updated 2011-09-14 22:19:21)
bq.  
bq.  
bq.  Review request for hive and Ning Zhang.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  The purpose of this change is to allow publication or storage of counters while the job is running.
bq.  
bq.  Introduced two new variables in hive-default.xml and HiveConf.java: hive.client.stats.publishers and hive.client.stats.counters. The first specifies class names whose instances will be executed by HadoopJobExecHelper.java (similarly to how hooks are) in its method progress(ExecDriverTaskHandle): MapRedStats. The second specifies the list of counters that client stats publishers should publish or store. The format of this list is up to a specific deployment (it is up to the client stats publishers to parse it), yet it is required to use the display names of counter groups and counters.
bq.  
bq.  Added the interface ClientStatsPublisher in the org.apache.hadoop.hive.ql.stats package, which must be implemented by all client stats publishers.
bq.  
bq.  Added code to progress(ExecDriverTaskHandle): MapRedStats in HadoopJobExecHelper.java that puts the counters' values into a Java map and then executes the registered client stats publishers, giving them that map and the running job's ID. Added two new methods to HadoopJobExecHelper, extractAllCounterValues(Counters) and getClientStatsPublishers(), which are used by that code.
bq.  
bq.  Made cosmetic changes in two other classes.
bq.  
bq.  
bq.  This addresses bug HIVE-2446.
bq.  https://issues.apache.org/jira/browse/HIVE-2446
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.    trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1170867 
bq.    trunk/conf/hive-default.xml 1170867 
bq.    trunk/metastore/src/test/org/apache/hadoop/hive/metastore/TestMarkPartition.java 1170867 
bq.    trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/HadoopJobExecHelper.java 1170867 
bq.    trunk/ql/src/java/org/apache/hadoop/hive/ql/history/HiveHistory.java 1170867 
bq.    trunk/ql/src/java/org/apache/hadoop/hive/ql/stats/ClientStatsPublisher.java PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1909/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Ran some random tests; the unit tests are still running.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Robert
bq.  
bq.



 Introduction of client statistics publishers possibility
 

 Key: HIVE-2446
 URL: https://issues.apache.org/jira/browse/HIVE-2446
 Project: Hive
  Issue Type: Improvement
  Components: Clients, Statistics
Reporter: Robert Surówka
Priority: Minor
 Attachments: HIVE-2446.1.patch, HIVE-2446.1.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 The purpose of this change is to allow publication or storage of counters 
 while the job is 

[jira] [Commented] (HIVE-2445) The PerfLogger should log the full name of hooks, not just the simple name.

2011-09-14 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13104663#comment-13104663
 ] 

jirapos...@reviews.apache.org commented on HIVE-2445:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1882/
---

Review request for hive and Ning Zhang.


Summary
---

I changed the method called when logging pre, failure, and post hooks to the 
PerfLogger from getSimpleName to getName.
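
To show why the simple name can be ambiguous, here is a small sketch using plain `java.lang.Class` methods. The hook classes are hypothetical and exist only for this example; nested classes stand in for classes in different packages.

```java
// Two hypothetical hook classes that share a simple name but live in
// different enclosing classes (standing in for different packages).
final class HookNameDemo {
    static final class TeamA { static final class AuditHook { } }
    static final class TeamB { static final class AuditHook { } }

    // What a logger would record when using getSimpleName().
    static String simpleName(Object hook) {
        return hook.getClass().getSimpleName();
    }

    // What it would record when using getName(): the full binary name.
    static String fullName(Object hook) {
        return hook.getClass().getName();
    }

    public static void main(String[] args) {
        Object a = new TeamA.AuditHook();
        Object b = new TeamB.AuditHook();
        System.out.println(simpleName(a) + " vs " + simpleName(b)); // identical
        System.out.println(fullName(a) + " vs " + fullName(b));     // distinct
    }
}
```

Both hooks report the same simple name, while the full names stay distinct, which is the property the log entries need.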


This addresses bug HIVE-2445.
https://issues.apache.org/jira/browse/HIVE-2445


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java 1170268 

Diff: https://reviews.apache.org/r/1882/diff


Testing
---

Ran the unit test queries, and verified the intended change in the logs.


Thanks,

Kevin



 The PerfLogger should log the full name of hooks, not just the simple name.
 ---

 Key: HIVE-2445
 URL: https://issues.apache.org/jira/browse/HIVE-2445
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong

 Sometimes the simple name of a hook is not enough to identify it, so the 
 PerfLogger should log the full name instead.





[jira] [Commented] (HIVE-2445) The PerfLogger should log the full name of hooks, not just the simple name.

2011-09-14 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13104681#comment-13104681
 ] 

jirapos...@reviews.apache.org commented on HIVE-2445:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1882/#review1893
---

Ship it!


- Ning


On 2011-09-14 16:53:49, Kevin Wilfong wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1882/
bq.  ---
bq.  
bq.  (Updated 2011-09-14 16:53:49)
bq.  
bq.  
bq.  Review request for hive and Ning Zhang.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  I changed the method called when logging pre, failure, and post hooks to the PerfLogger from getSimpleName to getName.
bq.  
bq.  
bq.  This addresses bug HIVE-2445.
bq.  https://issues.apache.org/jira/browse/HIVE-2445
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java 1170268 
bq.  
bq.  Diff: https://reviews.apache.org/r/1882/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Ran the unit test queries, and verified the intended change in the logs.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Kevin
bq.  
bq.



 The PerfLogger should log the full name of hooks, not just the simple name.
 ---

 Key: HIVE-2445
 URL: https://issues.apache.org/jira/browse/HIVE-2445
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong

 Sometimes the simple name of a hook is not enough to identify it, so the 
 PerfLogger should log the full name instead.





[jira] [Commented] (HIVE-2398) Hive server doesn't return schema for 'set' command

2011-09-14 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13104738#comment-13104738
 ] 

jirapos...@reviews.apache.org commented on HIVE-2398:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1605/#review1895
---



jdbc/src/test/org/apache/hadoop/hive/jdbc/TestJdbcDriver.java
https://reviews.apache.org/r/1605/#comment4376

Need to assert that the size of the resultset is > 0. Might also be good to 
log the contents of the resultset.


- Carl


On 2011-09-13 20:03:48, Prasad Mujumdar wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1605/
bq.  ---
bq.  
bq.  (Updated 2011-09-13 20:03:48)
bq.  
bq.  
bq.  Review request for hive and Carl Steinbach.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  The Hive server currently has the framework to process CLI commands like 'set', but it returns an empty schema definition for these resultsets. The patch adds a getSchema() method to the CommandResponse interface. The SetProcessor is modified to build a schema object, which is returned by getSchema().
bq.  HiveServer is modified to call getSchema() for the command cases as well.
bq.  
bq.  
bq.  This addresses bug HIVE-2398.
bq.  https://issues.apache.org/jira/browse/HIVE-2398
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.    jdbc/src/test/org/apache/hadoop/hive/jdbc/TestJdbcDriver.java e744d11 
bq.    ql/src/java/org/apache/hadoop/hive/ql/processors/CommandProcessorResponse.java 4f7f472 
bq.    ql/src/java/org/apache/hadoop/hive/ql/processors/SetProcessor.java 9ca5ca4 
bq.    service/src/java/org/apache/hadoop/hive/service/HiveServer.java 95f322a 
bq.  
bq.  Diff: https://reviews.apache.org/r/1605/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Tested with ODBC program to execute the set command. Ran Hive unit tests.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Prasad
bq.  
bq.



 Hive server doesn't return schema for 'set' command
 ---

 Key: HIVE-2398
 URL: https://issues.apache.org/jira/browse/HIVE-2398
 Project: Hive
  Issue Type: Bug
  Components: Server Infrastructure
Affects Versions: 0.7.1, 0.8.0
Reporter: Prasad Mujumdar
Assignee: Prasad Mujumdar
 Attachments: HIVE-2398.patch, HIVE-2398.patch


 The Hive server processes CLI commands like 'set' and 'set -v' sent by 
 ODBC or JDBC clients, but currently only the data is returned to the client, 
 not the schema for that resultset. This makes the option unusable for an 
 ODBC or JDBC client.





[jira] [Commented] (HIVE-2398) Hive server doesn't return schema for 'set' command

2011-09-14 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13104817#comment-13104817
 ] 

jirapos...@reviews.apache.org commented on HIVE-2398:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1605/
---

(Updated 2011-09-14 19:41:13.647029)


Review request for hive and Carl Steinbach.


Changes
---

Added a check that the resultset is non-empty. 
The full set -v output would be too long, and most of the rows will differ 
in each test setup, so I suggest not logging it. Let me know if you disagree.


Summary
---

The Hive server currently has the framework to process CLI commands like 
'set', but it returns an empty schema definition for these resultsets. The 
patch adds a getSchema() method to the CommandResponse interface. The 
SetProcessor is modified to build a schema object, which is returned by 
getSchema().
HiveServer is modified to call getSchema() for the command cases as well.
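
The shape of that change can be sketched roughly as follows. This is a hedged illustration only: the class names, the "set" column name, and the single string column are assumptions for the sketch and are not taken from the patch.

```java
import java.util.Collections;
import java.util.List;

// Hypothetical, minimal model of one column in a result schema.
final class FieldSketch {
    final String name;
    final String type;

    FieldSketch(String name, String type) {
        this.name = name;
        this.type = type;
    }
}

// Hypothetical response object: a return code plus the schema of the rows.
final class CommandResponseSketch {
    private final int responseCode;
    private final List<FieldSketch> schema;

    CommandResponseSketch(int responseCode, List<FieldSketch> schema) {
        this.responseCode = responseCode;
        this.schema = schema;
    }

    int getResponseCode() {
        return responseCode;
    }

    // The server can call this for command results as well as query results.
    List<FieldSketch> getSchema() {
        return schema;
    }
}

final class SetProcessorSketch {
    static final String SET_COLUMN_NAME = "set";      // assumed column name
    static final String STRING_TYPE_NAME = "string";  // assumed type name

    // The column name is a constant, so no input parameter is needed.
    static List<FieldSketch> getSchema() {
        return Collections.singletonList(
            new FieldSketch(SET_COLUMN_NAME, STRING_TYPE_NAME));
    }

    static CommandResponseSketch run(String command) {
        // A real processor would apply or print the setting here.
        return new CommandResponseSketch(0, getSchema());
    }
}
```

A client that receives the response can then read the column metadata before iterating over the rows.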


This addresses bug HIVE-2398.
https://issues.apache.org/jira/browse/HIVE-2398


Diffs (updated)
-

  jdbc/src/test/org/apache/hadoop/hive/jdbc/TestJdbcDriver.java e744d11 
  ql/src/java/org/apache/hadoop/hive/ql/processors/CommandProcessorResponse.java 4f7f472 
  ql/src/java/org/apache/hadoop/hive/ql/processors/SetProcessor.java 9ca5ca4 
  service/src/java/org/apache/hadoop/hive/service/HiveServer.java 95f322a 

Diff: https://reviews.apache.org/r/1605/diff


Testing
---

Tested with ODBC program to execute the set command. Ran Hive unit tests.


Thanks,

Prasad



 Hive server doesn't return schema for 'set' command
 ---

 Key: HIVE-2398
 URL: https://issues.apache.org/jira/browse/HIVE-2398
 Project: Hive
  Issue Type: Bug
  Components: Server Infrastructure
Affects Versions: 0.7.1, 0.8.0
Reporter: Prasad Mujumdar
Assignee: Prasad Mujumdar
 Attachments: HIVE-2398.patch, HIVE-2398.patch


 The Hive server processes CLI commands like 'set' and 'set -v' sent by 
 ODBC or JDBC clients, but currently only the data is returned to the client, 
 not the schema for that resultset. This makes the option unusable for an 
 ODBC or JDBC client.





[jira] [Commented] (HIVE-2446) Introduction of client statistics publishers possibility

2011-09-14 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13104944#comment-13104944
 ] 

jirapos...@reviews.apache.org commented on HIVE-2446:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1909/
---

Review request for hive and Ning Zhang.


Summary
---

The purpose of this change is to allow publication or storage of counters while 
the job is running.

Introduced two new variables in hive-default.xml and HiveConf.java: 
hive.client.stats.publishers and hive.client.stats.counters. The first 
specifies class names whose instances will be executed by 
HadoopJobExecHelper.java (similarly to how hooks are) in its method 
progress(ExecDriverTaskHandle): MapRedStats. The second specifies the list of 
counters that client stats publishers should publish or store. The format of 
this list is up to a specific deployment (it is up to the client stats 
publishers to parse it), yet it is required to use the display names of 
counter groups and counters.

Added the interface ClientStatsPublisher in the org.apache.hadoop.hive.ql.stats 
package, which must be implemented by all client stats publishers.

Added code to progress(ExecDriverTaskHandle): MapRedStats in 
HadoopJobExecHelper.java that puts the counters' values into a Java map and 
then executes the registered client stats publishers, giving them that map and 
the running job's ID. Added two new methods to HadoopJobExecHelper, 
extractAllCounterValues(Counters) and getClientStatsPublishers(), which are 
used by that code.
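
The flow described in this summary might be sketched as below. This is not the patch itself: the interface shape, the "::" key separator, and all class names are assumptions, and the job ID is passed as a String, which is one possible choice.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for the publisher contract: each publisher gets a
// snapshot of counter values plus the ID of the running job.
interface ClientStatsPublisherSketch {
    void run(Map<String, Double> counterValues, String jobId);
}

// Example publisher that simply remembers the latest snapshot per job.
final class InMemoryPublisher implements ClientStatsPublisherSketch {
    final Map<String, Map<String, Double>> latestByJob = new HashMap<>();

    @Override
    public void run(Map<String, Double> counterValues, String jobId) {
        latestByJob.put(jobId, new HashMap<>(counterValues));
    }
}

final class ProgressSketch {
    // Builds a map key from a counter group and a counter.
    // The "::" separator is an assumption made for this sketch.
    static String key(String group, String counter) {
        return group + "::" + counter;
    }

    // Called on each progress poll: hands every registered publisher the
    // current counter snapshot and the job ID.
    static void publish(List<? extends ClientStatsPublisherSketch> publishers,
                        Map<String, Double> counterValues, String jobId) {
        for (ClientStatsPublisherSketch publisher : publishers) {
            publisher.run(counterValues, jobId);
        }
    }
}
```

Because the publishers only see a plain map and an ID, a deployment can store or forward the values however it likes without depending on Hadoop's counter classes.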

Made cosmetic changes in two other classes


This addresses bug HIVE-2446.
https://issues.apache.org/jira/browse/HIVE-2446


Diffs
-

  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1170867 
  trunk/conf/hive-default.xml 1170867 
  trunk/metastore/src/test/org/apache/hadoop/hive/metastore/TestMarkPartition.java 1170867 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/HadoopJobExecHelper.java 1170867 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/history/HiveHistory.java 1170867 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/stats/ClientStatsPublisher.java PRE-CREATION 

Diff: https://reviews.apache.org/r/1909/diff


Testing
---

Ran some random tests; the unit tests are still running.


Thanks,

Robert



 Introduction of client statistics publishers possibility
 

 Key: HIVE-2446
 URL: https://issues.apache.org/jira/browse/HIVE-2446
 Project: Hive
  Issue Type: Improvement
  Components: Clients, Statistics
Reporter: Robert Surówka
Priority: Minor
 Attachments: HIVE-2446.1.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 The purpose of this change is to allow publication or storage of counters 
 while the job is running.
 Introduced two new variables in hive-default.xml and HiveConf.java: 
 hive.client.stats.publishers and hive.client.stats.counters. The first 
 specifies class names whose instances will be executed by 
 HadoopJobExecHelper.java (similarly to how hooks are) in its method 
 progress(ExecDriverTaskHandle): MapRedStats. The second specifies the list of 
 counters that client stats publishers should publish or store. The format of 
 this list is up to a specific deployment (it is up to the client stats 
 publishers to parse it), yet it is required to use the display names of 
 counter groups and counters.
 Added the interface ClientStatsPublisher in the org.apache.hadoop.hive.ql.stats 
 package, which must be implemented by all client stats publishers.
 Added code to progress(ExecDriverTaskHandle): MapRedStats in 
 HadoopJobExecHelper.java that puts the counters' values into a Java map and 
 then executes the registered client stats publishers, giving them that map and 
 the running job's ID. Added two new methods to HadoopJobExecHelper, 
 extractAllCounterValues(Counters) and getClientStatsPublishers(), which are 
 used by that code.
 Made cosmetic changes in two other classes.
  





[jira] [Commented] (HIVE-2398) Hive server doesn't return schema for 'set' command

2011-09-14 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13104986#comment-13104986
 ] 

jirapos...@reviews.apache.org commented on HIVE-2398:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1605/#review1903
---

Ship it!


+1. Will commit if tests pass.

- Carl


On 2011-09-14 19:41:13, Prasad Mujumdar wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1605/
bq.  ---
bq.  
bq.  (Updated 2011-09-14 19:41:13)
bq.  
bq.  
bq.  Review request for hive and Carl Steinbach.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  The Hive server currently has the framework to process CLI commands like 'set', but it returns an empty schema definition for these resultsets. The patch adds a getSchema() method to the CommandResponse interface. The SetProcessor is modified to build a schema object, which is returned by getSchema().
bq.  HiveServer is modified to call getSchema() for the command cases as well.
bq.  
bq.  
bq.  This addresses bug HIVE-2398.
bq.  https://issues.apache.org/jira/browse/HIVE-2398
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.    jdbc/src/test/org/apache/hadoop/hive/jdbc/TestJdbcDriver.java e744d11 
bq.    ql/src/java/org/apache/hadoop/hive/ql/processors/CommandProcessorResponse.java 4f7f472 
bq.    ql/src/java/org/apache/hadoop/hive/ql/processors/SetProcessor.java 9ca5ca4 
bq.    service/src/java/org/apache/hadoop/hive/service/HiveServer.java 95f322a 
bq.  
bq.  Diff: https://reviews.apache.org/r/1605/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Tested with ODBC program to execute the set command. Ran Hive unit tests.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Prasad
bq.  
bq.



 Hive server doesn't return schema for 'set' command
 ---

 Key: HIVE-2398
 URL: https://issues.apache.org/jira/browse/HIVE-2398
 Project: Hive
  Issue Type: Bug
  Components: Server Infrastructure
Affects Versions: 0.7.1, 0.8.0
Reporter: Prasad Mujumdar
Assignee: Prasad Mujumdar
 Attachments: HIVE-2398.patch, HIVE-2398.patch, HIVE-2398.patch.1


 The Hive server processes CLI commands like 'set' and 'set -v' sent by 
 ODBC or JDBC clients, but currently only the data is returned to the client, 
 not the schema for that resultset. This makes the option unusable for an 
 ODBC or JDBC client.





[jira] [Commented] (HIVE-1975) insert overwrite directory Not able to insert data with multi level directory path

2011-09-13 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13103799#comment-13103799
 ] 

jirapos...@reviews.apache.org commented on HIVE-1975:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1829/
---

Review request for hive and Yongqiang He.


Summary
---

Inserting into a multilevel directory such as '/HIVEFT25686/chinna/' 
(insert overwrite directory '/HIVEFT25686/chinna/' select * from dept_j) 
fails because fs.rename(sourcePath, targetPath) fails to rename into 
multilevel directories, so the patch creates the target path first.
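
The same pattern can be shown with the JDK's `java.nio.file` API, used here only as a local-filesystem analogy; the patch itself works against Hadoop's FileSystem, and the helper names below are hypothetical. A move into a nested target fails when the target's parent directories are missing, so the sketch creates them first.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;

final class MoveSketch {
    // Creates any missing parent directories of target, then renames
    // source to target.
    static void moveCreatingParents(Path source, Path target) {
        try {
            Path parent = target.getParent();
            if (parent != null) {
                Files.createDirectories(parent); // no-op if it already exists
            }
            Files.move(source, target);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Small self-check in a temp directory; returns true if the move worked.
    static boolean demo() {
        try {
            Path base = Files.createTempDirectory("move-sketch");
            Path source = Files.createDirectory(base.resolve("staging"));
            Path target = base.resolve("HIVEFT25686").resolve("chinna");
            moveCreatingParents(source, target);
            return Files.isDirectory(target) && !Files.exists(source);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Without the `createDirectories` call, `Files.move` would throw because the intermediate directory does not exist yet.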


This addresses bug HIVE-1975.
https://issues.apache.org/jira/browse/HIVE-1975


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/MoveTask.java 1170044 

Diff: https://reviews.apache.org/r/1829/diff


Testing
---

Ran all testcases


Thanks,

chinna



 insert overwrite directory Not able to insert data with multi level 
 directory path
 

 Key: HIVE-1975
 URL: https://issues.apache.org/jira/browse/HIVE-1975
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.5.0
 Environment: Hadoop 0.20.1, Hive 0.5.0 and SUSE Linux Enterprise 
 Server 10 SP2 (i586) - Kernel 2.6.16.60-0.21-smp (5).
Reporter: Chinna Rao Lalam
Assignee: Chinna Rao Lalam
 Attachments: HIVE-1975.1.patch, HIVE-1975.patch


 Execution of the following query fails.
 Ex:
 {noformat}
insert overwrite directory '/HIVEFT25686/chinna/' select * from dept_j;
 {noformat}





[jira] [Commented] (HIVE-2182) Avoid null pointer exception when executing UDF

2011-09-13 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13103803#comment-13103803
 ] 

jirapos...@reviews.apache.org commented on HIVE-2182:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1759/
---

(Updated 2011-09-13 17:32:55.124065)


Review request for hive and John Sichi.


Changes
---

Patch rebased on trunk


Summary
---

While executing a UDF, if the implementation jar is not present in the MR 
class path, a NullPointerException is thrown. Instead of throwing a 
NullPointerException, the patch throws a meaningful exception with the 
required details.
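
A minimal sketch of that idea follows. It is not the actual GenericUDFBridge change: the helper name, the exception type, and the message wording are assumptions chosen for illustration.

```java
// Hypothetical helper: resolve a UDF implementation class by name and fail
// with a descriptive message instead of letting a null propagate.
final class UdfLoaderSketch {
    static Class<?> resolveUdfClass(String className) {
        try {
            return Class.forName(className);
        } catch (ClassNotFoundException e) {
            // Name the missing class so the user knows which jar to add.
            throw new IllegalStateException(
                "The UDF implementation class '" + className
                    + "' is not present in the class path", e);
        }
    }
}
```

The message tells the user which class is missing, which points directly at the forgotten `add jar` step.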


This addresses bug HIVE-2182.
https://issues.apache.org/jira/browse/HIVE-2182


Diffs (updated)
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFBridge.java 
1170044 
  trunk/ql/src/test/queries/clientnegative/udfnull.q PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/udfnull.q.out PRE-CREATION 
  trunk/ql/src/test/results/compiler/plan/cast1.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/groupby1.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/groupby2.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/groupby3.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/groupby4.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/groupby5.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/groupby6.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/input20.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/input8.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/join2.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/sample1.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/sample2.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/sample3.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/sample4.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/sample5.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/sample6.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/sample7.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/udf1.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/udf4.q.xml 1170044 
  trunk/ql/src/test/results/compiler/plan/udf6.q.xml 1170044 

Diff: https://reviews.apache.org/r/1759/diff


Testing
---

Added unit tests


Thanks,

chinna



 Avoid null pointer exception when executing UDF
 ---

 Key: HIVE-2182
 URL: https://issues.apache.org/jira/browse/HIVE-2182
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.5.0, 0.8.0
 Environment: Hadoop 0.20.1, Hive 0.8.0 and SUSE Linux Enterprise 
 Server 10 SP2 (i586) - Kernel 2.6.16.60-0.21-smp (5)
Reporter: Chinna Rao Lalam
Assignee: Chinna Rao Lalam
 Attachments: HIVE-2182.1.patch, HIVE-2182.2.patch, HIVE-2182.3.patch, 
 HIVE-2182.4.patch, HIVE-2182.5.patch, HIVE-2182.patch


 To use a UDF, the following steps were executed:
 {noformat}
 add jar /home/udf/udf.jar;
 create temporary function grade as 'udf.Grade';
 select m.userid,m.name,grade(m.maths,m.physics,m.chemistry) from marks m;
 {noformat}
 But if we skip the first step (add jar) and execute the remaining steps:
 {noformat}
 create temporary function grade as 'udf.Grade';
 select m.userid,m.name,grade(m.maths,m.physics,m.chemistry) from marks m;
 {noformat}
 the tasktracker throws this exception:
 {noformat}
 Caused by: java.lang.RuntimeException: Map operator initialization failed
    at org.apache.hadoop.hive.ql.exec.ExecMapper.configure(ExecMapper.java:121)
    ... 18 more
 Caused by: java.lang.RuntimeException: java.lang.NullPointerException
    at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:115)
    at org.apache.hadoop.hive.ql.udf.generic.GenericUDFBridge.initialize(GenericUDFBridge.java:126)
    at org.apache.hadoop.hive.ql.exec.ExprNodeGenericFuncEvaluator.initialize(ExprNodeGenericFuncEvaluator.java:133)
    at org.apache.hadoop.hive.ql.exec.Operator.initEvaluators(Operator.java:878)
    at org.apache.hadoop.hive.ql.exec.Operator.initEvaluatorsAndReturnStruct(Operator.java:904)
    at org.apache.hadoop.hive.ql.exec.SelectOperator.initializeOp(SelectOperator.java:60)
    at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:357)
    at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:433)
    at org.apache.hadoop.hive.ql.exec.Operator.initializeChildren(Operator.java:389)
at 
 

[jira] [Commented] (HIVE-2145) NPE during parsing order-by expression

2011-09-12 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13102832#comment-13102832
 ] 

jirapos...@reviews.apache.org commented on HIVE-2145:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1802/
---

Review request for hive and Ning Zhang.


Summary
---

While compiling a query, if the order by clause contains a function, an 
exception is thrown.


This addresses bug HIVE-2145.
https://issues.apache.org/jira/browse/HIVE-2145


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ErrorMsg.java 1167000 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 
1167000 
  trunk/ql/src/test/queries/clientnegative/orderby_function.q PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/orderby_function.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1802/diff


Testing
---

Added unit test case and ran all existing test cases


Thanks,

chinna



 NPE during parsing order-by expression
 --

 Key: HIVE-2145
 URL: https://issues.apache.org/jira/browse/HIVE-2145
 Project: Hive
  Issue Type: Bug
Reporter: Ning Zhang
Assignee: Chinna Rao Lalam
 Attachments: HIVE-2145.patch


 The following query throws an NPE, where it should have thrown a parsing 
 exception. 
 hive> select key, count(1) cnt from src group by key order by count(1) limit 10;
 select key, count(1) cnt from src group by key order by count(1) limit 10;
 FAILED: Hive Internal Error: java.lang.NullPointerException(null)
 java.lang.NullPointerException
    at org.apache.hadoop.hive.ql.plan.ExprNodeGenericFuncDesc.newInstance(ExprNodeGenericFuncDesc.java:153)
    at org.apache.hadoop.hive.ql.parse.TypeCheckProcFactory$DefaultExprProcessor.getXpathOrFuncExprNodeDesc(TypeCheckProcFactory.java:640)
    at org.apache.hadoop.hive.ql.parse.TypeCheckProcFactory$DefaultExprProcessor.process(TypeCheckProcFactory.java:761)
    at org.apache.hadoop.hive.ql.lib.DefaultRuleDispatcher.dispatch(DefaultRuleDispatcher.java:89)
    at org.apache.hadoop.hive.ql.lib.DefaultGraphWalker.dispatch(DefaultGraphWalker.java:88)
    at org.apache.hadoop.hive.ql.lib.DefaultGraphWalker.walk(DefaultGraphWalker.java:125)
    at org.apache.hadoop.hive.ql.lib.DefaultGraphWalker.startWalking(DefaultGraphWalker.java:102)
    at org.apache.hadoop.hive.ql.parse.TypeCheckProcFactory.genExprNode(TypeCheckProcFactory.java:156)
    at org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genExprNodeDesc(SemanticAnalyzer.java:6830)
    at org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genExprNodeDesc(SemanticAnalyzer.java:6788)
    at org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genReduceSinkPlan(SemanticAnalyzer.java:4303)
    at org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genBodyPlan(SemanticAnalyzer.java:5461)
    at org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:6022)
    at org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.analyzeInternal(SemanticAnalyzer.java:6607)
    at org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:238)
    at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:340)
    at org.apache.hadoop.hive.ql.Driver.run(Driver.java:790)
    at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:209)
    at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:286)
    at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:514)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
 The correct syntax should be: 
 hive> select key, count(1) cnt from src group by key order by cnt limit 10;





[jira] [Commented] (HIVE-2398) Hive server doesn't return schema for 'set' command

2011-09-12 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13102909#comment-13102909
 ] 

jirapos...@reviews.apache.org commented on HIVE-2398:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1605/#review1857
---


Please add a testcase. Also, I tried running 'set -v' from the CLI after 
setting hive.cli.print.header=true, and I don't see the column name in the 
output. This seems like a bug?


ql/src/java/org/apache/hadoop/hive/ql/processors/SetProcessor.java
https://reviews.apache.org/r/1605/#comment4245

Checkstyle: static final variable names should appear in ALL CAPS



ql/src/java/org/apache/hadoop/hive/ql/processors/SetProcessor.java
https://reviews.apache.org/r/1605/#comment4248

bld? Might be good to change the name to getSchema. Also, since setColName 
is a constant there's no need for this method to have an input parameter.
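
A minimal sketch of what the two comments above ask for, using stand-in types rather than Hive's own schema classes. The class name, the Column type, and the values "set" and "string" are assumptions made for illustration; they are not taken from the patch.

```java
import java.util.Collections;
import java.util.List;

// Stand-in types for illustration only; the real patch uses Hive's schema classes.
final class SetSchemaSketch {
    /** A column description: a name and a type name. */
    static final class Column {
        final String name;
        final String type;

        Column(String name, String type) {
            this.name = name;
            this.type = type;
        }
    }

    // Static final fields are written in ALL CAPS, as Checkstyle expects.
    // Both values are assumed for this sketch.
    private static final String SET_COLUMN_NAME = "set";
    private static final String STRING_TYPE_NAME = "string";

    // No input parameter is needed because the column name is a constant.
    static List<Column> getSchema() {
        return Collections.singletonList(new Column(SET_COLUMN_NAME, STRING_TYPE_NAME));
    }
}
```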



ql/src/java/org/apache/hadoop/hive/ql/processors/SetProcessor.java
https://reviews.apache.org/r/1605/#comment4247

Please add static imports for STRING_TYPE_NAME, SERIALIZATION_NULL_FORMAT, 
and defaultNullString.



service/src/java/org/apache/hadoop/hive/service/HiveServer.java
https://reviews.apache.org/r/1605/#comment4249

This comment is no longer valid. Also, it might be cleaner to remove the 
else clause.


- Carl


On 2011-08-19 22:37:45, Prasad Mujumdar wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1605/
bq.  ---
bq.  
bq.  (Updated 2011-08-19 22:37:45)
bq.  
bq.  
bq.  Review request for hive and Carl Steinbach.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  The Hive server already has the framework to process CLI commands 
like 'set', but it returns an empty schema definition for their result sets. The 
patch adds a getSchema() method to the CommandResponse interface. The 
SetProcessor is modified to build a schema object, which is returned by 
getSchema().
bq.  HiveServer is modified to call getSchema() for command cases as well.
bq.  
bq.  
bq.  This addresses bug HIVE-2398.
bq.  https://issues.apache.org/jira/browse/HIVE-2398
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.    ql/src/java/org/apache/hadoop/hive/ql/processors/CommandProcessorResponse.java 4f7f472 
bq.    ql/src/java/org/apache/hadoop/hive/ql/processors/SetProcessor.java 97fa1ab 
bq.    service/src/java/org/apache/hadoop/hive/service/HiveServer.java c4b041f 
bq.  
bq.  Diff: https://reviews.apache.org/r/1605/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Tested with ODBC program to execute the set command. Ran Hive unit tests.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Prasad
bq.  
bq.



 Hive server doesn't return schema for 'set' command
 ---

 Key: HIVE-2398
 URL: https://issues.apache.org/jira/browse/HIVE-2398
 Project: Hive
  Issue Type: Bug
  Components: Server Infrastructure
Affects Versions: 0.7.1, 0.8.0
Reporter: Prasad Mujumdar
Assignee: Prasad Mujumdar
 Attachments: HIVE-2398.patch


 The Hive server does process CLI commands such as 'set' and 'set -v' sent by 
 ODBC or JDBC clients. However, only the data is currently returned to the 
 client, not the schema for that result set. This makes the commands unusable 
 from an ODBC or JDBC client.





[jira] [Commented] (HIVE-2440) make hive mapper initialize faster when having tons of input files

2011-09-12 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102935#comment-13102935
 ] 

jirapos...@reviews.apache.org commented on HIVE-2440:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1813/
---

Review request for hive and Ning Zhang.


Summary
---

make hive mapper initialize faster when having tons of input files


This addresses bug hive-2440.
https://issues.apache.org/jira/browse/hive-2440


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/MapOperator.java 1167289 

Diff: https://reviews.apache.org/r/1813/diff


Testing
---


Thanks,

Yongqiang



 make hive mapper initialize faster when having tons of input files
 --

 Key: HIVE-2440
 URL: https://issues.apache.org/jira/browse/HIVE-2440
 Project: Hive
  Issue Type: Improvement
Reporter: He Yongqiang
Assignee: He Yongqiang
 Attachments: HIVE-2440.1.patch, HIVE-2440.2.patch


 When a Hive job has tons of input files, a lot of mappers may fail because 
 of slow initialization.





[jira] [Commented] (HIVE-2440) make hive mapper initialize faster when having tons of input files

2011-09-12 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102950#comment-13102950
 ] 

jirapos...@reviews.apache.org commented on HIVE-2440:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1813/#review1859
---



trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/MapOperator.java
https://reviews.apache.org/r/1813/#comment4250

Do you need childrenPaths? Paths are only ever added to it, but nothing 
reads it. 


- Ning


On 2011-09-12 19:15:54, Yongqiang He wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1813/
bq.  ---
bq.  
bq.  (Updated 2011-09-12 19:15:54)
bq.  
bq.  
bq.  Review request for hive and Ning Zhang.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  make hive mapper initialize faster when having tons of input files
bq.  
bq.  
bq.  This addresses bug hive-2440.
bq.  https://issues.apache.org/jira/browse/hive-2440
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.    trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/MapOperator.java 1167289 
bq.  
bq.  Diff: https://reviews.apache.org/r/1813/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Yongqiang
bq.  
bq.



 make hive mapper initialize faster when having tons of input files
 --

 Key: HIVE-2440
 URL: https://issues.apache.org/jira/browse/HIVE-2440
 Project: Hive
  Issue Type: Improvement
Reporter: He Yongqiang
Assignee: He Yongqiang
 Attachments: HIVE-2440.1.patch, HIVE-2440.2.patch


 When a Hive job has tons of input files, a lot of mappers may fail because 
 of slow initialization.





[jira] [Commented] (HIVE-198) Parse errors report incorrectly.

2011-09-12 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13103261#comment-13103261
 ] 

jirapos...@reviews.apache.org commented on HIVE-198:



---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1818/
---

Review request for hive.


Summary
---

Modified the message to include the actual token it fails on (when available).
It should make it easier to grok the actual problem from the error.
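
As a rough illustration of the idea (not the code in ParseDriver.java), the sketch below appends the offending token to a parse error message when the token text is known. The class name, method signature, and the "near" wording are all assumptions made for this example.

```java
// Hypothetical helper; it only illustrates adding the failing token to the message.
final class ParseErrorMessageSketch {
    // Builds "line L:C <detail>", followed by " near '<token>'" when a token is available.
    static String format(int line, int column, String detail, String tokenText) {
        StringBuilder sb = new StringBuilder();
        sb.append("line ").append(line).append(':').append(column)
          .append(' ').append(detail);
        if (tokenText != null) {
            // The token is optional: some failures have no usable token text.
            sb.append(" near '").append(tokenText).append('\'');
        }
        return sb.toString();
    }
}
```

For the first failing query in the issue description, a call such as format(1, 32, "parse error", ",") would point the reader at the stray comma instead of at 'TABLE'.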


This addresses bug HIVE-198.
https://issues.apache.org/jira/browse/HIVE-198


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ParseDriver.java 1164307 
  trunk/ql/src/test/results/clientnegative/column_rename3.q.out 1164307 
  trunk/ql/src/test/results/clientnegative/invalid_select_expression.q.out 1164307 
  trunk/ql/src/test/results/clientnegative/invalid_tbl_name.q.out 1164307 
  trunk/ql/src/test/results/compiler/errors/garbage.q.out 1164307 
  trunk/ql/src/test/results/compiler/errors/missing_overwrite.q.out 1164307 
  trunk/ql/src/test/results/compiler/errors/wrong_distinct2.q.out 1164307 

Diff: https://reviews.apache.org/r/1818/diff


Testing
---

ant test -Dtestcase=TestParseNegative


Thanks,

avivey



 Parse errors report incorrectly.
 

 Key: HIVE-198
 URL: https://issues.apache.org/jira/browse/HIVE-198
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: S. Alex Smith
Assignee: Aviv Eyal
  Labels: parse
 Attachments: HIVE-198.2.patch.txt, PraseErrorMessage.patch


 The following two queries fail:
 CREATE TABLE output_table(userid, bigint);
 CREATE TABLE output_table(userid bigint, age int, sex string, location 
 string);
 each giving the error message FAILED: Parse Error: line 1:16 mismatched 
 input 'TABLE' expecting KW_TEMPORARY
 Although one might not catch it from the error message, the problem with the 
 first is that there is a comma between userid and bigint, and the problem 
 with the second is that location is a reserved keyword.  Reported errors 
 should more accurately describe the nature of the error, such as no type 
 given for column 'userid' or 'location' is not a valid column name.





[jira] [Commented] (HIVE-198) Parse errors report incorrectly.

2011-09-12 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13103264#comment-13103264
 ] 

jirapos...@reviews.apache.org commented on HIVE-198:



---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1818/
---

(Updated 2011-09-13 01:39:58.571166)


Review request for hive.


Summary (updated)
---

Modified the ParseException message to include the actual token it fails on 
(when available).
It should make it easier to grok the actual problem from the error.


This addresses bug HIVE-198.
https://issues.apache.org/jira/browse/HIVE-198


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ParseDriver.java 1164307 
  trunk/ql/src/test/results/clientnegative/column_rename3.q.out 1164307 
  trunk/ql/src/test/results/clientnegative/invalid_select_expression.q.out 1164307 
  trunk/ql/src/test/results/clientnegative/invalid_tbl_name.q.out 1164307 
  trunk/ql/src/test/results/compiler/errors/garbage.q.out 1164307 
  trunk/ql/src/test/results/compiler/errors/missing_overwrite.q.out 1164307 
  trunk/ql/src/test/results/compiler/errors/wrong_distinct2.q.out 1164307 

Diff: https://reviews.apache.org/r/1818/diff


Testing
---

ant test -Dtestcase=TestParseNegative


Thanks,

avivey



 Parse errors report incorrectly.
 

 Key: HIVE-198
 URL: https://issues.apache.org/jira/browse/HIVE-198
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: S. Alex Smith
Assignee: Aviv Eyal
  Labels: parse
 Attachments: HIVE-198.2.patch.txt, PraseErrorMessage.patch


 The following two queries fail:
 CREATE TABLE output_table(userid, bigint);
 CREATE TABLE output_table(userid bigint, age int, sex string, location 
 string);
 each giving the error message FAILED: Parse Error: line 1:16 mismatched 
 input 'TABLE' expecting KW_TEMPORARY
 Although one might not catch it from the error message, the problem with the 
 first is that there is a comma between userid and bigint, and the problem 
 with the second is that location is a reserved keyword.  Reported errors 
 should more accurately describe the nature of the error, such as no type 
 given for column 'userid' or 'location' is not a valid column name.





[jira] [Commented] (HIVE-1694) Accelerate GROUP BY execution using indexes

2011-09-10 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102149#comment-13102149
 ] 

jirapos...@reviews.apache.org commented on HIVE-1694:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1194/
---

(Updated 2011-09-10 21:10:06.178279)


Review request for hive and John Sichi.


Changes
---

Added order-by to queries for test determinism.


Summary
---

This patch has defined a new AggregateIndexHandler which is used to optimize 
the query plan for groupby queries. 
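
As a rough picture of why an aggregate index can help GROUP BY queries, the sketch below answers a per-key count from pre-aggregated entries instead of rescanning the base rows. This is not the patch's mechanism, which the summary does not describe; the class and method names are hypothetical, and the "index" is just an in-memory map.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Conceptual sketch only: a per-key count kept ahead of time stands in for the index.
final class AggregateIndexSketch {
    // Full scan: count rows per key by reading every base-table row.
    static Map<String, Long> countByScan(List<String> keyColumn) {
        Map<String, Long> counts = new TreeMap<>();
        for (String key : keyColumn) {
            counts.merge(key, 1L, Long::sum);
        }
        return counts;
    }

    // Index build: performed once, when the index is (re)built.
    static Map<String, Long> buildIndex(List<String> keyColumn) {
        return countByScan(keyColumn);
    }

    // Index-backed answer: one stored entry per distinct key, no base-table scan.
    static Map<String, Long> countFromIndex(Map<String, Long> index) {
        return new TreeMap<>(index);
    }
}
```

Both paths return the same result for a query of the form "select key, count(key) from t group by key"; the index-backed path only reads one entry per distinct key.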


This addresses bug HIVE-1694.
https://issues.apache.org/jira/browse/HIVE-1694


Diffs (updated)
-

  common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 66ee0be 
  data/files/lineitem.txt PRE-CREATION 
  data/files/tbl.txt PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/index/AggregateIndexHandler.java PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/index/HiveIndex.java 591c9ff 
  ql/src/java/org/apache/hadoop/hive/ql/index/bitmap/BitmapIndexHandler.java 5053576 
  ql/src/java/org/apache/hadoop/hive/ql/index/compact/CompactIndexHandler.java 7a00c00 
  ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java bec8787 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/IndexUtils.java PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/Optimizer.java 590d69a 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyCtx.java PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyProcFactory.java PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteGBUsingIndex.java PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteParseContextGenerator.java PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteQueryUsingAggregateIndex.java PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteQueryUsingAggregateIndexCtx.java PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/index/IndexWhereProcessor.java dcdfb9e 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/index/IndexWhereTaskDispatcher.java 699519b 
  ql/src/test/queries/clientpositive/ql_rewrite_gbtoidx.q PRE-CREATION 
  ql/src/test/results/clientpositive/ql_rewrite_gbtoidx.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1194/diff


Testing
---


Thanks,

Prajakta



 Accelerate GROUP BY execution using indexes
 ---

 Key: HIVE-1694
 URL: https://issues.apache.org/jira/browse/HIVE-1694
 Project: Hive
  Issue Type: New Feature
  Components: Indexing, Query Processor
Affects Versions: 0.7.0
Reporter: Nikhil Deshpande
Assignee: Prajakta Kalmegh
 Attachments: HIVE-1694.1.patch.txt, HIVE-1694.2.patch.txt, 
 HIVE-1694.3.patch.txt, HIVE-1694.4.patch, HIVE-1694.5.patch, 
 HIVE-1694.6.patch, HIVE-1694.7.patch, HIVE-1694_2010-10-28.diff, demo_q1.hql, 
 demo_q2.hql


 The index building patch (Hive-417) is checked into trunk, this JIRA issue 
 tracks supporting indexes in Hive compiler & execution engine for SELECT 
 queries.
 This is in ref. to John's comment at
 https://issues.apache.org/jira/browse/HIVE-417?focusedCommentId=12884869&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#action_12884869
 on creating separate JIRA issue for tracking index usage in optimizer & query 
 execution.
 The aim of this effort is to use indexes to accelerate query execution (for 
 certain class of queries). E.g.
 - Filters and range scans (already being worked on by He Yongqiang as part of 
 HIVE-417?)
 - Joins (index based joins)
 - Group By, Order By and other misc cases
 The proposal is multi-step:
 1. Building index based operators, compiler and execution engine changes
 2. Optimizer enhancements (e.g. cost-based optimizer to compare and choose 
 between index scans, full table scans etc.)
 This JIRA initially focuses on the first step. This JIRA is expected to hold 
 the information about index based plans & operator implementations for above 
 mentioned cases. 





[jira] [Commented] (HIVE-2223) support grouping on complex types in Hive

2011-09-09 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13101212#comment-13101212
 ] 

jirapos...@reviews.apache.org commented on HIVE-2223:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1745/
---

(Updated 2011-09-09 13:50:30.594056)


Review request for hive.


Summary
---

Adds hash codes for List and Map object inspectors.
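
A minimal sketch of one way to hash list and map values so that they can serve as grouping keys. It works on plain java.util collections rather than Hive's object inspectors, and the combining rules (order-sensitive for lists, order-insensitive for maps) are an assumption chosen for illustration, not a statement of what the patch does.

```java
import java.util.List;
import java.util.Map;

// Illustrative only: operates on plain Java collections, not ObjectInspectors.
final class ComplexTypeHashSketch {
    static int hash(Object o) {
        if (o == null) {
            return 0;
        }
        if (o instanceof List) {
            // Order-sensitive: fold element hashes left to right.
            int r = 0;
            for (Object element : (List<?>) o) {
                r = 31 * r + hash(element);
            }
            return r;
        }
        if (o instanceof Map) {
            // Order-insensitive: sum per-entry hashes so iteration order does not matter.
            int r = 0;
            for (Map.Entry<?, ?> entry : ((Map<?, ?>) o).entrySet()) {
                r += hash(entry.getKey()) ^ hash(entry.getValue());
            }
            return r;
        }
        // Primitive-like values fall back to their own hashCode.
        return o.hashCode();
    }
}
```

The recursion means nested values such as a list of maps hash consistently, which is the property a GROUP BY on a complex column needs: equal values must produce equal hash codes.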


This addresses bug HIVE-2223.
https://issues.apache.org/jira/browse/HIVE-2223


Diffs
-


Diff: https://reviews.apache.org/r/1745/diff


Testing
---

Added unittest.


Thanks,

Jonathan



 support grouping on complex types in Hive
 -

 Key: HIVE-2223
 URL: https://issues.apache.org/jira/browse/HIVE-2223
 Project: Hive
  Issue Type: New Feature
Reporter: Kate Ting
Assignee: Jonathan Chang
Priority: Minor
 Attachments: HIVE-2223.patch


 Creating a query with a GROUP BY statement when an array type column is part 
 of the column list is not yet supported:
 CREATE TABLE test_group_by ( key INT, group INT, terms ARRAY<STRING>);
 SELECT key, terms, count(group) FROM test_group_by GROUP BY key, terms;
 ...
 Hash code on complex types not supported yet.
 java.lang.RuntimeException: Error while closing operators
     at org.apache.hadoop.hive.ql.exec.ExecMapper.close(ExecMapper.java:232)
     at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:57)
     at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:356)
     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
     at org.apache.hadoop.mapred.Child.main(Child.java:170)
 Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: java.lang.RuntimeException: Hash code on complex types not supported yet.
     at org.apache.hadoop.hive.ql.exec.GroupByOperator.closeOp(GroupByOperator.java:799)
     at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:462)
     at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:470)
     at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:470)
     at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:470)
     at org.apache.hadoop.hive.ql.exec.ExecMapper.close(ExecMapper.java:211)
     ... 4 more
 Caused by: java.lang.RuntimeException: Hash code on complex types not supported yet.
     at org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorUtils.hashCode(ObjectInspectorUtils.java:348)
     at org.apache.hadoop.hive.ql.exec.ReduceSinkOperator.processOp(ReduceSinkOperator.java:187)
     at org.apache.hadoop.hive.ql.exec.Operator.process(Operator.java:386)
     at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:598)
     at org.apache.hadoop.hive.ql.exec.GroupByOperator.forward(GroupByOperator.java:746)
     at org.apache.hadoop.hive.ql.exec.GroupByOperator.closeOp(GroupByOperator.java:780)
     ... 9 more





[jira] [Commented] (HIVE-2182) Avoid null pointer exception when executing UDF

2011-09-09 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13101405#comment-13101405
 ] 

jirapos...@reviews.apache.org commented on HIVE-2182:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1759/
---

(Updated 2011-09-09 18:08:07.528830)


Review request for hive and John Sichi.


Summary
---

While executing a UDF, if the implementation jar is not present in the MR class 
path, a NullPointerException is thrown. Instead of the NullPointerException, 
a meaningful exception with the required details should be thrown.
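
A minimal sketch of the general idea, assuming the UDF class is looked up by name before it is instantiated. The class name, method name, exception type, and message text are all hypothetical; the actual change is in GenericUDFBridge.java and may look quite different.

```java
// Hypothetical helper; it only shows failing with a descriptive message
// instead of letting a null class reference surface as a NullPointerException.
final class UdfInstantiationSketch {
    static Object newUdfInstance(String udfClassName) {
        Class<?> udfClass;
        try {
            udfClass = Class.forName(udfClassName);
        } catch (ClassNotFoundException e) {
            // Name the missing class so the user knows which jar to add.
            throw new IllegalStateException(
                "The UDF implementation class '" + udfClassName
                    + "' is not present in the class path", e);
        }
        try {
            return udfClass.getDeclaredConstructor().newInstance();
        } catch (ReflectiveOperationException e) {
            throw new IllegalStateException(
                "Unable to instantiate UDF class '" + udfClassName + "'", e);
        }
    }
}
```

With a check like this, forgetting "add jar" for 'udf.Grade' would produce an error that names the missing class rather than an NPE from inside ReflectionUtils.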


This addresses bug HIVE-2182.
https://issues.apache.org/jira/browse/HIVE-2182


Diffs (updated)
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFBridge.java 1167000 
  trunk/ql/src/test/queries/clientnegative/udfnull.q PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/udfnull.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1759/diff


Testing
---

Added unit tests


Thanks,

chinna



 Avoid null pointer exception when executing UDF
 ---

 Key: HIVE-2182
 URL: https://issues.apache.org/jira/browse/HIVE-2182
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.5.0, 0.8.0
 Environment: Hadoop 0.20.1, Hive0.8.0 and SUSE Linux Enterprise 
 Server 10 SP2 (i586) - Kernel 2.6.16.60-0.21-smp (5)
Reporter: Chinna Rao Lalam
Assignee: Chinna Rao Lalam
 Attachments: HIVE-2182.1.patch, HIVE-2182.2.patch, HIVE-2182.3.patch, 
 HIVE-2182.patch


 To use a UDF, the following steps were executed:
 {noformat}
 add jar /home/udf/udf.jar;
 create temporary function grade as 'udf.Grade';
 select m.userid,m.name,grade(m.maths,m.physics,m.chemistry) from marks m;
 {noformat}
 However, if the first step (add jar) is skipped and only the remaining steps 
 are executed:
 {noformat}
 create temporary function grade as 'udf.Grade';
 select m.userid,m.name,grade(m.maths,m.physics,m.chemistry) from marks m;
 {noformat}
 The tasktracker then throws this exception:
 {noformat}
 Caused by: java.lang.RuntimeException: Map operator initialization failed
     at org.apache.hadoop.hive.ql.exec.ExecMapper.configure(ExecMapper.java:121)
     ... 18 more
 Caused by: java.lang.RuntimeException: java.lang.NullPointerException
     at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:115)
     at org.apache.hadoop.hive.ql.udf.generic.GenericUDFBridge.initialize(GenericUDFBridge.java:126)
     at org.apache.hadoop.hive.ql.exec.ExprNodeGenericFuncEvaluator.initialize(ExprNodeGenericFuncEvaluator.java:133)
     at org.apache.hadoop.hive.ql.exec.Operator.initEvaluators(Operator.java:878)
     at org.apache.hadoop.hive.ql.exec.Operator.initEvaluatorsAndReturnStruct(Operator.java:904)
     at org.apache.hadoop.hive.ql.exec.SelectOperator.initializeOp(SelectOperator.java:60)
     at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:357)
     at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:433)
     at org.apache.hadoop.hive.ql.exec.Operator.initializeChildren(Operator.java:389)
     at org.apache.hadoop.hive.ql.exec.TableScanOperator.initializeOp(TableScanOperator.java:133)
     at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:357)
     at org.apache.hadoop.hive.ql.exec.MapOperator.initializeOp(MapOperator.java:444)
     at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:357)
     at org.apache.hadoop.hive.ql.exec.ExecMapper.configure(ExecMapper.java:98)
     ... 18 more
 Caused by: java.lang.NullPointerException
     at java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:768)
     at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:107)
     ... 31 more
 {noformat}
 Instead of a NullPointerException, it should throw a meaningful exception.





[jira] [Commented] (HIVE-2440) make hive mapper initialize faster when having tons of input files

2011-09-09 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13101563#comment-13101563
 ] 

jirapos...@reviews.apache.org commented on HIVE-2440:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1783/
---

Review request for hive and Ning Zhang.


Summary
---

When a Hive job has tons of input files, a lot of mappers may fail because of 
slow initialization.


This addresses bug HIVE-2440.
https://issues.apache.org/jira/browse/HIVE-2440


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/MapOperator.java 1167289 

Diff: https://reviews.apache.org/r/1783/diff


Testing
---


Thanks,

Yongqiang



 make hive mapper initialize faster when having tons of input files
 --

 Key: HIVE-2440
 URL: https://issues.apache.org/jira/browse/HIVE-2440
 Project: Hive
  Issue Type: Improvement
Reporter: He Yongqiang
Assignee: He Yongqiang
 Attachments: HIVE-2440.1.patch


 When a Hive job has tons of input files, a lot of mappers may fail because 
 of slow initialization.





[jira] [Commented] (HIVE-2402) Function like with empty string is throwing null pointer exception

2011-09-08 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100495#comment-13100495
 ] 

jirapos...@reviews.apache.org commented on HIVE-2402:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1757/
---

Review request for hive and John Sichi.


Summary
---

By default the pattern type is complex, so a Pattern object is expected, but in 
this scenario the Pattern object is null, which causes the NullPointerException. 
The default pattern type can instead be NONE; parseSimplePattern() then assigns 
the right type.
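
A minimal sketch of the classification idea described above, assuming a simplified LIKE that only treats a leading or trailing '%' as "simple". NONE and the complex type come from the summary; the BEGIN, END, and MIDDLE names, the class name, and both methods are assumptions for illustration and are not UDFLike's code.

```java
// Illustrative only: classifies a LIKE pattern and matches the simple cases
// without a compiled regex, so an empty pattern never touches a null Pattern.
final class LikePatternSketch {
    enum PatternType { NONE, BEGIN, END, MIDDLE, COMPLEX }

    static PatternType classify(String like) {
        // '_' and escapes need a real regex in this sketch.
        if (like.indexOf('_') >= 0 || like.indexOf('\\') >= 0) {
            return PatternType.COMPLEX;
        }
        int len = like.length();
        boolean lead = len > 0 && like.charAt(0) == '%';
        boolean trail = len > 1 && like.charAt(len - 1) == '%';
        String middle = like.substring(lead ? 1 : 0, trail ? len - 1 : len);
        if (middle.indexOf('%') >= 0) {
            return PatternType.COMPLEX;
        }
        if (lead && trail) {
            return PatternType.MIDDLE;  // %abc%
        }
        if (lead) {
            return PatternType.END;     // %abc
        }
        if (trail) {
            return PatternType.BEGIN;   // abc%
        }
        return PatternType.NONE;        // plain literal, including the empty string
    }

    static boolean matches(String s, String like) {
        switch (classify(like)) {
            case NONE:
                return s.equals(like);
            case BEGIN:
                return s.startsWith(like.substring(0, like.length() - 1));
            case END:
                return s.endsWith(like.substring(1));
            case MIDDLE:
                return s.contains(like.substring(1, like.length() - 1));
            default:
                throw new UnsupportedOperationException(
                    "complex patterns need a compiled regex");
        }
    }
}
```

Here an empty pattern classifies as NONE and is compared by plain equality, so a query such as "where ename like ''" matches only empty strings instead of failing.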


This addresses bug HIVE-2402.
https://issues.apache.org/jira/browse/HIVE-2402


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFLike.java 1165244 
  trunk/ql/src/test/queries/clientpositive/udf_like.q 1165244 
  trunk/ql/src/test/results/clientpositive/udf_like.q.out 1165244 

Diff: https://reviews.apache.org/r/1757/diff


Testing
---

Added unit testcase.


Thanks,

chinna



 Function like with empty string is throwing null pointer exception
 --

 Key: HIVE-2402
 URL: https://issues.apache.org/jira/browse/HIVE-2402
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.8.0
 Environment: Hadoop 0.20.1, Hive0.8.0 and SUSE Linux Enterprise 
 Server 10 SP2 (i586) - Kernel 2.6.16.60-0.21-smp (5)
Reporter: Chinna Rao Lalam
Assignee: Chinna Rao Lalam
 Attachments: HIVE-2402.1.patch, HIVE-2402.patch


 select emp.ename from emp where ename like ''
 This query throws a NullPointerException.





[jira] [Commented] (HIVE-2182) Avoid null pointer exception when executing UDF

2011-09-08 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100631#comment-13100631
 ] 

jirapos...@reviews.apache.org commented on HIVE-2182:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1759/
---

Review request for hive and John Sichi.


Summary
---

While executing a UDF, if the implementation jar is not present in the MR class 
path, a NullPointerException is thrown. Instead of the NullPointerException, 
a meaningful exception with the required details should be thrown.


This addresses bug HIVE-2182.
https://issues.apache.org/jira/browse/HIVE-2182


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFBridge.java 1166865 
  trunk/ql/src/test/queries/clientnegative/udfnull.q PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/udfnull.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1759/diff


Testing
---

Added unit tests


Thanks,

chinna



 Avoid null pointer exception when executing UDF
 ---

 Key: HIVE-2182
 URL: https://issues.apache.org/jira/browse/HIVE-2182
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.5.0, 0.8.0
 Environment: Hadoop 0.20.1, Hive0.8.0 and SUSE Linux Enterprise 
 Server 10 SP2 (i586) - Kernel 2.6.16.60-0.21-smp (5)
Reporter: Chinna Rao Lalam
Assignee: Chinna Rao Lalam
 Attachments: HIVE-2182.1.patch, HIVE-2182.2.patch, HIVE-2182.patch


 To use a UDF, the following steps were executed:
 {noformat}
 add jar /home/udf/udf.jar;
 create temporary function grade as 'udf.Grade';
 select m.userid,m.name,grade(m.maths,m.physics,m.chemistry) from marks m;
 {noformat}
 However, if the first step (add jar) is skipped and only the remaining steps 
 are executed:
 {noformat}
 create temporary function grade as 'udf.Grade';
 select m.userid,m.name,grade(m.maths,m.physics,m.chemistry) from marks m;
 {noformat}
 The tasktracker then throws this exception:
 {noformat}
 Caused by: java.lang.RuntimeException: Map operator initialization failed
     at org.apache.hadoop.hive.ql.exec.ExecMapper.configure(ExecMapper.java:121)
     ... 18 more
 Caused by: java.lang.RuntimeException: java.lang.NullPointerException
     at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:115)
     at org.apache.hadoop.hive.ql.udf.generic.GenericUDFBridge.initialize(GenericUDFBridge.java:126)
     at org.apache.hadoop.hive.ql.exec.ExprNodeGenericFuncEvaluator.initialize(ExprNodeGenericFuncEvaluator.java:133)
     at org.apache.hadoop.hive.ql.exec.Operator.initEvaluators(Operator.java:878)
     at org.apache.hadoop.hive.ql.exec.Operator.initEvaluatorsAndReturnStruct(Operator.java:904)
     at org.apache.hadoop.hive.ql.exec.SelectOperator.initializeOp(SelectOperator.java:60)
     at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:357)
     at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:433)
     at org.apache.hadoop.hive.ql.exec.Operator.initializeChildren(Operator.java:389)
     at org.apache.hadoop.hive.ql.exec.TableScanOperator.initializeOp(TableScanOperator.java:133)
     at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:357)
     at org.apache.hadoop.hive.ql.exec.MapOperator.initializeOp(MapOperator.java:444)
     at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:357)
     at org.apache.hadoop.hive.ql.exec.ExecMapper.configure(ExecMapper.java:98)
     ... 18 more
 Caused by: java.lang.NullPointerException
     at java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:768)
     at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:107)
     ... 31 more
 {noformat}
 Instead of a NullPointerException, it should throw a meaningful exception.





[jira] [Commented] (HIVE-1694) Accelerate GROUP BY execution using indexes

2011-09-08 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100839#comment-13100839
 ] 

jirapos...@reviews.apache.org commented on HIVE-1694:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1194/
---

(Updated 2011-09-09 01:14:16.218940)


Review request for hive and John Sichi.


Summary
---

This patch has defined a new AggregateIndexHandler which is used to optimize 
the query plan for groupby queries. 


This addresses bug HIVE-1694.
https://issues.apache.org/jira/browse/HIVE-1694


Diffs (updated)
-

  ql/src/test/results/clientpositive/ql_rewrite_gbtoidx.q.out PRE-CREATION 
  ql/src/test/queries/clientpositive/ql_rewrite_gbtoidx.q PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/index/IndexWhereTaskDispatcher.java 699519b 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/index/IndexWhereProcessor.java dcdfb9e 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteQueryUsingAggregateIndexCtx.java PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteQueryUsingAggregateIndex.java PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteParseContextGenerator.java PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteGBUsingIndex.java PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyProcFactory.java PRE-CREATION 
  common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 66ee0be 
  data/files/lineitem.txt PRE-CREATION 
  data/files/tbl.txt PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/index/AggregateIndexHandler.java PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/index/HiveIndex.java 591c9ff 
  ql/src/java/org/apache/hadoop/hive/ql/index/bitmap/BitmapIndexHandler.java 5053576 
  ql/src/java/org/apache/hadoop/hive/ql/index/compact/CompactIndexHandler.java 7a00c00 
  ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java bec8787 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/IndexUtils.java PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/Optimizer.java 590d69a 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyCtx.java PRE-CREATION 

Diff: https://reviews.apache.org/r/1194/diff


Testing
---


Thanks,

Prajakta



 Accelerate GROUP BY execution using indexes
 ---

 Key: HIVE-1694
 URL: https://issues.apache.org/jira/browse/HIVE-1694
 Project: Hive
  Issue Type: New Feature
  Components: Indexing, Query Processor
Affects Versions: 0.7.0
Reporter: Nikhil Deshpande
Assignee: Prajakta Kalmegh
 Attachments: HIVE-1694.1.patch.txt, HIVE-1694.2.patch.txt, 
 HIVE-1694.3.patch.txt, HIVE-1694.4.patch, HIVE-1694.5.patch, 
 HIVE-1694_2010-10-28.diff, demo_q1.hql, demo_q2.hql


 The index building patch (HIVE-417) is checked into trunk; this JIRA issue 
 tracks supporting indexes in the Hive compiler & execution engine for SELECT 
 queries.
 This is in ref. to John's comment at
 https://issues.apache.org/jira/browse/HIVE-417?focusedCommentId=12884869&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#action_12884869
 on creating separate JIRA issue for tracking index usage in optimizer & query 
 execution.
 The aim of this effort is to use indexes to accelerate query execution (for 
 certain class of queries). E.g.
 - Filters and range scans (already being worked on by He Yongqiang as part of 
 HIVE-417?)
 - Joins (index based joins)
 - Group By, Order By and other misc cases
 The proposal is multi-step:
 1. Building index based operators, compiler and execution engine changes
 2. Optimizer enhancements (e.g. cost-based optimizer to compare and choose 
 between index scans, full table scans etc.)
 This JIRA initially focuses on the first step. This JIRA is expected to hold 
 the information about index based plans & operator implementations for above 
 mentioned cases. 





[jira] [Commented] (HIVE-2412) Update Eclipse configuration to include Mockito dependency

2011-09-07 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13098750#comment-13098750
 ] 

jirapos...@reviews.apache.org commented on HIVE-2412:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1665/#review1782
---

Ship it!


+1

- Amareshwari


On 2011-08-26 02:12:54, Carl Steinbach wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1665/
bq.  ---
bq.  
bq.  (Updated 2011-08-26 02:12:54)
bq.  
bq.  
bq.  Review request for hive and Amareshwari Sriramadasu.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  This patch does two things:
bq.  
bq.  * Fixes the eclipse project generation code by including mockito
bq.on the classpath
bq.  
bq.  * Updates the HiveCli launch configuration template so that the
bq.metastore JAR is no longer on the classpath, and also moves the
bq.hive-exec JAR to the end of the classpath. This was done in
bq.order to give preference on the classpath to local changes,
bq.which up to this point have been obscured by hive-exec and
bq.hive-metastore JAR files located in the build directory. Note
bq.that in order to run the HiveCLI launchtemplate you now need to
bq.have the Datanucleus Eclipse Plugin installed, and
bq.must enhance the project.
bq.  
bq.  
bq.  This addresses bug HIVE-2412.
bq.  https://issues.apache.org/jira/browse/HIVE-2412
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.eclipse-templates/.classpath b59b343 
bq.eclipse-templates/HiveCLI.launchtemplate 0fa86cb 
bq.  
bq.  Diff: https://reviews.apache.org/r/1665/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Verified that project generation and import works. Also verified that 
bq.  HiveCLI launch template works.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Carl
bq.  
bq.



 Update Eclipse configuration to include Mockito dependency
 --

 Key: HIVE-2412
 URL: https://issues.apache.org/jira/browse/HIVE-2412
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Reporter: Carl Steinbach
Assignee: Carl Steinbach
 Fix For: 0.9.0

 Attachments: HIVE-2412.1.patch.txt








[jira] [Commented] (HIVE-1694) Accelerate GROUP BY execution using indexes

2011-09-07 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13098986#comment-13098986
 ] 

jirapos...@reviews.apache.org commented on HIVE-1694:
-



bq.  On 2011-08-05 21:20:21, John Sichi wrote:
bq.   ql/src/java/org/apache/hadoop/hive/ql/index/AggregateIndexHandler.java, 
line 172
bq.   https://reviews.apache.org/r/1194/diff/2/?file=30443#file30443line172
bq.  
bq.   See recent changes in corresponding CompactIndexHandler code for 
bq.   HIVEOPTINDEXFILTER; need the same here (or better, factor out common code here 
bq.   and elsewhere).
bq.   
bq.   On a related note, you may be able to use the same technique instead 
bq.   of isQueryInsertToTable; this would be preferable since it's nice to be able to 
bq.   use the index rewrite in cases where it's a normal INSERT table with index 
bq.   being used for GROUP BY on SELECT from some other table.
bq.  

I have factored out the common code in all index handler classes and placed it 
in the IndexUtils file. 

I also removed the code for isQueryInsertToTable and am setting 
HIVEOPTINDEXFILTER to false instead. 


bq.  On 2011-08-05 21:20:21, John Sichi wrote:
bq.   
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyProcFactory.java,
 line 153
bq.   https://reviews.apache.org/r/1194/diff/2/?file=30449#file30449line153
bq.  
bq.   Shouldn't this be the same as COUNT(*)?
bq.  

Yes, it is. I missed changing this part from the previous code.


- Prajakta


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1194/#review1303
---


On 2011-08-03 10:31:42, Prajakta Kalmegh wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1194/
bq.  ---
bq.  
bq.  (Updated 2011-08-03 10:31:42)
bq.  
bq.  
bq.  Review request for hive and John Sichi.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  This patch has defined a new AggregateIndexHandler which is used to 
optimize the query plan for groupby queries. 
bq.  
bq.  
bq.  This addresses bug HIVE-1694.
bq.  https://issues.apache.org/jira/browse/HIVE-1694
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.common/src/java/org/apache/hadoop/hive/conf/HiveConf.java b46976f 
bq.ql/src/java/org/apache/hadoop/hive/ql/index/AggregateIndexHandler.java 
PRE-CREATION 
bq.ql/src/java/org/apache/hadoop/hive/ql/index/HiveIndex.java 591c9ff 
bq.ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java a57f9cf 
bq.ql/src/java/org/apache/hadoop/hive/ql/optimizer/IndexUtils.java 
PRE-CREATION 
bq.ql/src/java/org/apache/hadoop/hive/ql/optimizer/Optimizer.java 590d69a 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyCtx.java 
PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyProcFactory.java
 PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteGBUsingIndex.java 
PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteParseContextGenerator.java
 PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteQueryUsingAggregateIndex.java
 PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteQueryUsingAggregateIndexCtx.java
 PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/index/IndexWhereProcessor.java
 8295687 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/index/IndexWhereTaskDispatcher.java
 699519b 
bq.ql/src/test/queries/clientpositive/ql_rewrite_gbtoidx.q PRE-CREATION 
bq.ql/src/test/results/clientpositive/ql_rewrite_gbtoidx.q.out PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1194/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Prajakta
bq.  
bq.




[jira] [Commented] (HIVE-2430) Performance degradation in stats DB after JIRA HIVE-2144 (https://issues.apache.org/jira/browse/HIVE-2144)

2011-09-07 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13099178#comment-13099178
 ] 

jirapos...@reviews.apache.org commented on HIVE-2430:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1737/
---

Review request for hive and Ning Zhang.


Summary
---

I considered two different strategies for handling duplicates after removing 
the primary key from the table.

1) Go back to performing a select, and then updating if a row for the file 
exists, or inserting a new record otherwise.

2) Always insert records and then during aggregation get the max value for each 
statistic with a group by on the file name, and then aggregate those statistics.

This diff contains the code for option 2.  I determined this to be the better 
option by adding a couple stress tests to TestStatsPublisherEnhanced, and then 
comparing the run times for the two implementations using derby and MySQL.  The 
two tests checked the performance when inserting a couple hundred rows for each 
of two files, and inserting several hundred rows, each for a different file.  
In each case, when I ran the tests on my machine there wasn't much difference 
for derby, but for MySQL I was seeing both tests run about 100 ms faster with 
option 2.  I ran both tests several times, to confirm what I was seeing.

Note that previously, if statistics were added for a file, and then statistics 
were added again for that same file, but missing some number of values, those 
missing values were erased from the row.  With this new implementation the old 
values for those missing statistics will be used.  This case will probably 
never happen in the field.
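
A minimal sketch of the option 2 aggregation described above, assuming a single numeric statistic per row: duplicates are allowed at insert time, and the aggregation step first takes the maximum value per file and then sums those maxima. The names StatsDedupSketch, StatRow and aggregate are hypothetical and this is not the JDBCStatsPublisher code; the real implementation does the equivalent in SQL with a group by on the file name.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class StatsDedupSketch {
    /** One published row: a file identifier and the value of one statistic. */
    static final class StatRow {
        final String fileId;
        final long value;

        StatRow(String fileId, long value) {
            this.fileId = fileId;
            this.value = value;
        }
    }

    /** Resolve duplicates by taking the max per file, then sum across files. */
    static long aggregate(List<StatRow> rows) {
        Map<String, Long> maxPerFile = new HashMap<>();
        for (StatRow row : rows) {
            maxPerFile.merge(row.fileId, row.value, Math::max);
        }
        long total = 0;
        for (long max : maxPerFile.values()) {
            total += max;
        }
        return total;
    }

    public static void main(String[] args) {
        List<StatRow> rows = Arrays.asList(
            new StatRow("file1", 10),
            new StatRow("file1", 12),  // duplicate row for the same file
            new StatRow("file2", 5));
        System.out.println(aggregate(rows)); // prints 17 (12 + 5)
    }
}
```

Because the write path is a plain insert, no select or primary-key check is needed per row; the cost of resolving duplicates is paid once, at aggregation time.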


This addresses bug HIVE-2430.
https://issues.apache.org/jira/browse/HIVE-2430


Diffs
-

  
trunk/ql/src/java/org/apache/hadoop/hive/ql/stats/jdbc/JDBCStatsPublisher.java 
1165899 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/stats/jdbc/JDBCStatsSetupConstants.java
 1165899 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/stats/jdbc/JDBCStatsUtils.java 
1165899 
  
trunk/ql/src/test/org/apache/hadoop/hive/ql/exec/TestStatsPublisherEnhanced.java
 1165899 

Diff: https://reviews.apache.org/r/1737/diff


Testing
---

I added two new stress tests to TestStatsPublisherEnhanced.  I also modified 
one of the tests to reflect the modified behavior described in the Description.

I ran the unit test queries as well.


Thanks,

Kevin



 Performance degradation in stats DB after JIRA HIVE-2144 
 (https://issues.apache.org/jira/browse/HIVE-2144)
 --

 Key: HIVE-2430
 URL: https://issues.apache.org/jira/browse/HIVE-2430
 Project: Hive
  Issue Type: Bug
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2430.1.patch.txt


 After JIRA HIVE-2144 (https://issues.apache.org/jira/browse/HIVE-2144), the 
 performance in stats DB degraded significantly due to MySQL's inefficient 
 index maintenance. 
 We should remove the primary index introduced in that JIRA and resolve 
 duplicates in the stats aggregation
 phase. 





[jira] [Commented] (HIVE-2429) skip corruption bug that cause data not decompressed

2011-09-07 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13099255#comment-13099255
 ] 

jirapos...@reviews.apache.org commented on HIVE-2429:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1738/
---

Review request for Yongqiang He.


Summary
---

Fix for lazy decompression


This addresses bug HIVE-2429.
https://issues.apache.org/jira/browse/HIVE-2429


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/io/RCFile.java 1166283 
  trunk/ql/src/test/queries/clientpositive/rcfile_toleratecorruptions.q 
PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/rcfile_toleratecorruptions.q.out 
PRE-CREATION 

Diff: https://reviews.apache.org/r/1738/diff


Testing
---

Added unit test


Thanks,

Ramkumar



 skip corruption bug that cause data not decompressed
 

 Key: HIVE-2429
 URL: https://issues.apache.org/jira/browse/HIVE-2429
 Project: Hive
  Issue Type: Bug
Reporter: He Yongqiang
Assignee: Ramkumar Vadali
 Attachments: HIVE-2429.patch


 This is a regression of https://issues.apache.org/jira/browse/HIVE-2404





[jira] [Commented] (HIVE-2397) Support with rollup option for group by

2011-09-07 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13099593#comment-13099593
 ] 

jirapos...@reviews.apache.org commented on HIVE-2397:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1637/
---

(Updated 2011-09-07 21:47:48.620408)


Review request for hive, Yongqiang He, Ning Zhang, and Siying Dong.


Changes
---

Two changes:

1) Added groupby_withrollup.q.out, but forgot to add groupby_withrollup.q

2) I modified the method modifyAggregations in GroupbyOperator to take a list 
of AggregationBuffer[]'s.  This makes it so that we only need to evaluate each 
row of input once, and then we can update all the rollup aggregations at once, 
instead of reevaluating the same row of input for each rollup row.
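
The second change can be pictured roughly as follows. This is a sketch under assumptions, not the GroupByOperator code: SumBuffer is a hypothetical stand-in for an aggregation buffer, evaluate stands in for evaluating the aggregation parameters of a row, and the evaluations counter exists only to show that the row is evaluated once.

```java
import java.util.Arrays;
import java.util.List;

public class EvaluateOnceSketch {
    /** Hypothetical stand-in for one aggregation buffer: a running sum. */
    static final class SumBuffer {
        long sum;
    }

    /** Counts how many times a row was evaluated (for illustration only). */
    static int evaluations = 0;

    /** Stand-in for evaluating the aggregation parameter of an input row. */
    static long evaluate(String row) {
        evaluations++;
        return row.length();
    }

    /** Evaluate the row once, then fold the value into every rollup buffer. */
    static void updateAll(String row, List<SumBuffer> buffers) {
        long value = evaluate(row);
        for (SumBuffer buffer : buffers) {
            buffer.sum += value;
        }
    }

    public static void main(String[] args) {
        // One buffer per rollup row that this input row contributes to.
        List<SumBuffer> buffers = Arrays.asList(new SumBuffer(), new SumBuffer(), new SumBuffer());
        updateAll("abcd", buffers);
        System.out.println(buffers.get(0).sum + " " + evaluations); // prints "4 1"
    }
}
```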


Summary
---

If a user runs a query that includes group by ... with rollup, the behavior is 
like that of MySQL, see the task.

I had to implement 4 different ways of providing this behavior to fit in with 
the 4 different ways of implementing group by depending on whether map 
aggregation is allowed, and whether the data is known to be skewed.

If map aggregation is allowed, it is a simple matter of adding new keys with an 
increasing number of NULLs to the hash map to collect the data for the new rows 
as part of the map side hash aggregation.
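
The map-side case above can be sketched as follows, assuming a simple COUNT aggregation and string-valued group-by columns. For each input row the sketch adds the original key plus copies with an increasing number of trailing NULLs to the hash map. RollupKeySketch, rollupKeys and countWithRollup are hypothetical names, not part of the patch.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class RollupKeySketch {
    /** For key (a, b) returns (a, b), (a, NULL), (NULL, NULL). */
    static List<List<String>> rollupKeys(List<String> key) {
        List<List<String>> keys = new ArrayList<>();
        for (int nulls = 0; nulls <= key.size(); nulls++) {
            List<String> rolled = new ArrayList<>(key);
            // Replace the last `nulls` columns with NULL.
            for (int i = key.size() - nulls; i < key.size(); i++) {
                rolled.set(i, null);
            }
            keys.add(rolled);
        }
        return keys;
    }

    /** Hash aggregation (COUNT) over the original keys and their rollup keys. */
    static Map<List<String>, Long> countWithRollup(List<List<String>> rows) {
        Map<List<String>, Long> hash = new HashMap<>();
        for (List<String> row : rows) {
            for (List<String> key : rollupKeys(row)) {
                hash.merge(key, 1L, Long::sum);
            }
        }
        return hash;
    }

    public static void main(String[] args) {
        List<List<String>> rows = Arrays.asList(
            Arrays.asList("x", "1"),
            Arrays.asList("x", "2"));
        System.out.println(countWithRollup(rows));
    }
}
```

With the two sample rows, the keys (x, NULL) and (NULL, NULL) each collect a count of 2, while (x, 1) and (x, 2) each collect 1.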

If map aggregation is not allowed and the data is not skewed, I perform a 
reduce job which performs a hash aggregation very similar to the way it is 
performed on the map side.  I then perform a mergepartial reduce job to perform 
a final aggregation on the hash aggregation.

If map aggregation is not allowed and the data is skewed and there are no 
distinct aggregations for the group by, I aggregate data for the new rows with 
NULLs as part of the non-hash aggregation.  This was as simple as adding the 
new functionality to the map side hash aggregation.

If map aggregation is not allowed and the data is skewed and there are distinct 
aggregations for the group by, I perform a reduce job which performs a hash 
aggregation, and then use the same implementation that is used when there is no 
rollup option set.

I have done my best not to detract from any optimizations that were made for 
each of the four different implementations of group by, but for the ones 
where I add a new reduce job, I am not sure how successful I was.

Currently, the optimizations for multiple group bys are not supported for 
queries with the rollup option set, but I am continuing to look into this.


This addresses bug HIVE-2397.
https://issues.apache.org/jira/browse/HIVE-2397


Diffs (updated)
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/GroupByOperator.java 1166376 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/KeyWrapper.java 1166376 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/KeyWrapperFactory.java 
1166376 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g 1166376 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 
1166376 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/GroupByDesc.java 1166376 
  trunk/ql/src/test/queries/clientpositive/groupby10_withrollup.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby11_withrollup.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby1_limit_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby1_map_nomap_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby1_map_skew_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby1_map_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby1_noskew_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby1_withrollup.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby2_limit_withrollup.q 
PRE-CREATION 
  
trunk/ql/src/test/queries/clientpositive/groupby2_map_multi_distinct_withrollup.q
 PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby2_map_skew_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby2_map_withrollup.q 
PRE-CREATION 
  
trunk/ql/src/test/queries/clientpositive/groupby2_noskew_multi_distinct_withrollup.q
 PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby2_noskew_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby2_withrollup.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby7_map_skew_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby7_map_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby7_noskew_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby7_withrollup.q PRE-CREATION 
  

[jira] [Commented] (HIVE-2431) upgrading thrift version didn't upgrade libthrift.jar symlink correctly

2011-09-07 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13099678#comment-13099678
 ] 

jirapos...@reviews.apache.org commented on HIVE-2431:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1740/
---

Review request for hive and Carl Steinbach.


Summary
---

simple change to fix the libfb303.jar and libthrift.jar symlinks.


This addresses bug HIVE-2431.
https://issues.apache.org/jira/browse/HIVE-2431


Diffs
-

  trunk/build.xml 1166476 

Diff: https://reviews.apache.org/r/1740/diff


Testing
---

ant clean package then check build/dist/lib for the correctness. 


Thanks,

Ning



 upgrading thrift version didn't upgrade libthrift.jar symlink correctly
 ---

 Key: HIVE-2431
 URL: https://issues.apache.org/jira/browse/HIVE-2431
 Project: Hive
  Issue Type: Bug
Reporter: Ning Zhang
Assignee: Ning Zhang
 Attachments: HIVE-2431.patch


 libthrift.jar and libfb303.jar are symlinks to the current thrift version. 
 With the upgrade to 0.7, there's a bug in the symlink creation. 





[jira] [Commented] (HIVE-2416) Multiple distinct function to support hive.groupby.skewindata optimization

2011-09-07 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13099914#comment-13099914
 ] 

jirapos...@reviews.apache.org commented on HIVE-2416:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1741/
---

(Updated 2011-09-08 04:52:29.083703)


Review request for hive and Amareshwari Sriramadasu.


Changes
---

Update BUGs field to link JIRA


Summary
---

Currently, when multiple DISTINCT functions are used, the hive.groupby.skewindata 
optimization parameter must be set to false, or else an exception is raised:

Error in semantic analysis: DISTINCT on different columns not supported with 
skew in data

Skew groupby should support multiple DISTINCT functions.


This addresses bug HIVE-2416.
https://issues.apache.org/jira/browse/HIVE-2416


Diffs
-

  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/ReduceSinkOperator.java
 1162620 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ErrorMsg.java
 1162620 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
 1162620 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/PlanUtils.java
 1162620 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/ReduceSinkDesc.java
 1162620 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/groupby2_map_skew_multi_distinct.q
 1162620 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/groupby3_multi_distinct.q
 1162620 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/groupby2_map_skew_multi_distinct.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/groupby2_map_skew_multi_distinct.q.out
 1162620 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/groupby3_map_skew_multi_distinct.q.out
 1162620 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/groupby3_multi_distinct.q.out
 1162620 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/groupby2.q.out
 1162620 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/groupby2_map_skew.q.out
 1162620 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/groupby2_map_skew_multi_distinct.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/groupby3.q.out
 1162620 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/groupby3_map_skew.q.out
 1162620 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/nullgroup4.q.out
 1162620 

Diff: https://reviews.apache.org/r/1741/diff


Testing
---

All UT passed


Thanks,

Ted



 Multiple distinct function to support hive.groupby.skewindata optimization
 --

 Key: HIVE-2416
 URL: https://issues.apache.org/jira/browse/HIVE-2416
 Project: Hive
  Issue Type: Improvement
Reporter: Ted Xu
Assignee: Ted Xu
 Attachments: multi_distinct_skew.patch


 Currently, when multiple DISTINCT functions are used, the hive.groupby.skewindata 
 optimization parameter must be set to false, or else an exception is raised:
 {code}
 Error in semantic analysis: DISTINCT on different columns not supported with 
 skew in data
 {code}
 Skew groupby should support multiple DISTINCT functions.





[jira] [Commented] (HIVE-2431) upgrading thrift version didn't upgrade libthrift.jar symlink correctly

2011-09-07 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100078#comment-13100078
 ] 

jirapos...@reviews.apache.org commented on HIVE-2431:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1740/#review1811
---

Ship it!


+1

- Amareshwari


On 2011-09-08 00:25:14, Ning Zhang wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1740/
bq.  ---
bq.  
bq.  (Updated 2011-09-08 00:25:14)
bq.  
bq.  
bq.  Review request for hive and Carl Steinbach.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  simple change to fix the libfb303.jar and libthrift.jar symlinks.
bq.  
bq.  
bq.  This addresses bug HIVE-2431.
bq.  https://issues.apache.org/jira/browse/HIVE-2431
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/build.xml 1166476 
bq.  
bq.  Diff: https://reviews.apache.org/r/1740/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  ant clean package then check build/dist/lib for the correctness. 
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Ning
bq.  
bq.







[jira] [Commented] (HIVE-2196) Ensure HiveConf includes all properties defined in hive-default.xml

2011-09-06 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13097758#comment-13097758
 ] 

jirapos...@reviews.apache.org commented on HIVE-2196:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1423/
---

(Updated 2011-09-06 06:16:26.733090)


Review request for hive, Carl Steinbach and John Sichi.


Changes
---

Reworked the patch. The latest patch takes care of the issues below:
1) Configuration property values were verified and synced with 
hive-default.xml.
2) The compilation failure was corrected.


Summary
---

The configurations below are not used in the code base, so they were removed 
from hive-default.xml:

hive.mapjoin.hashtable.initialCapacity 
hive.mapjoin.hashtable.loadfactor 
hive.mapjoin.smalltable.filesize 
hive.optimize.pruner 
hive.stats.jdbc.atomic 
hive.concurrency.manager 

The configurations below are not used in the code base, but they are required 
by the system, so they need to stay in hive-default.xml and do not need to be 
added to HiveConf.java:

javax.jdo.option.ConnectionDriverName 
javax.jdo.PersistenceManagerFactoryClass 
javax.jdo.option.DetachAllOnCommit 
javax.jdo.option.NonTransactionalRead 
javax.jdo.option.ConnectionUserName 
fs.har.impl 


This addresses bug HIVE-2196.
https://issues.apache.org/jira/browse/HIVE-2196


Diffs (updated)
-

  
trunk/metastore/src/test/org/apache/hadoop/hive/metastore/TestRemoteHiveMetaStore.java
 1165244 
  
trunk/shims/src/test/org/apache/hadoop/hive/thrift/TestHadoop20SAuthBridge.java 
1165244 
  
trunk/metastore/src/test/org/apache/hadoop/hive/metastore/TestMarkPartitionRemote.java
 1165244 
  
trunk/metastore/src/test/org/apache/hadoop/hive/metastore/TestMetaStoreAuthorization.java
 1165244 
  
trunk/metastore/src/test/org/apache/hadoop/hive/metastore/TestMetaStoreEventListener.java
 1165244 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 
1165244 
  
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStoreClient.java
 1165244 
  trunk/conf/hive-default.xml 1165244 
  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1165244 

Diff: https://reviews.apache.org/r/1423/diff


Testing
---

All unit tests passed


Thanks,

chinna



 Ensure HiveConf includes all properties defined in hive-default.xml
 ---

 Key: HIVE-2196
 URL: https://issues.apache.org/jira/browse/HIVE-2196
 Project: Hive
  Issue Type: Bug
  Components: Configuration
Affects Versions: 0.8.0
Reporter: Carl Steinbach
Assignee: Chinna Rao Lalam
 Attachments: HIVE-2196.1.patch, HIVE-2196.2.patch, HIVE-2196.3.patch, 
 HIVE-2196.build.log, HIVE-2196.patch


 There are a bunch of properties that are defined in hive-default.xml but not 
 in HiveConf.





[jira] [Commented] (HIVE-2427) Warn user that precision is lost when bigint is implicitly cast to double in joins and unions.

2011-09-06 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13098211#comment-13098211
 ] 

jirapos...@reviews.apache.org commented on HIVE-2427:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1705/
---

(Updated 2011-09-06 17:41:24.308161)


Review request for hive and Siying Dong.


Changes
---

Added the new property to hive-default.xml


Summary
---

I added checks before the type conversions for both joins and unions that raise 
either an error or a warning when a bigint is going to be converted to 
a double.

I also added a new variable hive.mapred.bigint.comparison.mode, which is set to 
either strict or nonstrict to indicate whether an error or a warning should be 
given when a bigint is converted to a double.  This is instead of the original 
implementation which used the variable hive.mapred.mode.
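
To see why the bigint-to-double conversion deserves a warning, the small sketch below shows two distinct 64-bit integers that compare as equal once both are cast to double. It uses plain Java long and double as stand-ins for Hive's bigint and double; the class and method names are hypothetical.

```java
public class BigintToDoubleSketch {
    /** Compare two longs the way an implicit cast to double would. */
    static boolean equalAfterDoubleCast(long a, long b) {
        return (double) a == (double) b;
    }

    public static void main(String[] args) {
        long a = 9007199254740992L; // 2^53, exactly representable as a double
        long b = a + 1;             // 2^53 + 1, not representable as a double

        System.out.println(a == b);                     // prints false
        System.out.println(equalAfterDoubleCast(a, b)); // prints true
    }
}
```

A double has a 53-bit significand, so integers above 2^53 in magnitude cannot all be represented; a join or union that compares such values as doubles can therefore match rows whose bigint values differ.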


This addresses bug HIVE-2427.
https://issues.apache.org/jira/browse/HIVE-2427


Diffs (updated)
-

  trunk/ql/src/test/results/clientnegative/compare_string_bigint_union.q.out 
PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/filter_join_breaktask2.q.out 1164293 
  trunk/ql/src/test/results/clientnegative/compare_double_bigint_union.q.out 
PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/compare_string_bigint.q.out 1164293 
  trunk/ql/src/test/results/clientnegative/compare_string_bigint_join.q.out 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/filter_join_breaktask2.q 1164293 
  trunk/ql/src/test/results/clientnegative/compare_double_bigint.q.out 1164293 
  trunk/ql/src/test/results/clientnegative/compare_double_bigint_join.q.out 
PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/compare_double_bigint_union.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/compare_string_bigint.q 1164293 
  trunk/ql/src/test/queries/clientnegative/compare_string_bigint_join.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/compare_string_bigint_union.q 
PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/ExprNodeGenericFuncDesc.java 
1164293 
  trunk/ql/src/test/queries/clientnegative/compare_double_bigint.q 1164293 
  trunk/ql/src/test/queries/clientnegative/compare_double_bigint_join.q 
PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ErrorMsg.java 1164293 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 
1164293 
  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1164293 
  trunk/conf/hive-default.xml 1164293 

Diff: https://reviews.apache.org/r/1705/diff


Testing
---

I added several more tests to demonstrate the errors.

I also modified a unit test in which a join involved a comparison between a 
string and a bigint.  I set hive.mapred.bigint.comparison.mode to nonstrict, so 
the only change was that the new warning appeared in the output.


Thanks,

Kevin



 Warn user that precision is lost when bigint is implicitly cast to double in 
 joins and unions.
 --

 Key: HIVE-2427
 URL: https://issues.apache.org/jira/browse/HIVE-2427
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2427.1.patch.txt, HIVE-2427.2.patch.txt


 When a bigint is implicitly cast to a double (when a bigint is involved in an 
 equality expression with a string or double as part of a join condition, or 
 when a bigint column is to be combined with a string or double column as part 
 of a union) precision may be lost, resulting in unexpected behavior. Until we 
 fix the underlying issue we should throw an error in strict mode, and a 
 warning in nonstrict mode alerting the user about this.
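
The precision loss described above can be reproduced outside Hive. The following is a minimal sketch in Python, not Hive code; the helper names are made up for illustration, and it assumes only that a double is an IEEE 754 64-bit float, which cannot represent every integer above 2^53:

```python
# 2**53 + 1 fits easily in a bigint (int64) but has no exact double representation.
BIG = 2**53 + 1  # 9007199254740993


def equal_as_bigint(a: int, b: int) -> bool:
    """Compare the two values as integers (no implicit cast)."""
    return a == b


def equal_as_double(a: int, b: int) -> bool:
    """Compare the two values after casting both to double, as an implicit cast would."""
    return float(a) == float(b)


# Two different bigint keys...
print(equal_as_bigint(BIG, BIG - 1))
# ...collapse to the same double, so a join on them would match unexpectedly.
print(equal_as_double(BIG, BIG - 1))
```

Two distinct join keys that differ only in the low bits compare as equal once both sides are doubles, which is the unexpected behavior the issue refers to.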

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2405) get_privilege does not get user level privilege

2011-09-06 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13098409#comment-13098409
 ] 

jirapos...@reviews.apache.org commented on HIVE-2405:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1646/#review1774
---



trunk/ql/src/test/queries/clientpositive/authorization_7.q
https://reviews.apache.org/r/1646/#comment4051

Yes. What we want to do is test the global privilege. The first command 
grants a global privilege, and the commands after it test with that privilege. 


- Yongqiang


On 2011-08-24 22:23:39, Yongqiang He wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1646/
bq.  ---
bq.  
bq.  (Updated 2011-08-24 22:23:39)
bq.  
bq.  
bq.  Review request for hive and Ning Zhang.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  get_privilege does not get user level privilege and should return an empty list instead of null
bq.  
bq.  
bq.  This addresses bug HIVE-2405.
bq.  https://issues.apache.org/jira/browse/HIVE-2405
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.    trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 1161232 
bq.    trunk/ql/src/test/queries/clientpositive/authorization_7.q PRE-CREATION 
bq.    trunk/ql/src/test/results/clientpositive/authorization_7.q.out PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1646/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Yongqiang
bq.  
bq.



 get_privilege does not get user level privilege
 ---

 Key: HIVE-2405
 URL: https://issues.apache.org/jira/browse/HIVE-2405
 Project: Hive
  Issue Type: Bug
Reporter: He Yongqiang
Assignee: He Yongqiang
 Attachments: HIVE-2405.1.patch, HIVE-2405.2.patch


 hive> set hive.security.authorization.enabled=true;
 hive> grant all to user heyongqiang;
 hive> show grant user heyongqiang;
 principalName heyongqiang 
 principalType USER
 privilege All 
 grantTime Wed Aug 24 11:51:54 PDT 2011
 grantor   heyongqiang 
 Time taken: 0.032 seconds
 hive> CREATE TABLE src (foo INT, bar STRING);
 Authorization failed:No privilege 'Create' found for outputs { 
 database:default}. Use show grant to get more details.
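
The session above fails because the user-level grant is never consulted. The following is a rough sketch in Python of the intended behavior, not the HiveMetaStore code; the grants table layout and the function names get_privileges and is_authorized are assumptions made for illustration. The lookup returns an empty list rather than null, and the authorization check includes user-level (global) grants:

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical grants table: (user, database or None for a user-level grant) -> privileges.
Grants = Dict[Tuple[str, Optional[str]], List[str]]


def get_privileges(grants: Grants, user: str, database: Optional[str] = None) -> List[str]:
    """Return the privileges held at the requested level; never returns None."""
    return list(grants.get((user, database), []))


def is_authorized(grants: Grants, user: str, database: str, needed: str) -> bool:
    """Check the user-level grants as well as the database-level grants."""
    held = get_privileges(grants, user) + get_privileges(grants, user, database)
    return any(p.lower() in ("all", needed.lower()) for p in held)


grants: Grants = {("heyongqiang", None): ["All"]}
print(get_privileges(grants, "heyongqiang", "default"))
print(is_authorized(grants, "heyongqiang", "default", "Create"))
```

Returning an empty list lets the caller concatenate the results from several levels without null checks.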





[jira] [Commented] (HIVE-2428) ADD ARCHIVE has no effect when query runs in local mode

2011-09-06 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13098488#comment-13098488
 ] 

jirapos...@reviews.apache.org commented on HIVE-2428:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1731/
---

Review request for hive, Scott Chen and Ning Zhang.


Summary
---

Added the -archives flag to the command that is run when a map reduce task is 
run locally.  It is handled in essentially the same way as the -files flag.
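
As an illustration only, here is a minimal Python sketch of the idea; it is not the MapRedTask code, and the function name generic_resource_args is made up. It assumes the Hadoop generic options -files and -archives each take a comma-separated list:

```python
from typing import List, Sequence


def generic_resource_args(files: Sequence[str], archives: Sequence[str]) -> List[str]:
    """Build the resource arguments for the locally launched command."""
    args: List[str] = []
    if files:
        args += ["-files", ",".join(files)]
    if archives:
        # Archives are passed the same way as files, just under a different flag.
        args += ["-archives", ",".join(archives)]
    return args


print(generic_resource_args(["foo.sh"], ["/tmp/mapreduce_test.zip"]))
```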


This addresses bug HIVE-2428.
https://issues.apache.org/jira/browse/HIVE-2428


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/MapRedTask.java 1164711 

Diff: https://reviews.apache.org/r/1731/diff


Testing
---

I ran the unit test queries to verify nothing was broken.

I also ran the query specified in the issue's description and verified that 
information was returned for both files and archives in both local and 
non-local mode.


Thanks,

Kevin



 ADD ARCHIVE has no effect when query runs in local mode
 ---

 Key: HIVE-2428
 URL: https://issues.apache.org/jira/browse/HIVE-2428
 Project: Hive
  Issue Type: Bug
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2428.1.patch.txt


 When you ADD ARCHIVE a file and the query runs in local mode, the file is not 
 available at all.
 For example running the commands
 echo '#!/bin/bash\nls -l 12;read' > foo.sh
 hive -e 'ADD FILE foo.sh; ADD ARCHIVE /tmp/mapreduce_test.zip; SELECT 
 TRANSFORM(*) USING sh foo.sh FROM src'
 when the query is allowed to run in local mode, only information for foo.sh 
 is returned.
 However, if the query is not allowed to run in local mode, information for 
 both foo.sh and mapreduce_test.zip is returned.





[jira] [Commented] (HIVE-2426) Test that views with joins work properly

2011-09-06 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13098500#comment-13098500
 ] 

jirapos...@reviews.apache.org commented on HIVE-2426:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1732/
---

Review request for hive.


Summary
---

HIVE-2426 patch 3


This addresses bug HIVE-2426.
https://issues.apache.org/jira/browse/HIVE-2426


Diffs
-

  
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/UnparseTranslator.java 1165909 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/join_view.q PRE-CREATION 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join_view.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1732/diff


Testing
---


Thanks,

John



 Test that views with joins work properly
 

 Key: HIVE-2426
 URL: https://issues.apache.org/jira/browse/HIVE-2426
 Project: Hive
  Issue Type: Test
Reporter: Charles Chen
Assignee: Charles Chen
 Fix For: 0.9.0

 Attachments: HIVE-2426.3.patch, HIVE-2426v2.patch


 With the testcase
 {noformat}
 drop table invites;
 drop table invites2;
 create table invites (foo int, bar string) partitioned by (ds string);
 create table invites2 (foo int, bar string) partitioned by (ds string);
 set hive.mapred.mode=strict;
 -- test join views: see HIVE-1989
 create view v as select invites.bar, invites2.foo, invites2.ds from invites 
 join invites2 on invites.ds=invites2.ds;
 explain select * from v where ds='2011-09-01';
 drop view v;
 drop table invites;
 drop table invites2;
 {noformat}
 We should not have the partition pruner complain about invites.ds not having 
 a predicate, because the predicate invites.ds='2011-09-01' will be inferred 
 from invites2.ds='2011-09-01' by the ppd transitivity optimization.
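
The inference can be sketched as follows. This is a simplified Python illustration, not the Hive ppd implementation; the function name infer_predicates is hypothetical, and it assumes only equality join conditions and one constant per group of equal columns:

```python
from typing import Dict, Iterable, Tuple


def infer_predicates(
    join_equalities: Iterable[Tuple[str, str]],
    constant_predicates: Dict[str, str],
) -> Dict[str, str]:
    """Propagate column = constant predicates across columns equated by join conditions."""
    parent: Dict[str, str] = {}

    def find(col: str) -> str:
        # Union-find lookup with path halving.
        parent.setdefault(col, col)
        while parent[col] != col:
            parent[col] = parent[parent[col]]
            col = parent[col]
        return col

    # Columns equated by a join condition end up in the same equivalence class.
    for left, right in join_equalities:
        parent[find(left)] = find(right)

    # Record the constant known for each class, then hand it to every member.
    class_value = {find(col): value for col, value in constant_predicates.items()}
    return {col: class_value[find(col)] for col in list(parent) if find(col) in class_value}


print(infer_predicates([("invites.ds", "invites2.ds")], {"invites2.ds": "2011-09-01"}))
```

With the join condition from the testcase, the filter on invites2.ds also yields a filter on invites.ds, so both tables have a partition predicate.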





[jira] [Commented] (HIVE-2427) Warn user that precision is lost when bigint is implicitly cast to double in joins and unions.

2011-09-06 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13098526#comment-13098526
 ] 

jirapos...@reviews.apache.org commented on HIVE-2427:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1705/
---

(Updated 2011-09-07 01:33:36.605379)


Review request for hive, Yongqiang He, Ning Zhang, and Siying Dong.


Summary
---

I added checks before the type conversions for both joins and unions to either 
throw an error or emit a warning when a bigint is about to be converted to 
a double.

I also added a new variable hive.mapred.bigint.comparison.mode, which is set to 
either strict or nonstrict to indicate whether an error or a warning should be 
given when a bigint is converted to a double.  This replaces the original 
implementation, which used the variable hive.mapred.mode.
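
A rough sketch of the check in Python, not the SemanticAnalyzer code. Only the property name comes from the summary above; the function check_comparison, the error class, the type names as strings, and the nonstrict fallback are assumptions for illustration:

```python
import warnings

MODE_KEY = "hive.mapred.bigint.comparison.mode"  # property name from the summary


class BigintComparisonError(Exception):
    """Hypothetical error type raised in strict mode."""


def check_comparison(conf: dict, left_type: str, right_type: str) -> None:
    """Raise in strict mode, warn otherwise, when a bigint would be cast to double."""
    types = {left_type, right_type}
    lossy_partners = types & {"double", "string"}
    if "bigint" not in types or not lossy_partners:
        return  # no implicit bigint-to-double cast involved
    other = sorted(lossy_partners)[0]
    message = f"bigint is compared with {other}; precision may be lost in the cast to double"
    # Falling back to nonstrict here is an assumption, not the documented default.
    if conf.get(MODE_KEY, "nonstrict") == "strict":
        raise BigintComparisonError(message)
    warnings.warn(message)
```

Keeping the switch separate from hive.mapred.mode lets a user turn this check on or off without changing the other strict-mode checks.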


This addresses bug HIVE-2427.
https://issues.apache.org/jira/browse/HIVE-2427


Diffs
-

  trunk/ql/src/test/results/clientnegative/compare_string_bigint_union.q.out 
PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/filter_join_breaktask2.q.out 1164293 
  trunk/ql/src/test/results/clientnegative/compare_double_bigint_union.q.out 
PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/compare_string_bigint.q.out 1164293 
  trunk/ql/src/test/results/clientnegative/compare_string_bigint_join.q.out 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/filter_join_breaktask2.q 1164293 
  trunk/ql/src/test/results/clientnegative/compare_double_bigint.q.out 1164293 
  trunk/ql/src/test/results/clientnegative/compare_double_bigint_join.q.out 
PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/compare_double_bigint_union.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/compare_string_bigint.q 1164293 
  trunk/ql/src/test/queries/clientnegative/compare_string_bigint_join.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/compare_string_bigint_union.q 
PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/ExprNodeGenericFuncDesc.java 
1164293 
  trunk/ql/src/test/queries/clientnegative/compare_double_bigint.q 1164293 
  trunk/ql/src/test/queries/clientnegative/compare_double_bigint_join.q 
PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ErrorMsg.java 1164293 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 
1164293 
  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1164293 
  trunk/conf/hive-default.xml 1164293 

Diff: https://reviews.apache.org/r/1705/diff


Testing
---

I added several more tests to demonstrate the errors.

I also modified a unit test in which a join involved a comparison between a 
string and a bigint.  I set hive.mapred.bigint.comparison.mode to nonstrict, so 
the only change was that the new warning appeared in the output.


Thanks,

Kevin



 Warn user that precision is lost when bigint is implicitly cast to double in 
 joins and unions.
 --

 Key: HIVE-2427
 URL: https://issues.apache.org/jira/browse/HIVE-2427
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2427.1.patch.txt, HIVE-2427.2.patch.txt


 When a bigint is implicitly cast to a double (when a bigint is involved in an 
 equality expression with a string or double as part of a join condition, or 
 when a bigint column is to be combined with a string or double column as part 
 of a union) precision may be lost, resulting in unexpected behavior. Until we 
 fix the underlying issue we should throw an error in strict mode, and a 
 warning in nonstrict mode alerting the user about this.





[jira] [Commented] (HIVE-2374) Make compression used between map reduce tasks configurable.

2011-09-06 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13098525#comment-13098525
 ] 

jirapos...@reviews.apache.org commented on HIVE-2374:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1516/
---

(Updated 2011-09-07 01:34:05.392697)


Review request for hive, Yongqiang He and Ning Zhang.


Summary
---

I added a field to MapredWork and MapredLocalWork which indicates whether the 
work is intermediate.  By intermediate, I mean that if the query is an insert, 
there is at least one other map reduce task that is guaranteed to happen before 
the move.  If the query is not an insert, intermediate applies to them all.  I 
determine this by defaulting the flag to true, and setting it to false when the 
tasks to move the data into a table or file are generated.

If the work for a map reduce task (local or otherwise) is intermediate, then we 
set the compression to be used on the output of the reduce to a configured 
value; the default is LZO.
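
The flag handling can be sketched as follows. This is a simplified Python illustration, not the patch itself; the class MapRedWorkSketch and the functions plan_insert and output_codec are made-up names, and returning None to mean "leave the existing output compression setting alone" is an assumption:

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class MapRedWorkSketch:
    """Stand-in for the work object; the flag defaults to true as described above."""
    name: str
    is_intermediate: bool = True


def plan_insert(stage_names: List[str]) -> List[MapRedWorkSketch]:
    """Build the stages of an insert; the stage feeding the move task is not intermediate."""
    works = [MapRedWorkSketch(name) for name in stage_names]
    if works:
        works[-1].is_intermediate = False  # cleared when the move task is generated
    return works


def output_codec(work: MapRedWorkSketch, intermediate_codec: str = "lzo") -> Optional[str]:
    """Pick the configured codec for intermediate output only."""
    return intermediate_codec if work.is_intermediate else None


print([output_codec(w) for w in plan_insert(["stage-1", "stage-2"])])
```

Defaulting the flag to true means a query with no move task needs no extra handling: every stage stays intermediate.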


This addresses bug HIVE-2374.
https://issues.apache.org/jira/browse/HIVE-2374


Diffs
-

  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1164667 
  trunk/conf/hive-default.xml 1164667 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/ExecDriver.java 1164667 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/MapredLocalTask.java 1164667 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/GenMRFileSink1.java 
1164667 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/MapredLocalWork.java 1164667 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/MapredWork.java 1164667 
  trunk/ql/src/test/org/apache/hadoop/hive/ql/exec/TestExecDriver.java 1164667 
  
trunk/ql/src/test/org/apache/hadoop/hive/ql/hooks/VerifyIsIntermediateHook.java 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/intermediate_compression.q 
PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/auto_join0.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join10.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join11.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join12.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join13.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join15.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join16.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join18.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join18_multi_distinct.q.out 
1164667 
  trunk/ql/src/test/results/clientpositive/auto_join2.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join20.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join21.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join22.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join23.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join24.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join26.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join27.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join28.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join29.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join30.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/auto_join31.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/cluster.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/ctas.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/filter_join_breaktask.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/groupby1.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/groupby10.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/groupby11.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/groupby1_limit.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/groupby1_map_skew.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/groupby2_map_skew.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/groupby3.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/groupby3_map_skew.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/groupby4.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/groupby5.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/groupby6.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/groupby6_map_skew.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/groupby8.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/groupby8_map.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/groupby8_map_skew.q.out 1164667 
  trunk/ql/src/test/results/clientpositive/groupby8_noskew.q.out 1164667 
  

[jira] [Commented] (HIVE-2397) Support with rollup option for group by

2011-09-06 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13098530#comment-13098530
 ] 

jirapos...@reviews.apache.org commented on HIVE-2397:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1637/
---

(Updated 2011-09-07 01:34:25.870854)


Review request for hive, Yongqiang He, Ning Zhang, and Siying Dong.


Summary
---

If a user runs a query that includes group by ... with rollup, the behavior 
matches that of MySQL; see the JIRA task.

I had to implement 4 different ways of providing this behavior to fit in with 
the 4 different ways of implementing group by depending on whether map 
aggregation is allowed, and whether the data is known to be skewed.

If map aggregation is allowed, it is a simple matter of adding new keys with an 
increasing number of NULLs to the hash map to collect the data for the new rows 
as part of the map side hash aggregation.
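
The map-side case can be sketched as follows. This is a minimal Python illustration, not the GroupByOperator code; the function names are made up, NULL is modelled as None, and the aggregate is assumed to be a sum:

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, Tuple


def rollup_keys(key: Tuple) -> Iterator[Tuple]:
    """Yield the key itself, then copies with an increasing number of trailing NULLs."""
    n = len(key)
    for nulls in range(n + 1):
        yield key[: n - nulls] + (None,) * nulls


def hash_aggregate_with_rollup(rows: Iterable[Tuple]) -> Dict[Tuple, int]:
    """Sum the last column of each row per group-by key and per rollup key."""
    totals: Dict[Tuple, int] = defaultdict(int)
    for *key, value in rows:
        for k in rollup_keys(tuple(key)):
            totals[k] += value
    return dict(totals)


rows = [("a", "x", 1), ("a", "y", 2), ("b", "x", 4)]
for key, total in hash_aggregate_with_rollup(rows).items():
    print(key, total)
```

Each input row updates one hash entry per rollup level, so the extra rows come out of the same hash map as the ordinary group-by rows.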

If map aggregation is not allowed and the data is not skewed, I add a reduce 
job which performs a hash aggregation very similar to the one performed on the 
map side.  I then add a mergepartial reduce job to perform a final aggregation 
on the output of the hash aggregation.

If map aggregation is not allowed and the data is skewed and there are no 
distinct aggregations for the group by, I aggregate data for the new rows with 
NULLs as part of the non-hash aggregation.  This was as simple as adding the 
new functionality to the map side hash aggregation.

If map aggregation is not allowed and the data is skewed and there are distinct 
aggregations for the group by, I perform a reduce job which performs a hash 
aggregation, and then use the same implementation that is used when there is no 
rollup option set.

I have done my best not to detract from any optimizations that were made for 
each of the four different implementations of group by, but for the ones 
where I add a new reduce job, I am not sure how successful I was.

Currently, the optimizations for multiple group bys are not supported for 
queries with the rollup option set, but I am continuing to look into this.


This addresses bug HIVE-2397.
https://issues.apache.org/jira/browse/HIVE-2397


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/GroupByOperator.java 1160895 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/KeyWrapper.java 1160895 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/KeyWrapperFactory.java 
1160895 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g 1160895 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 
1160895 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/GroupByDesc.java 1160895 
  trunk/ql/src/test/queries/clientpositive/groupby10_withrollup.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby11_withrollup.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby1_limit_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby1_map_nomap_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby1_map_skew_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby1_map_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby1_noskew_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby1_withrollup.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby2_limit_withrollup.q 
PRE-CREATION 
  
trunk/ql/src/test/queries/clientpositive/groupby2_map_multi_distinct_withrollup.q
 PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby2_map_skew_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby2_map_withrollup.q 
PRE-CREATION 
  
trunk/ql/src/test/queries/clientpositive/groupby2_noskew_multi_distinct_withrollup.q
 PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby2_noskew_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby2_withrollup.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby7_map_skew_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby7_map_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby7_noskew_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby7_withrollup.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby8_map_skew_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby8_map_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby8_noskew_withrollup.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby8_withrollup.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/groupby9_withrollup.q PRE-CREATION 
  

[jira] [Commented] (HIVE-2376) Upgrade Hive's Thrift dependency to version 0.7.0

2011-09-04 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13096997#comment-13096997
 ] 

jirapos...@reviews.apache.org commented on HIVE-2376:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1695/#review1745
---

Ship it!


+1
Running tests

- Amareshwari


On 2011-09-01 22:08:47, Carl Steinbach wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1695/
bq.  ---
bq.  
bq.  (Updated 2011-09-01 22:08:47)
bq.  
bq.  
bq.  Review request for hive and Amareshwari Sriramadasu.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  * Upgrade Hive's Thrift dependency to version 0.7.0
bq.  * Resolve Thrift dependency using Ivy
bq.  
bq.  
bq.  This addresses bug HIVE-2376.
bq.  https://issues.apache.org/jira/browse/HIVE-2376
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.service/src/java/org/apache/hadoop/hive/service/HiveServer.java c4b041f 
bq.shims/ivy.xml 88fae44 
bq.metastore/ivy.xml 7647b37 
bq.metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 285908e 
bq.ql/build.xml 15a58ef 
bq.ql/ivy.xml 99f3786 
bq.serde/ivy.xml 01c4d0c 
bq.service/ivy.xml 7254a09 
bq.build.xml f20cbb6 
bq.cli/ivy.xml abe7232 
bq.eclipse-templates/.classpath 74e5542 
bq.ivy/libraries.properties 4236df1 
bq.jdbc/ivy.xml ec26e74 
bq.lib/README 6cc2fba 
bq.lib/thrift-fb303.LICENSE 4eacb64 
bq.lib/thrift.LICENSE 9d189ef 
bq.  
bq.  Diff: https://reviews.apache.org/r/1695/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Carl
bq.  
bq.



 Upgrade Hive's Thrift dependency to version 0.7.0
 -

 Key: HIVE-2376
 URL: https://issues.apache.org/jira/browse/HIVE-2376
 Project: Hive
  Issue Type: Task
  Components: Build Infrastructure, Clients
Reporter: Carl Steinbach
Assignee: Carl Steinbach
 Fix For: 0.8.0

 Attachments: HIVE-2376-complete.1.patch.txt, 
 HIVE-2376-complete.2.patch.txt, HIVE-2376-no-gen.1.patch.txt, 
 HIVE-2376-no-gen.2.patch.txt, libfb303-0.7.0.jar, libthrift-0.7.0.jar








[jira] [Commented] (HIVE-1989) recognize transitivity of predicates on join keys

2011-09-02 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13095797#comment-13095797
 ] 

jirapos...@reviews.apache.org commented on HIVE-1989:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1228/
---

(Updated 2011-09-02 06:26:31.707450)


Review request for hive.


Changes
---

Rebase to trunk


Summary
---

https://issues.apache.org/jira/browse/HIVE-1989


This addresses bug HIVE-1989.
https://issues.apache.org/jira/browse/HIVE-1989


Diffs (updated)
-

  
http://svn.apache.org/repos/asf/hive/trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
 1164353 
  http://svn.apache.org/repos/asf/hive/trunk/conf/hive-default.xml 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/ColumnPrunerProcFactory.java
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/ExprWalkerProcFactory.java
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/auto_join16.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/auto_join8.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/cluster.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/index_auto_mult_tables.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/index_auto_mult_tables_compact.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join16.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join38.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join8.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/louter_join_ppr.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_clusterby.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_gby_join.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join2.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join3.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join1.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join2.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join4.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_udf_case.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/router_join_ppr.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/smb_mapjoin9.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/smb_mapjoin_6.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/udf_named_struct.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/union22.q.out
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join1.q.xml
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join2.q.xml
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join3.q.xml
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join8.q.xml
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample1.q.xml
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample2.q.xml
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample3.q.xml
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample4.q.xml
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample5.q.xml
 1164353 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample6.q.xml
 1164353 
  

[jira] [Commented] (HIVE-1989) recognize transitivity of predicates on join keys

2011-09-02 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13096118#comment-13096118
 ] 

jirapos...@reviews.apache.org commented on HIVE-1989:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1228/
---

(Updated 2011-09-02 16:49:23.015066)


Review request for hive.


Changes
---

Fixed unit tests


Summary
---

https://issues.apache.org/jira/browse/HIVE-1989


This addresses bug HIVE-1989.
https://issues.apache.org/jira/browse/HIVE-1989


Diffs (updated)
-

  
http://svn.apache.org/repos/asf/hive/trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
 1164358 
  http://svn.apache.org/repos/asf/hive/trunk/conf/hive-default.xml 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/ColumnPrunerProcFactory.java
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/ExprWalkerProcFactory.java
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/auto_join16.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/auto_join8.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/cluster.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/index_auto_mult_tables.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/index_auto_mult_tables_compact.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join16.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join38.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join8.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/louter_join_ppr.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/outer_join_ppr.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_clusterby.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_gby_join.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join2.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join3.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join1.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join2.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join3.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join4.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join5.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_udf_case.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_union.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/router_join_ppr.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/smb_mapjoin9.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/smb_mapjoin_6.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/udf_named_struct.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/union22.q.out
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join1.q.xml
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join2.q.xml
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join3.q.xml
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join8.q.xml
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample1.q.xml
 1164358 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample2.q.xml
 1164358 
  

[jira] [Commented] (HIVE-2427) Warn user that precision is lost when bigint is implicitly cast to double in joins and unions.

2011-09-02 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13096127#comment-13096127
 ] 

jirapos...@reviews.apache.org commented on HIVE-2427:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1705/
---

Review request for hive and Siying Dong.


Summary
---

I added checks before the type conversions for both joins and unions to either 
throw an error or issue a warning, depending on the mode, whenever a bigint is 
going to be converted to a double.

I also added a new variable hive.mapred.bigint.comparison.mode, which is set to 
either strict or nonstrict to indicate whether an error or a warning should be 
given when a bigint is converted to a double.  This is instead of the original 
implementation which used the variable hive.mapred.mode.
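
The strict/nonstrict behaviour described above could look roughly like the 
following. This is only a sketch and not the code from the patch: the class 
name, the Mode enum, and the in-memory warning list are hypothetical stand-ins 
for whatever SemanticAnalyzer and ErrorMsg actually do.

```java
import java.util.ArrayList;
import java.util.List;

public class BigintComparisonCheck {
    /** Hypothetical stand-in for the two values of hive.mapred.bigint.comparison.mode. */
    public enum Mode { STRICT, NONSTRICT }

    /** Warnings collected in nonstrict mode (a real implementation would log them). */
    public static final List<String> warnings = new ArrayList<>();

    private static boolean isStringOrDouble(String type) {
        return "string".equals(type) || "double".equals(type);
    }

    /** True when comparing the two types forces a bigint to be cast to double. */
    public static boolean losesPrecision(String left, String right) {
        return ("bigint".equals(left) && isStringOrDouble(right))
            || ("bigint".equals(right) && isStringOrDouble(left));
    }

    /** Throws in strict mode, records a warning in nonstrict mode. */
    public static void check(Mode mode, String left, String right) {
        if (!losesPrecision(left, right)) {
            return;
        }
        String msg = "bigint compared with " + ("bigint".equals(left) ? right : left)
            + ": implicit cast to double may lose precision";
        if (mode == Mode.STRICT) {
            throw new IllegalStateException(msg);
        }
        warnings.add(msg);
    }
}
```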


This addresses bug HIVE-2427.
https://issues.apache.org/jira/browse/HIVE-2427


Diffs
-

  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1164293
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ErrorMsg.java 1164293
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 1164293
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/ExprNodeGenericFuncDesc.java 1164293
  trunk/ql/src/test/queries/clientnegative/compare_double_bigint.q 1164293
  trunk/ql/src/test/queries/clientnegative/compare_double_bigint_join.q PRE-CREATION
  trunk/ql/src/test/queries/clientnegative/compare_double_bigint_union.q PRE-CREATION
  trunk/ql/src/test/queries/clientnegative/compare_string_bigint.q 1164293
  trunk/ql/src/test/queries/clientnegative/compare_string_bigint_join.q PRE-CREATION
  trunk/ql/src/test/queries/clientnegative/compare_string_bigint_union.q PRE-CREATION
  trunk/ql/src/test/queries/clientpositive/filter_join_breaktask2.q 1164293
  trunk/ql/src/test/results/clientnegative/compare_double_bigint.q.out 1164293
  trunk/ql/src/test/results/clientnegative/compare_double_bigint_join.q.out PRE-CREATION
  trunk/ql/src/test/results/clientnegative/compare_double_bigint_union.q.out PRE-CREATION
  trunk/ql/src/test/results/clientnegative/compare_string_bigint.q.out 1164293
  trunk/ql/src/test/results/clientnegative/compare_string_bigint_join.q.out PRE-CREATION
  trunk/ql/src/test/results/clientnegative/compare_string_bigint_union.q.out PRE-CREATION
  trunk/ql/src/test/results/clientpositive/filter_join_breaktask2.q.out 1164293

Diff: https://reviews.apache.org/r/1705/diff


Testing
---

I added several more tests to demonstrate the errors.

I also modified a unit test in which a join involved a comparison between a 
string and a bigint.  I set hive.mapred.bigint.comparison.mode to nonstrict, 
so the only change was that the new warning appeared in the output.


Thanks,

Kevin



 Warn user that precision is lost when bigint is implicitly cast to double in 
 joins and unions.
 --

 Key: HIVE-2427
 URL: https://issues.apache.org/jira/browse/HIVE-2427
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2427.1.patch.txt


 When a bigint is implicitly cast to a double (when a bigint is involved in an 
 equality expression with a string or double as part of a join condition, or 
 when a bigint column is to be combined with a string or double column as part 
 of a union) precision may be lost, resulting in unexpected behavior. Until we 
 fix the underlying issue we should throw an error in strict mode, and a 
 warning in nonstrict mode alerting the user about this.
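
 To illustrate the precision loss: a double has a 53-bit significand, so 
 distinct bigint values above 2^53 can become equal once both are cast to 
 double. A minimal, self-contained demonstration (the class and method names 
 are made up for this example and do not appear in Hive):

```java
public class BigintToDoubleDemo {
    /** True when two distinct longs become equal after both are cast to double. */
    public static boolean collideAsDouble(long a, long b) {
        return a != b && (double) a == (double) b;
    }

    public static void main(String[] args) {
        long a = 9007199254740992L; // 2^53
        long b = 9007199254740993L; // 2^53 + 1, not exactly representable as a double
        // A join on (double) a = (double) b would match these two different keys.
        System.out.println(collideAsDouble(a, b));
    }
}
```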

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1989) recognize transitivity of predicates on join keys

2011-09-01 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13095473#comment-13095473
 ] 

jirapos...@reviews.apache.org commented on HIVE-1989:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1228/
---

(Updated 2011-09-01 17:58:25.969663)


Review request for hive.


Changes
---

Fix unit tests


Summary
---

https://issues.apache.org/jira/browse/HIVE-1989


This addresses bug HIVE-1989.
https://issues.apache.org/jira/browse/HIVE-1989


Diffs (updated)
-

  
http://svn.apache.org/repos/asf/hive/trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
 1163909 
  http://svn.apache.org/repos/asf/hive/trunk/conf/hive-default.xml 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/ColumnPrunerProcFactory.java
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/ppd_transitivity.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/auto_join16.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/auto_join8.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/cluster.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/index_auto_mult_tables.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/index_auto_mult_tables_compact.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join16.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join38.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join8.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/louter_join_ppr.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_clusterby.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_gby_join.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join2.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join3.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join1.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join2.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join4.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_transitivity.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_udf_case.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/router_join_ppr.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/smb_mapjoin9.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/smb_mapjoin_6.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/udf_named_struct.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/union22.q.out
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/input4.q.xml
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join1.q.xml
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join2.q.xml
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join3.q.xml
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join8.q.xml
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample1.q.xml
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample2.q.xml
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample3.q.xml
 1163909 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample4.q.xml
 1163909 
  

[jira] [Commented] (HIVE-1989) recognize transitivity of predicates on join keys

2011-09-01 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13095619#comment-13095619
 ] 

jirapos...@reviews.apache.org commented on HIVE-1989:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1228/
---

(Updated 2011-09-01 21:31:00.970058)


Review request for hive.


Changes
---

Changed extractFinalCandidates() to decompose AND expression into parts so the 
parts can be replicated over transitively even if the whole thing can't. Added 
comments.
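
As an illustration of the decomposition idea only (this is not the code in 
extractFinalCandidates(); the Expr type and all names below are hypothetical), 
an AND tree can be flattened into its conjuncts so that each part can be 
considered for pushdown on its own:

```java
import java.util.ArrayList;
import java.util.List;

public class ConjunctSplitter {
    /** Minimal expression node: either a leaf predicate or an AND of two children. */
    public static final class Expr {
        final String text;
        final Expr left;
        final Expr right;

        private Expr(String text, Expr left, Expr right) {
            this.text = text;
            this.left = left;
            this.right = right;
        }

        public static Expr leaf(String text) {
            return new Expr(text, null, null);
        }

        public static Expr and(Expr left, Expr right) {
            return new Expr(null, left, right);
        }

        boolean isAnd() {
            return left != null;
        }
    }

    /** Returns the leaf predicates of a (possibly nested) AND, left to right. */
    public static List<String> split(Expr e) {
        List<String> out = new ArrayList<>();
        collect(e, out);
        return out;
    }

    private static void collect(Expr e, List<String> out) {
        if (e.isAnd()) {
            collect(e.left, out);
            collect(e.right, out);
        } else {
            out.add(e.text);
        }
    }
}
```

With the parts separated, a conjunct that refers only to a join key can be 
replicated to the other side even when a sibling conjunct cannot.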


Summary
---

https://issues.apache.org/jira/browse/HIVE-1989


This addresses bug HIVE-1989.
https://issues.apache.org/jira/browse/HIVE-1989


Diffs (updated)
-

  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/union.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/udf_when.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/udf_case.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/udf1.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/udf4.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/udf6.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/subq.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample4.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample5.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample6.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample7.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample3.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample2.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample1.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/input4.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join1.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join2.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join3.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join8.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/smb_mapjoin9.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/smb_mapjoin_6.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/udf_named_struct.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/union22.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/router_join_ppr.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_udf_case.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join2.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join4.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_repeated_alias2.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_transitivity.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join1.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_gby_join.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join2.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join3.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_clusterby.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join16.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join38.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join8.q.out
 1163905 
  

[jira] [Commented] (HIVE-2376) Upgrade Hive's Thrift dependency to version 0.6.1

2011-09-01 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13095639#comment-13095639
 ] 

jirapos...@reviews.apache.org commented on HIVE-2376:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1695/
---

Review request for hive and Amareshwari Sriramadasu.


Summary
---

* Upgrade Hive's Thrift dependency to version 0.7.0
* Resolve Thrift dependency using Ivy


This addresses bug HIVE-2376.
https://issues.apache.org/jira/browse/HIVE-2376


Diffs
-

  service/src/java/org/apache/hadoop/hive/service/HiveServer.java c4b041f 
  shims/ivy.xml 88fae44 
  metastore/ivy.xml 7647b37 
  metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 285908e
  ql/build.xml 15a58ef 
  ql/ivy.xml 99f3786 
  serde/ivy.xml 01c4d0c 
  service/ivy.xml 7254a09 
  build.xml f20cbb6 
  cli/ivy.xml abe7232 
  eclipse-templates/.classpath 74e5542 
  ivy/libraries.properties 4236df1 
  jdbc/ivy.xml ec26e74 
  lib/README 6cc2fba 
  lib/thrift-fb303.LICENSE 4eacb64 
  lib/thrift.LICENSE 9d189ef 

Diff: https://reviews.apache.org/r/1695/diff


Testing
---


Thanks,

Carl



 Upgrade Hive's Thrift dependency to version 0.6.1
 -

 Key: HIVE-2376
 URL: https://issues.apache.org/jira/browse/HIVE-2376
 Project: Hive
  Issue Type: Task
  Components: Build Infrastructure, Clients
Reporter: Carl Steinbach
Assignee: Carl Steinbach
 Fix For: 0.8.0

 Attachments: HIVE-2376-complete.1.patch.txt, 
 HIVE-2376-complete.2.patch.txt, HIVE-2376-no-gen.1.patch.txt, 
 HIVE-2376-no-gen.2.patch.txt, libfb303-0.7.0.jar, libthrift-0.7.0.jar








[jira] [Commented] (HIVE-2425) Print the counters SLOTS_MILLIS_MAPS and SLOTS_MILLIS_REDUCES to the Hive CLI output.

2011-09-01 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13095707#comment-13095707
 ] 

jirapos...@reviews.apache.org commented on HIVE-2425:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1697/
---

Review request for hive and Siying Dong.


Summary
---

I print the new stats out where we are already printing out the other map 
reduce stats for each job.


This addresses bug HIVE-2425.
https://issues.apache.org/jira/browse/HIVE-2425


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/MapRedStats.java 1164320 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/HadoopJobExecHelper.java 1164320

Diff: https://reviews.apache.org/r/1697/diff


Testing
---

I ran a query in local mode and verified the stats were not printed.
I ran a map only query and verified only the map time was printed.
I ran a query involving multiple jobs involving both map only and map and 
reduce, and verified that the appropriate stats were printed for each.

In all cases, I verified the query completed successfully.


Thanks,

Kevin



 Print the counters SLOTS_MILLIS_MAPS and SLOTS_MILLIS_REDUCES to the Hive CLI 
 output.
 -

 Key: HIVE-2425
 URL: https://issues.apache.org/jira/browse/HIVE-2425
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2425.1.patch.txt








[jira] [Commented] (HIVE-2304) Support PreparedStatement.setObject

2011-09-01 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13095709#comment-13095709
 ] 

jirapos...@reviews.apache.org commented on HIVE-2304:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1700/
---

Review request for hive.


Summary
---

Review for HIVE-2304 (Ido Hadanny)


This addresses bug HIVE-2304.
https://issues.apache.org/jira/browse/HIVE-2304


Diffs
-

  jdbc/src/java/org/apache/hadoop/hive/jdbc/HivePreparedStatement.java 679792c 
  jdbc/src/test/org/apache/hadoop/hive/jdbc/TestJdbcDriver.java e744d11 

Diff: https://reviews.apache.org/r/1700/diff


Testing
---


Thanks,

Carl



 Support PreparedStatement.setObject
 ---

 Key: HIVE-2304
 URL: https://issues.apache.org/jira/browse/HIVE-2304
 Project: Hive
  Issue Type: Sub-task
  Components: JDBC
Affects Versions: 0.7.1
Reporter: Ido Hadanny
Assignee: Ido Hadanny
Priority: Minor
 Fix For: 0.8.0

 Attachments: HIVE-0.8-SetObject.1.patch.txt

   Original Estimate: 1h
  Remaining Estimate: 1h

 PreparedStatement.setObject is important for spring's jdbcTemplate support





[jira] [Commented] (HIVE-2304) Support PreparedStatement.setObject

2011-09-01 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13095713#comment-13095713
 ] 

jirapos...@reviews.apache.org commented on HIVE-2304:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1700/#review1731
---



jdbc/src/java/org/apache/hadoop/hive/jdbc/HivePreparedStatement.java
https://reviews.apache.org/r/1700/#comment3948

Please add a negative testcase that exercises this exception.



jdbc/src/test/org/apache/hadoop/hive/jdbc/TestJdbcDriver.java
https://reviews.apache.org/r/1700/#comment3947

Please add a new testcase or add to an existing testcase instead of 
modifying a pre-existing testcase. You have provided test coverage for 
PreparedStatement.setObject() at the expense of removing test coverage for 
PreparedStatement.setBoolean()/setInt(), etc.
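
For context, a setObject() implementation typically dispatches on the runtime 
type of its argument to the existing typed setters and throws for types it 
cannot map, which is the path a negative testcase would exercise. The sketch 
below is hypothetical and is not the HivePreparedStatement code from the 
patch; it stores parameters in a plain map and uses naive string quoting.

```java
import java.sql.SQLException;
import java.util.HashMap;
import java.util.Map;

public class SetObjectSketch {
    /** Parameter index -> literal text that would be substituted into the query. */
    private final Map<Integer, String> parameters = new HashMap<>();

    public void setInt(int index, int x) {
        parameters.put(index, String.valueOf(x));
    }

    public void setBoolean(int index, boolean x) {
        parameters.put(index, String.valueOf(x));
    }

    public void setString(int index, String x) {
        // Simplified quoting; a real driver must escape the value.
        parameters.put(index, "'" + x + "'");
    }

    /** Dispatches to a typed setter, or throws for unsupported types. */
    public void setObject(int index, Object x) throws SQLException {
        if (x instanceof String) {
            setString(index, (String) x);
        } else if (x instanceof Integer) {
            setInt(index, (Integer) x);
        } else if (x instanceof Boolean) {
            setBoolean(index, (Boolean) x);
        } else {
            String type = (x == null) ? "null" : x.getClass().getName();
            throw new SQLException("Cannot infer a SQL type for " + type);
        }
    }

    public String parameter(int index) {
        return parameters.get(index);
    }
}
```

Keeping the typed setters as the single place where values are formatted means 
tests for setBoolean()/setInt() and tests for setObject() cover different 
paths, so both sets of tests remain useful.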


- Carl


On 2011-09-02 01:35:33, Carl Steinbach wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1700/
bq.  ---
bq.  
bq.  (Updated 2011-09-02 01:35:33)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Review for HIVE-2304 (Ido Hadanny)
bq.  
bq.  
bq.  This addresses bug HIVE-2304.
bq.  https://issues.apache.org/jira/browse/HIVE-2304
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.    jdbc/src/java/org/apache/hadoop/hive/jdbc/HivePreparedStatement.java 679792c 
bq.    jdbc/src/test/org/apache/hadoop/hive/jdbc/TestJdbcDriver.java e744d11 
bq.  
bq.  Diff: https://reviews.apache.org/r/1700/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Carl
bq.  
bq.



 Support PreparedStatement.setObject
 ---

 Key: HIVE-2304
 URL: https://issues.apache.org/jira/browse/HIVE-2304
 Project: Hive
  Issue Type: Sub-task
  Components: JDBC
Affects Versions: 0.7.1
Reporter: Ido Hadanny
Assignee: Ido Hadanny
Priority: Minor
 Attachments: HIVE-0.8-SetObject.1.patch.txt

   Original Estimate: 1h
  Remaining Estimate: 1h

 PreparedStatement.setObject is important for spring's jdbcTemplate support





[jira] [Commented] (HIVE-1989) recognize transitivity of predicates on join keys

2011-09-01 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13095784#comment-13095784
 ] 

jirapos...@reviews.apache.org commented on HIVE-1989:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1228/
---

(Updated 2011-09-02 05:47:55.711939)


Review request for hive.


Changes
---

Added unparseIdentifier for ASTNode handling


Summary
---

https://issues.apache.org/jira/browse/HIVE-1989


This addresses bug HIVE-1989.
https://issues.apache.org/jira/browse/HIVE-1989


Diffs (updated)
-

  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample6.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample4.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample5.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample3.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join8.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample1.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample2.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join3.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join2.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join1.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/input4.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/smb_mapjoin9.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/smb_mapjoin_6.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/udf_named_struct.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/union22.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_transitivity.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_udf_case.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/router_join_ppr.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join1.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join2.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join4.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_repeated_alias2.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join2.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join3.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_gby_join.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_clusterby.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join8.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/louter_join_ppr.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join16.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join38.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/index_auto_mult_tables.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/index_auto_mult_tables_compact.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/ppd_repeated_alias2.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/ppd_transitivity.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/auto_join16.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/auto_join8.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/cluster.q.out
 1163905 
  

[jira] [Commented] (HIVE-2426) Test that views with joins work properly

2011-09-01 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13095791#comment-13095791
 ] 

jirapos...@reviews.apache.org commented on HIVE-2426:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1702/
---

Review request for hive.


Summary
---

https://issues.apache.org/jira/browse/HIVE-2426


This addresses bug HIVE-2426.
https://issues.apache.org/jira/browse/HIVE-2426


Diffs
-

  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/UnparseTranslator.java
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/join_view.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join_view.q.out
 PRE-CREATION 

Diff: https://reviews.apache.org/r/1702/diff


Testing
---


Thanks,

Charles



 Test that views with joins work properly
 

 Key: HIVE-2426
 URL: https://issues.apache.org/jira/browse/HIVE-2426
 Project: Hive
  Issue Type: Test
Reporter: Charles Chen
Assignee: Charles Chen
 Attachments: HIVE-2426v2.patch


 With the testcase
 {noformat}
 drop table invites;
 drop table invites2;
 create table invites (foo int, bar string) partitioned by (ds string);
 create table invites2 (foo int, bar string) partitioned by (ds string);
 set hive.mapred.mode=strict;
 -- test join views: see HIVE-1989
 create view v as select invites.bar, invites2.foo, invites2.ds from invites 
 join invites2 on invites.ds=invites2.ds;
 explain select * from v where ds='2011-09-01';
 drop view v;
 drop table invites;
 drop table invites2;
 {noformat}
 We should not have the partition pruner complain about invites.ds not having 
 a predicate, because the predicate invites.ds='2011-09-01' will be inferred 
 from invites2.ds='2011-09-01' by the ppd transitivity optimization.
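
 The inference can be sketched as follows. This is a simplified model, not 
 Hive's optimizer code: it assumes inner equi-joins only, represents columns 
 and constants as plain strings, and all names are hypothetical.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class PredicateTransitivity {
    /**
     * Given join-key equalities (pairs of column names) and known
     * column = constant predicates, returns the known predicates plus
     * every predicate that follows by transitivity.
     */
    public static Map<String, String> infer(List<String[]> joinEqualities,
                                            Map<String, String> constants) {
        Map<String, String> result = new TreeMap<>(constants);
        boolean changed = true;
        while (changed) { // repeat until no new predicate can be derived
            changed = false;
            for (String[] eq : joinEqualities) {
                changed |= copy(result, eq[0], eq[1]);
                changed |= copy(result, eq[1], eq[0]);
            }
        }
        return result;
    }

    /** Copies the constant bound to 'from' onto 'to' if 'to' has none yet. */
    private static boolean copy(Map<String, String> m, String from, String to) {
        if (m.containsKey(from) && !m.containsKey(to)) {
            m.put(to, m.get(from));
            return true;
        }
        return false;
    }
}
```

 For the testcase above, the equality invites.ds = invites2.ds together with 
 invites2.ds = '2011-09-01' yields invites.ds = '2011-09-01', which gives the 
 partition pruner a predicate on both tables.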





[jira] [Commented] (HIVE-2337) Predicate pushdown erroneously conservative with outer joins

2011-08-31 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13095013#comment-13095013
 ] 

jirapos...@reviews.apache.org commented on HIVE-2337:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1275/
---

(Updated 2011-09-01 00:08:37.474019)


Review request for hive.


Changes
---

Fixed ppd_outer_join4.q.out


Summary
---

https://issues.apache.org/jira/browse/HIVE-2337


This addresses bug HIVE-2337.
https://issues.apache.org/jira/browse/HIVE-2337


Diffs (updated)
-

  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java
 1163856 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/ppd_outer_join5.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join4.q.out
 1163856 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join5.q.out
 PRE-CREATION 

Diff: https://reviews.apache.org/r/1275/diff


Testing
---

Unit tests passed


Thanks,

Charles



 Predicate pushdown erroneously conservative with outer joins
 

 Key: HIVE-2337
 URL: https://issues.apache.org/jira/browse/HIVE-2337
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Charles Chen
Assignee: Charles Chen
 Attachments: HIVE-2337v1.patch, HIVE-2337v2.patch, HIVE-2337v3.patch, 
 HIVE-2337v4.patch


 The predicate pushdown filter is not applying left associativity of joins 
 correctly in determining possible aliases for pushing predicates.
 In hive.ql.ppd.OpProcFactory.JoinPPD.getQualifiedAliases, the criteria for 
 pushing aliases is specified as:
 {noformat}
 /**
  * Figures out the aliases for whom it is safe to push predicates based on
  * ANSI SQL semantics For inner join, all predicates for all aliases can be
  * pushed For full outer join, none of the predicates can be pushed as that
  * would limit the number of rows for join For left outer join, all the
  * predicates on the left side aliases can be pushed up For right outer
  * join, all the predicates on the right side aliases can be pushed up Joins
  * chain containing both left and right outer joins are treated as full
  * outer join. [...]
  *
  * @param op
  *  Join Operator
  * @param rr
  *  Row resolver
  * @return set of qualified aliases
  */
 {noformat}
 Since hive joins are left associative, something like a RIGHT OUTER JOIN b 
 LEFT OUTER JOIN c INNER JOIN d should be interpreted as ((a RIGHT OUTER 
 JOIN b) LEFT OUTER JOIN c) INNER JOIN d, so there would be cases where joins 
 with both left and right outer joins can have aliases that can be pushed.  
 Here, aliases b and d are eligible to be pushed up while the current criteria 
 provide that none are eligible.
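
 The left-associative reading described above can be sketched in a few lines of 
 Java. This is only an illustration under stated assumptions, not the code from 
 the attached patches: the class JoinChainAliases, the JoinStep type and the 
 method qualifiedAliases are hypothetical names, and the rule assumed here is 
 that an inner join keeps the pushable aliases and adds its right alias, a left 
 outer join keeps them, a right outer join keeps only its right alias, and a 
 full outer join keeps none.

```java
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only; none of these names come from Hive.
class JoinChainAliases {
    enum JoinType { INNER, LEFT_OUTER, RIGHT_OUTER, FULL_OUTER }

    /** One link of the chain: "(everything joined so far) <type> JOIN <rightAlias>". */
    static final class JoinStep {
        final JoinType type;
        final String rightAlias;

        JoinStep(JoinType type, String rightAlias) {
            this.type = type;
            this.rightAlias = rightAlias;
        }
    }

    /** Walks the chain left to right and returns the aliases whose predicates may be pushed. */
    static Set<String> qualifiedAliases(String firstAlias, List<JoinStep> steps) {
        Set<String> pushable = new LinkedHashSet<>();
        pushable.add(firstAlias);
        for (JoinStep step : steps) {
            switch (step.type) {
                case INNER:
                    // Both sides are filtered by the join, so the new alias qualifies too.
                    pushable.add(step.rightAlias);
                    break;
                case LEFT_OUTER:
                    // Left side is preserved; the right alias is null-supplying.
                    break;
                case RIGHT_OUTER:
                    // Only the right alias is preserved; everything to the left is not.
                    pushable.clear();
                    pushable.add(step.rightAlias);
                    break;
                case FULL_OUTER:
                    // Neither side may be filtered before the join.
                    pushable.clear();
                    break;
            }
        }
        return pushable;
    }

    public static void main(String[] args) {
        // ((a RIGHT OUTER JOIN b) LEFT OUTER JOIN c) INNER JOIN d
        List<JoinStep> chain = Arrays.asList(
            new JoinStep(JoinType.RIGHT_OUTER, "b"),
            new JoinStep(JoinType.LEFT_OUTER, "c"),
            new JoinStep(JoinType.INNER, "d"));
        System.out.println(qualifiedAliases("a", chain)); // prints [b, d]
    }
}
```

 Under the same assumed rule, t1 FULL OUTER JOIN t2 JOIN t3 leaves only t3 
 eligible, which is why a predicate such as t3.id=20 could be pushed.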
 Using:
 {noformat}
 create table t1 (id int, key string, value string);
 create table t2 (id int, key string, value string);
 create table t3 (id int, key string, value string);
 create table t4 (id int, key string, value string);
 {noformat}
 For example, the query
 {noformat}
 explain select * from t1 full outer join t2 on t1.id=t2.id join t3 on 
 t2.id=t3.id where t3.id=20; 
 {noformat}
 currently gives
 {noformat}
 STAGE DEPENDENCIES:
   Stage-1 is a root stage
   Stage-0 is a root stage
 STAGE PLANS:
   Stage: Stage-1
 Map Reduce
   Alias -> Map Operator Tree:
 t1 
   TableScan
 alias: t1
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: id
 type: int
   tag: 0
   value expressions:
 expr: id
 type: int
 expr: key
 type: string
 expr: value
 type: string
 t2 
   TableScan
 alias: t2
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: id
 type: int
   tag: 1
   value expressions:
 expr: id
 type: int
 expr: key
 type: string
 expr: value
 

[jira] [Commented] (HIVE-2337) Predicate pushdown erroneously conservative with outer joins

2011-08-31 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13095022#comment-13095022
 ] 

jirapos...@reviews.apache.org commented on HIVE-2337:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1275/
---

(Updated 2011-09-01 00:19:17.176704)


Review request for hive.


Changes
---

Rebased to current trunk


Summary
---

https://issues.apache.org/jira/browse/HIVE-2337


This addresses bug HIVE-2337.
https://issues.apache.org/jira/browse/HIVE-2337


Diffs (updated)
-

  
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java 1163875 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join4.q.out 1163875 

Diff: https://reviews.apache.org/r/1275/diff


Testing
---

Unit tests passed


Thanks,

Charles



 Predicate pushdown erroneously conservative with outer joins
 

 Key: HIVE-2337
 URL: https://issues.apache.org/jira/browse/HIVE-2337
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Charles Chen
Assignee: Charles Chen
 Fix For: 0.9.0

 Attachments: HIVE-2337v1.patch, HIVE-2337v2.patch, HIVE-2337v3.patch, 
 HIVE-2337v4.patch, HIVE-2337v5.patch


 The predicate pushdown filter is not applying left associativity of joins 
 correctly in determining possible aliases for pushing predicates.
 In hive.ql.ppd.OpProcFactory.JoinPPD.getQualifiedAliases, the criteria for 
 pushing aliases is specified as:
 {noformat}
 /**
  * Figures out the aliases for whom it is safe to push predicates based on
  * ANSI SQL semantics For inner join, all predicates for all aliases can be
  * pushed For full outer join, none of the predicates can be pushed as that
  * would limit the number of rows for join For left outer join, all the
  * predicates on the left side aliases can be pushed up For right outer
  * join, all the predicates on the right side aliases can be pushed up Joins
  * chain containing both left and right outer joins are treated as full
  * outer join. [...]
  *
  * @param op
  *  Join Operator
  * @param rr
  *  Row resolver
  * @return set of qualified aliases
  */
 {noformat}
 Since hive joins are left associative, something like a RIGHT OUTER JOIN b 
 LEFT OUTER JOIN c INNER JOIN d should be interpreted as ((a RIGHT OUTER 
 JOIN b) LEFT OUTER JOIN c) INNER JOIN d, so there would be cases where joins 
 with both left and right outer joins can have aliases that can be pushed.  
 Here, aliases b and d are eligible to be pushed up while the current criteria 
 provide that none are eligible.
 Using:
 {noformat}
 create table t1 (id int, key string, value string);
 create table t2 (id int, key string, value string);
 create table t3 (id int, key string, value string);
 create table t4 (id int, key string, value string);
 {noformat}
 For example, the query
 {noformat}
 explain select * from t1 full outer join t2 on t1.id=t2.id join t3 on 
 t2.id=t3.id where t3.id=20; 
 {noformat}
 currently gives
 {noformat}
 STAGE DEPENDENCIES:
   Stage-1 is a root stage
   Stage-0 is a root stage
 STAGE PLANS:
   Stage: Stage-1
 Map Reduce
   Alias -> Map Operator Tree:
 t1 
   TableScan
 alias: t1
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: id
 type: int
   tag: 0
   value expressions:
 expr: id
 type: int
 expr: key
 type: string
 expr: value
 type: string
 t2 
   TableScan
 alias: t2
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: id
 type: int
   tag: 1
   value expressions:
 expr: id
 type: int
 expr: key
 type: string
 expr: value
 type: string
 t3 
   TableScan
 alias: t3
 Reduce Output Operator
   key expressions:
 expr: id

[jira] [Commented] (HIVE-2247) ALTER TABLE RENAME PARTITION

2011-08-31 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13095082#comment-13095082
 ] 

jirapos...@reviews.apache.org commented on HIVE-2247:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1105/
---

(Updated 2011-09-01 02:23:59.714244)


Review request for Siying Dong.


Changes
---

+work.getInputs().add(new ReadEntity(oldPart));
+work.getOutputs().add(new WriteEntity(newPart));


Summary
---

Implement the ALTER TABLE PARTITION RENAME function to rename a partition. 
Add the HiveQL syntax ALTER TABLE bar PARTITION (k1='v1', k2='v2') RENAME TO 
PARTITION (k1='v3', k2='v4');
This is my first Hive diff. I learned everything from the existing codebase and 
may not understand it well yet. 
Please let me know if I got something wrong. Thanks.


This addresses bug HIVE-2247.
https://issues.apache.org/jira/browse/HIVE-2247


Diffs (updated)
-

  trunk/metastore/if/hive_metastore.thrift 1145366 
  trunk/metastore/src/gen/thrift/gen-cpp/ThriftHiveMetastore.h 1145366 
  trunk/metastore/src/gen/thrift/gen-cpp/ThriftHiveMetastore.cpp 1145366 
  
trunk/metastore/src/gen/thrift/gen-cpp/ThriftHiveMetastore_server.skeleton.cpp 
1145366 
  
trunk/metastore/src/gen/thrift/gen-javabean/org/apache/hadoop/hive/metastore/api/ThriftHiveMetastore.java
 1145366 
  trunk/metastore/src/gen/thrift/gen-php/hive_metastore/ThriftHiveMetastore.php 
1145366 
  
trunk/metastore/src/gen/thrift/gen-py/hive_metastore/ThriftHiveMetastore-remote 
1145366 
  trunk/metastore/src/gen/thrift/gen-py/hive_metastore/ThriftHiveMetastore.py 
1145366 
  trunk/metastore/src/gen/thrift/gen-rb/thrift_hive_metastore.rb 1145366 
  
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveAlterHandler.java 
1145366 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 
1145366 
  
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStoreClient.java
 1145366 
  
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/IMetaStoreClient.java 
1145366 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java 
1145366 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/RawStore.java 
1145366 
  
trunk/metastore/src/test/org/apache/hadoop/hive/metastore/TestHiveMetaStore.java
 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Partition.java 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java 
1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g 1145366 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzerFactory.java 
1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/AlterTableDesc.java 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/DDLWork.java 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/HiveOperation.java 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/RenamePartitionDesc.java 
PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/alter_rename_partition_failure.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/alter_rename_partition_failure2.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/alter_rename_partition_failure3.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/alter_rename_partition.q 
PRE-CREATION 
  
trunk/ql/src/test/queries/clientpositive/alter_rename_partition_authorization.q 
PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/alter_rename_partition_failure.q.out 
PRE-CREATION 
  
trunk/ql/src/test/results/clientnegative/alter_rename_partition_failure2.q.out 
PRE-CREATION 
  
trunk/ql/src/test/results/clientnegative/alter_rename_partition_failure3.q.out 
PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/alter_rename_partition.q.out 
PRE-CREATION 
  
trunk/ql/src/test/results/clientpositive/alter_rename_partition_authorization.q.out
 PRE-CREATION 

Diff: https://reviews.apache.org/r/1105/diff


Testing
---

Add a partition A in the table
Rename partition A to partition B
Show the partitions in the table, it returns partition B.
SELECT the data from partition A, it returns no results
SELECT the data from partition B, it returns the data originally stored in 
partition A
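
The expected behaviour of the steps above can be modelled with a small in-memory 
sketch. It is not the patch itself and does not use the Hive metastore API: the 
class PartitionRenameSketch and its methods are hypothetical, a TreeMap stands in 
for the metastore and data files, and the two failure conditions are assumptions 
rather than a description of the alter_rename_partition_failure tests.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical in-memory model; it does not touch the real metastore.
class PartitionRenameSketch {
    // Partition name -> rows stored in that partition.
    private final Map<String, List<String>> partitions = new TreeMap<>();

    void addPartition(String name, String... rows) {
        partitions.put(name, new ArrayList<>(Arrays.asList(rows)));
    }

    /** Moves the data of oldName under newName (failure rules are assumptions). */
    void renamePartition(String oldName, String newName) {
        if (!partitions.containsKey(oldName)) {
            throw new IllegalArgumentException("no such partition: " + oldName);
        }
        if (partitions.containsKey(newName)) {
            throw new IllegalArgumentException("partition already exists: " + newName);
        }
        partitions.put(newName, partitions.remove(oldName));
    }

    List<String> showPartitions() {
        return new ArrayList<>(partitions.keySet());
    }

    /** Returns the rows of a partition, or an empty list if it does not exist. */
    List<String> select(String name) {
        List<String> rows = partitions.get(name);
        return rows == null ? Collections.<String>emptyList() : rows;
    }

    public static void main(String[] args) {
        PartitionRenameSketch table = new PartitionRenameSketch();
        table.addPartition("A", "row1", "row2");
        table.renamePartition("A", "B");
        System.out.println(table.showPartitions()); // prints [B]
        System.out.println(table.select("A"));      // prints []
        System.out.println(table.select("B"));      // prints [row1, row2]
    }
}
```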


Thanks,

Weiyan



 ALTER TABLE RENAME PARTITION
 

 Key: HIVE-2247
 URL: https://issues.apache.org/jira/browse/HIVE-2247
 Project: Hive
  Issue Type: New Feature
Reporter: Siying Dong
Assignee: Weiyan Wang
 Attachments: HIVE-2247.3.patch.txt, 

[jira] [Commented] (HIVE-2337) Predicate pushdown erroneously conservative with outer joins

2011-08-31 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13095104#comment-13095104
 ] 

jirapos...@reviews.apache.org commented on HIVE-2337:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1275/#review1710
---



http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java
https://reviews.apache.org/r/1275/#comment3884

There is a weird non-ASCII character on this line.


- John


On 2011-09-01 00:19:17, Charles Chen wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1275/
bq.  ---
bq.  
bq.  (Updated 2011-09-01 00:19:17)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  https://issues.apache.org/jira/browse/HIVE-2337
bq.  
bq.  
bq.  This addresses bug HIVE-2337.
bq.  https://issues.apache.org/jira/browse/HIVE-2337
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java
 1163875 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join4.q.out
 1163875 
bq.  
bq.  Diff: https://reviews.apache.org/r/1275/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Unit tests passed
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Charles
bq.  
bq.



 Predicate pushdown erroneously conservative with outer joins
 

 Key: HIVE-2337
 URL: https://issues.apache.org/jira/browse/HIVE-2337
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Charles Chen
Assignee: Charles Chen
 Fix For: 0.9.0

 Attachments: HIVE-2337v1.patch, HIVE-2337v2.patch, HIVE-2337v3.patch, 
 HIVE-2337v4.patch, HIVE-2337v5.patch


 The predicate pushdown filter is not applying left associativity of joins 
 correctly in determining possible aliases for pushing predicates.
 In hive.ql.ppd.OpProcFactory.JoinPPD.getQualifiedAliases, the criteria for 
 pushing aliases is specified as:
 {noformat}
 /**
  * Figures out the aliases for whom it is safe to push predicates based on
  * ANSI SQL semantics For inner join, all predicates for all aliases can be
  * pushed For full outer join, none of the predicates can be pushed as that
  * would limit the number of rows for join For left outer join, all the
  * predicates on the left side aliases can be pushed up For right outer
  * join, all the predicates on the right side aliases can be pushed up Joins
  * chain containing both left and right outer joins are treated as full
  * outer join. [...]
  *
  * @param op
  *  Join Operator
  * @param rr
  *  Row resolver
  * @return set of qualified aliases
  */
 {noformat}
 Since hive joins are left associative, something like a RIGHT OUTER JOIN b 
 LEFT OUTER JOIN c INNER JOIN d should be interpreted as ((a RIGHT OUTER 
 JOIN b) LEFT OUTER JOIN c) INNER JOIN d, so there would be cases where joins 
 with both left and right outer joins can have aliases that can be pushed.  
 Here, aliases b and d are eligible to be pushed up while the current criteria 
 provide that none are eligible.
 Using:
 {noformat}
 create table t1 (id int, key string, value string);
 create table t2 (id int, key string, value string);
 create table t3 (id int, key string, value string);
 create table t4 (id int, key string, value string);
 {noformat}
 For example, the query
 {noformat}
 explain select * from t1 full outer join t2 on t1.id=t2.id join t3 on 
 t2.id=t3.id where t3.id=20; 
 {noformat}
 currently gives
 {noformat}
 STAGE DEPENDENCIES:
   Stage-1 is a root stage
   Stage-0 is a root stage
 STAGE PLANS:
   Stage: Stage-1
 Map Reduce
   Alias -> Map Operator Tree:
 t1 
   TableScan
 alias: t1
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: id
 type: int
   tag: 0
   value expressions:
 expr: id
 type: int
 expr: key
 type: string
 expr: value
 type: string
 t2 
   TableScan
 alias: t2
 Reduce Output Operator
   

[jira] [Commented] (HIVE-2337) Predicate pushdown erroneously conservative with outer joins

2011-08-31 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13095110#comment-13095110
 ] 

jirapos...@reviews.apache.org commented on HIVE-2337:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1275/
---

(Updated 2011-09-01 04:26:59.076177)


Review request for hive.


Changes
---

Oops, restored the dropped unit test and fixed the non-ASCII javadoc character


Summary
---

https://issues.apache.org/jira/browse/HIVE-2337


This addresses bug HIVE-2337.
https://issues.apache.org/jira/browse/HIVE-2337


Diffs (updated)
-

  
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join5.q.out PRE-CREATION 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java 1163875 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/ppd_outer_join5.q PRE-CREATION 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join4.q.out 1163875 

Diff: https://reviews.apache.org/r/1275/diff


Testing
---

Unit tests passed


Thanks,

Charles



 Predicate pushdown erroneously conservative with outer joins
 

 Key: HIVE-2337
 URL: https://issues.apache.org/jira/browse/HIVE-2337
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Charles Chen
Assignee: Charles Chen
 Fix For: 0.9.0

 Attachments: HIVE-2337v1.patch, HIVE-2337v2.patch, HIVE-2337v3.patch, 
 HIVE-2337v4.patch, HIVE-2337v5.patch


 The predicate pushdown filter is not applying left associativity of joins 
 correctly in determining possible aliases for pushing predicates.
 In hive.ql.ppd.OpProcFactory.JoinPPD.getQualifiedAliases, the criteria for 
 pushing aliases is specified as:
 {noformat}
 /**
  * Figures out the aliases for whom it is safe to push predicates based on
  * ANSI SQL semantics For inner join, all predicates for all aliases can be
  * pushed For full outer join, none of the predicates can be pushed as that
  * would limit the number of rows for join For left outer join, all the
  * predicates on the left side aliases can be pushed up For right outer
  * join, all the predicates on the right side aliases can be pushed up Joins
  * chain containing both left and right outer joins are treated as full
  * outer join. [...]
  *
  * @param op
  *  Join Operator
  * @param rr
  *  Row resolver
  * @return set of qualified aliases
  */
 {noformat}
 Since hive joins are left associative, something like a RIGHT OUTER JOIN b 
 LEFT OUTER JOIN c INNER JOIN d should be interpreted as ((a RIGHT OUTER 
 JOIN b) LEFT OUTER JOIN c) INNER JOIN d, so there would be cases where joins 
 with both left and right outer joins can have aliases that can be pushed.  
 Here, aliases b and d are eligible to be pushed up while the current criteria 
 provide that none are eligible.
 Using:
 {noformat}
 create table t1 (id int, key string, value string);
 create table t2 (id int, key string, value string);
 create table t3 (id int, key string, value string);
 create table t4 (id int, key string, value string);
 {noformat}
 For example, the query
 {noformat}
 explain select * from t1 full outer join t2 on t1.id=t2.id join t3 on 
 t2.id=t3.id where t3.id=20; 
 {noformat}
 currently gives
 {noformat}
 STAGE DEPENDENCIES:
   Stage-1 is a root stage
   Stage-0 is a root stage
 STAGE PLANS:
   Stage: Stage-1
 Map Reduce
   Alias -> Map Operator Tree:
 t1 
   TableScan
 alias: t1
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: id
 type: int
   tag: 0
   value expressions:
 expr: id
 type: int
 expr: key
 type: string
 expr: value
 type: string
 t2 
   TableScan
 alias: t2
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: id
 type: int
   tag: 1
   value expressions:
 expr: id
 type: int
 expr: key
 

[jira] [Commented] (HIVE-2337) Predicate pushdown erroneously conservative with outer joins

2011-08-31 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13095116#comment-13095116
 ] 

jirapos...@reviews.apache.org commented on HIVE-2337:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1275/
---

(Updated 2011-09-01 04:42:25.815081)


Review request for hive.


Changes
---

Added TestParse changes


Summary
---

https://issues.apache.org/jira/browse/HIVE-2337


This addresses bug HIVE-2337.
https://issues.apache.org/jira/browse/HIVE-2337


Diffs (updated)
-

  
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java 1163875 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/ppd_outer_join5.q PRE-CREATION 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join4.q.out 1163875 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join5.q.out PRE-CREATION 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/input4.q.xml 1163875 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join8.q.xml 1163875 

Diff: https://reviews.apache.org/r/1275/diff


Testing
---

Unit tests passed


Thanks,

Charles



 Predicate pushdown erroneously conservative with outer joins
 

 Key: HIVE-2337
 URL: https://issues.apache.org/jira/browse/HIVE-2337
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Charles Chen
Assignee: Charles Chen
 Fix For: 0.9.0

 Attachments: HIVE-2337v1.patch, HIVE-2337v2.patch, HIVE-2337v3.patch, 
 HIVE-2337v4.patch, HIVE-2337v5.patch, HIVE-2337v6.patch


 The predicate pushdown filter is not applying left associativity of joins 
 correctly in determining possible aliases for pushing predicates.
 In hive.ql.ppd.OpProcFactory.JoinPPD.getQualifiedAliases, the criteria for 
 pushing aliases is specified as:
 {noformat}
 /**
  * Figures out the aliases for whom it is safe to push predicates based on
  * ANSI SQL semantics For inner join, all predicates for all aliases can be
  * pushed For full outer join, none of the predicates can be pushed as that
  * would limit the number of rows for join For left outer join, all the
  * predicates on the left side aliases can be pushed up For right outer
  * join, all the predicates on the right side aliases can be pushed up Joins
  * chain containing both left and right outer joins are treated as full
  * outer join. [...]
  *
  * @param op
  *  Join Operator
  * @param rr
  *  Row resolver
  * @return set of qualified aliases
  */
 {noformat}
 Since hive joins are left associative, something like a RIGHT OUTER JOIN b 
 LEFT OUTER JOIN c INNER JOIN d should be interpreted as ((a RIGHT OUTER 
 JOIN b) LEFT OUTER JOIN c) INNER JOIN d, so there would be cases where joins 
 with both left and right outer joins can have aliases that can be pushed.  
 Here, aliases b and d are eligible to be pushed up while the current criteria 
 provide that none are eligible.
 Using:
 {noformat}
 create table t1 (id int, key string, value string);
 create table t2 (id int, key string, value string);
 create table t3 (id int, key string, value string);
 create table t4 (id int, key string, value string);
 {noformat}
 For example, the query
 {noformat}
 explain select * from t1 full outer join t2 on t1.id=t2.id join t3 on 
 t2.id=t3.id where t3.id=20; 
 {noformat}
 currently gives
 {noformat}
 STAGE DEPENDENCIES:
   Stage-1 is a root stage
   Stage-0 is a root stage
 STAGE PLANS:
   Stage: Stage-1
 Map Reduce
   Alias -> Map Operator Tree:
 t1 
   TableScan
 alias: t1
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: id
 type: int
   tag: 0
   value expressions:
 expr: id
 type: int
 expr: key
 type: string
 expr: value
 type: string
 t2 
   TableScan
 alias: t2
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: 

[jira] [Commented] (HIVE-1989) recognize transitivity of predicates on join keys

2011-08-31 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13095134#comment-13095134
 ] 

jirapos...@reviews.apache.org commented on HIVE-1989:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1228/
---

(Updated 2011-09-01 05:43:51.993258)


Review request for hive.


Changes
---

Rebase to trunk


Summary
---

https://issues.apache.org/jira/browse/HIVE-1989


This addresses bug HIVE-1989.
https://issues.apache.org/jira/browse/HIVE-1989


Diffs (updated)
-

  
http://svn.apache.org/repos/asf/hive/trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
 1163905 
  http://svn.apache.org/repos/asf/hive/trunk/conf/hive-default.xml 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/ColumnPrunerProcFactory.java
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/ppd_transitivity.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/auto_join16.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/cluster.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/index_auto_mult_tables.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/index_auto_mult_tables_compact.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join16.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join38.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/louter_join_ppr.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_clusterby.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_gby_join.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join2.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_join3.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join1.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join2.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join4.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_transitivity.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_udf_case.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/router_join_ppr.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/smb_mapjoin9.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/smb_mapjoin_6.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/udf_named_struct.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/union22.q.out
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join1.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join2.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join3.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample1.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample2.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample3.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample4.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample5.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample6.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/sample7.q.xml
 1163905 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/subq.q.xml
 1163905 
  

[jira] [Commented] (HIVE-2415) disallow partition column names when doing replace columns

2011-08-30 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13093498#comment-13093498
 ] 

jirapos...@reviews.apache.org commented on HIVE-2415:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1672/#review1686
---



trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Table.java
https://reviews.apache.org/r/1672/#comment3842

remove TAB



trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java
https://reviews.apache.org/r/1672/#comment3844

tab



trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java
https://reviews.apache.org/r/1672/#comment3843

tab



trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java
https://reviews.apache.org/r/1672/#comment3845

Does this change to the metastore object cause the mapped DB table to change? It 
looks dangerous here. 



trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java
https://reviews.apache.org/r/1672/#comment3846

Why do we need to clear the deserializer here? Is it to make sure the column 
names do not come from the deserializer?


- Ning


On 2011-08-28 23:14:12, Yongqiang He wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1672/
bq.  ---
bq.  
bq.  (Updated 2011-08-28 23:14:12)
bq.  
bq.  
bq.  Review request for hive and Ning Zhang.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  move validation of renaming/replacing columns from runtime to parser
bq.  
bq.  
bq.  This addresses bug HIVE-2415.
bq.  https://issues.apache.org/jira/browse/HIVE-2415
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java 1162190 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Table.java 1162190 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java 
1162190 
bq.trunk/ql/src/test/queries/clientnegative/replace_columns.q PRE-CREATION 
bq.trunk/ql/src/test/queries/clientnegative/replace_columns_2.q 
PRE-CREATION 
bq.trunk/ql/src/test/queries/clientnegative/replace_columns_3.q 
PRE-CREATION 
bq.trunk/ql/src/test/results/clientnegative/altern1.q.out 1162190 
bq.trunk/ql/src/test/results/clientnegative/column_rename1.q.out 1162190 
bq.trunk/ql/src/test/results/clientnegative/column_rename2.q.out 1162190 
bq.trunk/ql/src/test/results/clientnegative/column_rename4.q.out 1162190 
bq.trunk/ql/src/test/results/clientnegative/replace_columns.q.out 
PRE-CREATION 
bq.trunk/ql/src/test/results/clientnegative/replace_columns_2.q.out 
PRE-CREATION 
bq.trunk/ql/src/test/results/clientnegative/replace_columns_3.q.out 
PRE-CREATION 
bq.trunk/ql/src/test/results/clientnegative/replace_columns_4.q.out 
PRE-CREATION 
bq.trunk/serde/src/java/org/apache/hadoop/hive/serde2/SerDeUtils.java 
1162190 
bq.  
bq.  Diff: https://reviews.apache.org/r/1672/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Yongqiang
bq.  
bq.



 disallow partition column names when doing replace columns
 --

 Key: HIVE-2415
 URL: https://issues.apache.org/jira/browse/HIVE-2415
 Project: Hive
  Issue Type: Bug
Reporter: He Yongqiang
Assignee: He Yongqiang
 Attachments: HIVE-2415.1.patch


 ALTER TABLE ... REPLACE COLUMNS allows adding a column with the same name as 
 a partition column, which introduces an inconsistency. 
 We should disallow this. 
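
The check described above can be sketched roughly as follows. This is only an illustration, not the code in HIVE-2415.1.patch: the class name, the method names, and the choice to compare names case-insensitively are all assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper; not taken from the Hive source tree.
class ReplaceColumnsCheck {

    // Returns the requested column names that collide with a partition column.
    // Case-insensitive comparison is an assumption of this sketch.
    static List<String> conflictingColumns(List<String> newColumns,
                                           List<String> partitionColumns) {
        Set<String> partNames = new HashSet<>();
        for (String p : partitionColumns) {
            partNames.add(p.toLowerCase(Locale.ROOT));
        }
        List<String> conflicts = new ArrayList<>();
        for (String c : newColumns) {
            if (partNames.contains(c.toLowerCase(Locale.ROOT))) {
                conflicts.add(c);
            }
        }
        return conflicts;
    }

    // Rejects the statement before it runs if any name collides.
    static void validate(List<String> newColumns, List<String> partitionColumns) {
        List<String> conflicts = conflictingColumns(newColumns, partitionColumns);
        if (!conflicts.isEmpty()) {
            throw new IllegalArgumentException(
                "Column name conflicts with partition column: " + conflicts);
        }
    }

    public static void main(String[] args) {
        List<String> partitionColumns = Arrays.asList("ds");
        validate(Arrays.asList("id", "foo"), partitionColumns); // no conflict
        System.out.println(conflictingColumns(Arrays.asList("id", "DS"), partitionColumns));
    }
}
```

Doing the check before execution matches the review summary ("move validation of renaming/replacing columns from runtime to parser"), although where exactly the real check lives is not shown in this message.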

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2382) Invalid predicate pushdown from incorrect column expression map for select operator generated by GROUP BY operation

2011-08-30 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13094102#comment-13094102
 ] 

jirapos...@reviews.apache.org commented on HIVE-2382:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1566/
---

(Updated 2011-08-30 21:07:18.651251)


Review request for hive.


Changes
---

TestParse test output different


Summary
---

https://issues.apache.org/jira/browse/HIVE-2382


This addresses bug HIVE-2382.
https://issues.apache.org/jira/browse/HIVE-2382


Diffs (updated)
-

  
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 1163362 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/groupby_ppd.q PRE-CREATION 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/groupby_ppd.q.out PRE-CREATION 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/groupby1.q.xml 1163362 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/groupby2.q.xml 1163362 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/groupby3.q.xml 1163362 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/groupby4.q.xml 1163362 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/groupby5.q.xml 1163362 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/groupby6.q.xml 1163362 

Diff: https://reviews.apache.org/r/1566/diff


Testing
---

Unit tests passed


Thanks,

Charles



 Invalid predicate pushdown from incorrect column expression map for select 
 operator generated by GROUP BY operation
 ---

 Key: HIVE-2382
 URL: https://issues.apache.org/jira/browse/HIVE-2382
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: Charles Chen
Assignee: Charles Chen
Priority: Critical
 Fix For: 0.8.0

 Attachments: HIVE-2382v1.patch


 When a GROUP BY is specified, a select operator is added before the GROUP BY 
 in SemanticAnalyzer.insertSelectAllPlanForGroupBy.  Currently, the column 
 expression map for this is set to the column expression map for the parent 
 operator.  This behavior is incorrect as, for example, the parent operator 
 could rearrange the order of the columns (_col0 = _col0, _col1 = _col2, 
 _col2 = _col1) and the new operator should not repeat this.
 The predicate pushdown optimization uses the column expression map to track 
 which columns a filter expression refers to at different operators.  This 
 results in a filter on incorrect columns.
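
A minimal sketch of why reusing the parent's map goes wrong. It assumes a simplified model in which a column expression map is just a map from an operator's output column name to its input column name (in Hive the values are expression descriptors), and every class and method name here is hypothetical.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical model; not Hive's operator or expression classes.
class ColExprMapSketch {

    // Rewrites a column reference from an operator's output to its input.
    static String pushThrough(String column, Map<String, String> colExprMap) {
        String mapped = colExprMap.get(column);
        if (mapped == null) {
            throw new IllegalArgumentException("unknown column: " + column);
        }
        return mapped;
    }

    // The reordering parent from the description:
    // _col0 = _col0, _col1 = _col2, _col2 = _col1.
    static Map<String, String> reorderingParent() {
        Map<String, String> m = new LinkedHashMap<>();
        m.put("_col0", "_col0");
        m.put("_col1", "_col2");
        m.put("_col2", "_col1");
        return m;
    }

    // What a pass-through select should carry: every column maps to itself.
    static Map<String, String> passThrough(Map<String, String> parent) {
        Map<String, String> m = new LinkedHashMap<>();
        for (String out : parent.keySet()) {
            m.put(out, out);
        }
        return m;
    }

    // Pushes a column reference below the inserted select, then below the parent.
    static String pushDown(String column,
                           Map<String, String> selectMap,
                           Map<String, String> parentMap) {
        return pushThrough(pushThrough(column, selectMap), parentMap);
    }

    public static void main(String[] args) {
        Map<String, String> parent = reorderingParent();
        // Select carries an identity map: the filter lands on the intended column.
        System.out.println(pushDown("_col1", passThrough(parent), parent));
        // Select reuses the parent's map: the reordering is applied twice.
        System.out.println(pushDown("_col1", parent, parent));
    }
}
```

In this model the two calls resolve the same filter column to different input columns, which is the kind of mismatch the description reports.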
 Here is a simple case of this going wrong: Using
 {noformat}
 create table invites (id int, foo int, bar int);
 {noformat}
 executing the query
 {noformat}
 explain select * from (select foo, bar from (select bar, foo from invites c 
 union all select bar, foo from invites d) b) a group by bar, foo having bar=1;
 {noformat}
 results in
 {noformat}
 STAGE DEPENDENCIES:
   Stage-1 is a root stage
   Stage-0 is a root stage
 STAGE PLANS:
   Stage: Stage-1
 Map Reduce
   Alias -> Map Operator Tree:
 a-subquery1:b-subquery1:c 
   TableScan
 alias: c
 Filter Operator
   predicate:
   expr: (foo = 1)
   type: boolean
   Select Operator
 expressions:
   expr: bar
   type: int
   expr: foo
   type: int
 outputColumnNames: _col0, _col1
 Union
   Select Operator
 expressions:
   expr: _col1
   type: int
   expr: _col0
   type: int
 outputColumnNames: _col0, _col1
 Select Operator
   expressions:
 expr: _col0
 type: int
 expr: _col1
 type: int
   outputColumnNames: _col0, _col1
   Group By Operator
 bucketGroup: false
 keys:
   expr: _col1
   type: int
   expr: _col0
   type: int
 mode: hash

[jira] [Commented] (HIVE-2383) Incorrect alias filtering for predicate pushdown

2011-08-30 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13094171#comment-13094171
 ] 

jirapos...@reviews.apache.org commented on HIVE-2383:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1568/
---

(Updated 2011-08-30 22:46:54.310929)


Review request for hive.


Summary
---

https://issues.apache.org/jira/browse/HIVE-2383


This addresses bug HIVE-2383.
https://issues.apache.org/jira/browse/HIVE-2383


Diffs (updated)
-

  
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/ExprWalkerInfo.java 1163438 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/ExprWalkerProcFactory.java 1163438 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java 1163438 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/ppd_repeated_alias.q PRE-CREATION 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_repeated_alias.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1568/diff


Testing
---


Thanks,

Charles



 Incorrect alias filtering for predicate pushdown
 

 Key: HIVE-2383
 URL: https://issues.apache.org/jira/browse/HIVE-2383
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: Charles Chen
Assignee: Charles Chen
Priority: Critical
 Fix For: 0.8.0

 Attachments: HIVE-2383v1.patch, HIVE-2383v2.patch


 The predicate pushdown optimizer starts at the topmost operators and traverses 
 the operator tree, at each stage collecting predicates to be pushed down.  At 
 each operator, hive.ql.ppd.OpProcFactory.DefaultPPD.mergeWithChildrenPred is 
 called, which merges the predicates of the children nodes into the current 
 node.  The predicates are stored in hive.ql.ppd.ExprWalkerInfo.pushdownPreds 
 as a map from the alias a predicate refers to (a predicate may only refer to 
 one alias at a time as only such predicates can be pushed) to a list of such 
 predicates.  Since at each stage the alias the predicate refers to may change 
 (subqueries may change aliases), this is updated for each operator 
 (hive.ql.ppd.ExprWalkerProcFactory.extractPushdownPreds is called which walks 
 the ExprNodeDesc for each predicate). When a JoinOperator is encountered, 
 mergeWithChildrenPred is passed an optional parameter aliases which 
 contains a set of aliases that can be pushed per ansi semantics (see 
 hive.ql.ppd.OpProcFactory.JoinPPD.getQualifiedAliases).  The part that is 
 incorrect is that aliases are filtered in mergeWithChildrenPred before 
 extractPushdownPreds is called, which associates the predicates with the 
 correct alias in the current operator's context while the filtering should 
 happen after.
 In test case Q2 below, when the predicate a.bar=3 comes into the 
 JoinOperator, its incoming alias is a, so it is accepted for pushdown.  
 When brought into the JoinOperator's context, however, the predicate 
 refers to b.bar in the inner scope, so we should not actually accept it for 
 pushdown.
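
The ordering problem can be sketched as follows, assuming a simplified model: a predicate is reduced to the column reference it uses, the re-association done by extractPushdownPreds is modelled as a plain map from outer to inner column references, and the qualified aliases are a set of strings. All names are hypothetical and none of this is Hive's actual code.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical model of the two orderings; not Hive's PPD classes.
class AliasFilterOrderSketch {

    // "a.bar" -> "a"
    static String aliasOf(String columnRef) {
        return columnRef.substring(0, columnRef.indexOf('.'));
    }

    // Order described as incorrect: filter on the incoming alias,
    // then re-associate the predicate with the join's own aliases.
    static List<String> filterThenTranslate(List<String> preds,
                                            Set<String> qualifiedAliases,
                                            Map<String, String> outerToInner) {
        List<String> pushed = new ArrayList<>();
        for (String p : preds) {
            if (qualifiedAliases.contains(aliasOf(p))) {
                pushed.add(outerToInner.getOrDefault(p, p));
            }
        }
        return pushed;
    }

    // Order the description asks for: re-associate first, then filter.
    static List<String> translateThenFilter(List<String> preds,
                                            Set<String> qualifiedAliases,
                                            Map<String, String> outerToInner) {
        List<String> pushed = new ArrayList<>();
        for (String p : preds) {
            String inner = outerToInner.getOrDefault(p, p);
            if (qualifiedAliases.contains(aliasOf(inner))) {
                pushed.add(inner);
            }
        }
        return pushed;
    }

    public static void main(String[] args) {
        // Q2: the outer a.bar is the inner b.bar, and only the left side
        // (alias a) of the LEFT OUTER JOIN qualifies for pushdown.
        List<String> preds = Arrays.asList("a.bar");
        Set<String> qualified = new HashSet<>(Collections.singletonList("a"));
        Map<String, String> outerToInner = new HashMap<>();
        outerToInner.put("a.bar", "b.bar");

        System.out.println(filterThenTranslate(preds, qualified, outerToInner));
        System.out.println(translateThenFilter(preds, qualified, outerToInner));
    }
}
```

With the Q2 inputs, the first ordering still pushes a predicate onto the right side of the left outer join, while the second pushes nothing.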
 With the test cases
 {noformat}
 -- Q1: predicate should not be pushed on the right side of a left outer join 
 (this is correct in trunk)
 explain
 SELECT a.foo as foo1, b.foo as foo2, b.bar
 FROM pokes a LEFT OUTER JOIN pokes2 b
 ON a.foo=b.foo
 WHERE b.bar=3;
 -- Q2: predicate should not be pushed on the right side of a left outer join 
 (this is broken in trunk)
 explain
 SELECT * FROM
 (SELECT a.foo as foo1, b.foo as foo2, b.bar
 FROM pokes a LEFT OUTER JOIN pokes2 b
 ON a.foo=b.foo) a
 WHERE a.bar=3;
 -- Q3: predicate should be pushed (this is correct in trunk)
 explain
 SELECT * FROM
 (SELECT a.foo as foo1, b.foo as foo2, a.bar
 FROM pokes a JOIN pokes2 b
 ON a.foo=b.foo) a
 WHERE a.bar=3;
 {noformat}
 The current output is
 {noformat}
 hive 
  -- Q1: predicate should not be pushed on the right side of a left outer 
 join
  explain
  SELECT a.foo as foo1, b.foo as foo2, b.bar
  FROM pokes a LEFT OUTER JOIN pokes2 b
  ON a.foo=b.foo
  WHERE b.bar=3;
 OK
 ABSTRACT SYNTAX TREE:
   (TOK_QUERY (TOK_FROM (TOK_LEFTOUTERJOIN (TOK_TABREF (TOK_TABNAME pokes) a) 
 (TOK_TABREF (TOK_TABNAME pokes2) b) (= (. (TOK_TABLE_OR_COL a) foo) (. 
 (TOK_TABLE_OR_COL b) foo (TOK_INSERT (TOK_DESTINATION (TOK_DIR 
 TOK_TMP_FILE)) (TOK_SELECT (TOK_SELEXPR (. (TOK_TABLE_OR_COL a) foo) foo1) 
 (TOK_SELEXPR (. (TOK_TABLE_OR_COL b) foo) foo2) (TOK_SELEXPR (. 
 (TOK_TABLE_OR_COL b) bar))) 

[jira] [Commented] (HIVE-2383) Incorrect alias filtering for predicate pushdown

2011-08-30 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13094183#comment-13094183
 ] 

jirapos...@reviews.apache.org commented on HIVE-2383:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1568/
---

(Updated 2011-08-30 23:30:03.671952)


Review request for hive.


Changes
---

fixed nullpointer


Summary
---

https://issues.apache.org/jira/browse/HIVE-2383


This addresses bug HIVE-2383.
https://issues.apache.org/jira/browse/HIVE-2383


Diffs (updated)
-

  
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/ExprWalkerInfo.java 1163438 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/ExprWalkerProcFactory.java 1163438 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java 1163438 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/ppd_repeated_alias.q PRE-CREATION 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_repeated_alias.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1568/diff


Testing
---


Thanks,

Charles



 Incorrect alias filtering for predicate pushdown
 

 Key: HIVE-2383
 URL: https://issues.apache.org/jira/browse/HIVE-2383
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: Charles Chen
Assignee: Charles Chen
Priority: Critical
 Fix For: 0.8.0

 Attachments: HIVE-2383v1.patch, HIVE-2383v2.patch


 The predicate pushdown optimizer starts at the topmost operators and traverses 
 the operator tree, at each stage collecting predicates to be pushed down.  At 
 each operator, hive.ql.ppd.OpProcFactory.DefaultPPD.mergeWithChildrenPred is 
 called, which merges the predicates of the children nodes into the current 
 node.  The predicates are stored in hive.ql.ppd.ExprWalkerInfo.pushdownPreds 
 as a map from the alias a predicate refers to (a predicate may only refer to 
 one alias at a time as only such predicates can be pushed) to a list of such 
 predicates.  Since at each stage the alias the predicate refers to may change 
 (subqueries may change aliases), this is updated for each operator 
 (hive.ql.ppd.ExprWalkerProcFactory.extractPushdownPreds is called which walks 
 the ExprNodeDesc for each predicate). When a JoinOperator is encountered, 
 mergeWithChildrenPred is passed an optional parameter aliases which 
 contains a set of aliases that can be pushed per ansi semantics (see 
 hive.ql.ppd.OpProcFactory.JoinPPD.getQualifiedAliases).  The part that is 
 incorrect is that aliases are filtered in mergeWithChildrenPred before 
 extractPushdownPreds is called, which associates the predicates with the 
 correct alias in the current operator's context while the filtering should 
 happen after.
 In test case Q2 below, when the predicate a.bar=3 comes into the 
 JoinOperator, its incoming alias is a, so it is accepted for pushdown.  
 When brought into the JoinOperator's context, however, the predicate 
 refers to b.bar in the inner scope, so we should not actually accept it for 
 pushdown.
 With the test cases
 {noformat}
 -- Q1: predicate should not be pushed on the right side of a left outer join 
 (this is correct in trunk)
 explain
 SELECT a.foo as foo1, b.foo as foo2, b.bar
 FROM pokes a LEFT OUTER JOIN pokes2 b
 ON a.foo=b.foo
 WHERE b.bar=3;
 -- Q2: predicate should not be pushed on the right side of a left outer join 
 (this is broken in trunk)
 explain
 SELECT * FROM
 (SELECT a.foo as foo1, b.foo as foo2, b.bar
 FROM pokes a LEFT OUTER JOIN pokes2 b
 ON a.foo=b.foo) a
 WHERE a.bar=3;
 -- Q3: predicate should be pushed (this is correct in trunk)
 explain
 SELECT * FROM
 (SELECT a.foo as foo1, b.foo as foo2, a.bar
 FROM pokes a JOIN pokes2 b
 ON a.foo=b.foo) a
 WHERE a.bar=3;
 {noformat}
 The current output is
 {noformat}
 hive 
  -- Q1: predicate should not be pushed on the right side of a left outer 
 join
  explain
  SELECT a.foo as foo1, b.foo as foo2, b.bar
  FROM pokes a LEFT OUTER JOIN pokes2 b
  ON a.foo=b.foo
  WHERE b.bar=3;
 OK
 ABSTRACT SYNTAX TREE:
   (TOK_QUERY (TOK_FROM (TOK_LEFTOUTERJOIN (TOK_TABREF (TOK_TABNAME pokes) a) 
 (TOK_TABREF (TOK_TABNAME pokes2) b) (= (. (TOK_TABLE_OR_COL a) foo) (. 
 (TOK_TABLE_OR_COL b) foo (TOK_INSERT (TOK_DESTINATION (TOK_DIR 
 TOK_TMP_FILE)) (TOK_SELECT (TOK_SELEXPR (. (TOK_TABLE_OR_COL a) foo) foo1) 
 (TOK_SELEXPR (. (TOK_TABLE_OR_COL b) foo) foo2) (TOK_SELEXPR (. 

[jira] [Commented] (HIVE-2383) Incorrect alias filtering for predicate pushdown

2011-08-30 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13094307#comment-13094307
 ] 

jirapos...@reviews.apache.org commented on HIVE-2383:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1568/
---

(Updated 2011-08-31 05:01:19.947004)


Review request for hive.


Changes
---

Fix filter creation when not pushed down


Summary
---

https://issues.apache.org/jira/browse/HIVE-2383


This addresses bug HIVE-2383.
https://issues.apache.org/jira/browse/HIVE-2383


Diffs (updated)
-

  
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/ExprWalkerInfo.java 1163438 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/ExprWalkerProcFactory.java 1163438 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java 1163438 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/ppd_repeated_alias.q PRE-CREATION 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/auto_join8.q.out 1163438 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/index_auto_mult_tables.q.out 1163438 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join8.q.out 1163438 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_repeated_alias.q.out PRE-CREATION 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/udf_bitmap_empty.q.out 1163438 
  http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join8.q.xml 1163438 

Diff: https://reviews.apache.org/r/1568/diff


Testing
---


Thanks,

Charles



 Incorrect alias filtering for predicate pushdown
 

 Key: HIVE-2383
 URL: https://issues.apache.org/jira/browse/HIVE-2383
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: Charles Chen
Assignee: Charles Chen
Priority: Critical
 Fix For: 0.8.0

 Attachments: HIVE-2383v1.patch, HIVE-2383v2.patch, HIVE-2383v5.patch


 The predicate pushdown optimizer starts at the topmost operators and traverses 
 the operator tree, at each stage collecting predicates to be pushed down.  At 
 each operator, hive.ql.ppd.OpProcFactory.DefaultPPD.mergeWithChildrenPred is 
 called, which merges the predicates of the children nodes into the current 
 node.  The predicates are stored in hive.ql.ppd.ExprWalkerInfo.pushdownPreds 
 as a map from the alias a predicate refers to (a predicate may only refer to 
 one alias at a time as only such predicates can be pushed) to a list of such 
 predicates.  Since at each stage the alias the predicate refers to may change 
 (subqueries may change aliases), this is updated for each operator 
 (hive.ql.ppd.ExprWalkerProcFactory.extractPushdownPreds is called which walks 
 the ExprNodeDesc for each predicate). When a JoinOperator is encountered, 
 mergeWithChildrenPred is passed an optional parameter aliases which 
 contains a set of aliases that can be pushed per ansi semantics (see 
 hive.ql.ppd.OpProcFactory.JoinPPD.getQualifiedAliases).  The part that is 
 incorrect is that aliases are filtered in mergeWithChildrenPred before 
 extractPushdownPreds is called, which associates the predicates with the 
 correct alias in the current operator's context while the filtering should 
 happen after.
 In test case Q2 below, when the predicate a.bar=3 comes into the 
 JoinOperator, its incoming alias is a, so it is accepted for pushdown.  
 When brought into the JoinOperator's context, however, the predicate 
 refers to b.bar in the inner scope, so we should not actually accept it for 
 pushdown.
 With the test cases
 {noformat}
 -- Q1: predicate should not be pushed on the right side of a left outer join 
 (this is correct in trunk)
 explain
 SELECT a.foo as foo1, b.foo as foo2, b.bar
 FROM pokes a LEFT OUTER JOIN pokes2 b
 ON a.foo=b.foo
 WHERE b.bar=3;
 -- Q2: predicate should not be pushed on the right side of a left outer join 
 (this is broken in trunk)
 explain
 SELECT * FROM
 (SELECT a.foo as foo1, b.foo as foo2, b.bar
 FROM pokes a LEFT OUTER JOIN pokes2 b
 ON a.foo=b.foo) a
 WHERE a.bar=3;
 -- Q3: predicate should be pushed (this is correct in trunk)
 explain
 SELECT * FROM
 (SELECT a.foo as foo1, b.foo as foo2, a.bar
 FROM pokes a JOIN pokes2 b
 ON a.foo=b.foo) a
 WHERE a.bar=3;
 {noformat}
 The current output is
 {noformat}
 hive 
  -- Q1: predicate should not 

[jira] [Commented] (HIVE-2383) Incorrect alias filtering for predicate pushdown

2011-08-30 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13094320#comment-13094320
 ] 

jirapos...@reviews.apache.org commented on HIVE-2383:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1568/#review1700
---



http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/ExprWalkerInfo.java
https://reviews.apache.org/r/1568/#comment3855

The predicates are already cloned in 
ExprWalkerProcFactory.extractPushdownPreds() and this behavior is inconsistent 
with addPushdowns() which doesn't clone the predicates.  Moreover, not cloning 
the expressions is convenient for the newToOldExprMap so that now the 
ExprNodeDesc's can be compared with == instead of a deep comparison of the tree.
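
To illustrate the point about comparing with == rather than walking the tree, here is a small sketch that keys a map by object identity. Whether the patch actually uses java.util.IdentityHashMap for newToOldExprMap is not stated in this thread, so treat that choice, and the Expr stand-in class, as assumptions.

```java
import java.util.IdentityHashMap;
import java.util.Map;

// Hypothetical illustration; Expr stands in for an expression node.
class IdentityMapSketch {

    // Deliberately does not override equals()/hashCode().
    static final class Expr {
        final String text;

        Expr(String text) {
            this.text = text;
        }
    }

    // Records which child-context predicate a new predicate came from.
    static Map<Expr, Expr> track(Expr newPred, Expr oldPred) {
        Map<Expr, Expr> newToOld = new IdentityHashMap<>();
        newToOld.put(newPred, oldPred);
        return newToOld;
    }

    public static void main(String[] args) {
        Expr oldPred = new Expr("a.bar = 3");
        Expr newPred = new Expr("b.bar = 3");
        Map<Expr, Expr> newToOld = track(newPred, oldPred);

        // Lookup by the very same instance finds the original predicate.
        System.out.println(newToOld.get(newPred) == oldPred);

        // A clone with identical content is a different key, which is why
        // cloning the predicates would defeat an identity-based lookup.
        Expr clone = new Expr(newPred.text);
        System.out.println(newToOld.containsKey(clone));
    }
}
```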



http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java
https://reviews.apache.org/r/1568/#comment3856

From what I understand, currently, when the PPD encounters a JoinOperator, 
mergeWithChildrenPred is used to find the filters that are not being pushed 
down past that operator based on the alias, and these should be the same 
filters that are stored in ExprWalkerInfo.nonFinalPreds (except that the 
filters we want apply to the output columns of the JoinOperator, while the 
nonFinalPreds apply to its input).  The bug in HIVE-2383 is that alias 
filtering should be done in the context of the JoinOperator, not of its 
children, so mergeWithChildrenPred currently doesn't return the correct 
filters to create.  I changed the JoinPPD to use the nonFinalPreds to figure 
out which filters to create and added a map, 
ExprWalkerInfo.newToOldExprMap, to store the mapping from each filter in the 
operator's context to the filter that it came from in the children's context 
(this is populated during ExprWalkerProcFactory.extractPushdownPreds()).


- Charles


On 2011-08-31 05:01:19, Charles Chen wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1568/
bq.  ---
bq.  
bq.  (Updated 2011-08-31 05:01:19)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  https://issues.apache.org/jira/browse/HIVE-2383
bq.  
bq.  
bq.  This addresses bug HIVE-2383.
bq.  https://issues.apache.org/jira/browse/HIVE-2383
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.    http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/ExprWalkerInfo.java 1163438 
bq.    http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/ExprWalkerProcFactory.java 1163438 
bq.    http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java 1163438 
bq.    http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/ppd_repeated_alias.q PRE-CREATION 
bq.    http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/auto_join8.q.out 1163438 
bq.    http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/index_auto_mult_tables.q.out 1163438 
bq.    http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/join8.q.out 1163438 
bq.    http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_repeated_alias.q.out PRE-CREATION 
bq.    http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/udf_bitmap_empty.q.out 1163438 
bq.    http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/compiler/plan/join8.q.xml 1163438 
bq.  
bq.  Diff: https://reviews.apache.org/r/1568/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Charles
bq.  
bq.



 Incorrect alias filtering for predicate pushdown
 

 Key: HIVE-2383
 URL: https://issues.apache.org/jira/browse/HIVE-2383
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: Charles Chen
Assignee: Charles Chen
Priority: Critical
 Fix For: 0.8.0

 Attachments: HIVE-2383v1.patch, HIVE-2383v2.patch, HIVE-2383v5.patch


 The predicate pushdown optimizer starts at the topmost operators and traverses 
 the operator tree, at each stage collecting predicates to be pushed down.  At 
 each operator, hive.ql.ppd.OpProcFactory.DefaultPPD.mergeWithChildrenPred is 
 called, which merges the predicates of the children nodes into the current 
 node.  The predicates are stored in hive.ql.ppd.ExprWalkerInfo.pushdownPreds 

[jira] [Commented] (HIVE-2415) disallow partition column names when doing replace columns

2011-08-28 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13092565#comment-13092565
 ] 

jirapos...@reviews.apache.org commented on HIVE-2415:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1672/
---

Review request for hive and Ning Zhang.


Summary
---

move validation of renaming/replacing columns from runtime to parser


This addresses bug HIVE-2415.
https://issues.apache.org/jira/browse/HIVE-2415


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java 1162190 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Table.java 1162190 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java 1162190 
  trunk/ql/src/test/queries/clientnegative/replace_columns.q PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/replace_columns_2.q PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/replace_columns_3.q PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/altern1.q.out 1162190 
  trunk/ql/src/test/results/clientnegative/column_rename1.q.out 1162190 
  trunk/ql/src/test/results/clientnegative/column_rename2.q.out 1162190 
  trunk/ql/src/test/results/clientnegative/column_rename4.q.out 1162190 
  trunk/ql/src/test/results/clientnegative/replace_columns.q.out PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/replace_columns_2.q.out PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/replace_columns_3.q.out PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/replace_columns_4.q.out PRE-CREATION 
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/SerDeUtils.java 1162190 

Diff: https://reviews.apache.org/r/1672/diff


Testing
---


Thanks,

Yongqiang



 disallow partition column names when doing replace columns
 --

 Key: HIVE-2415
 URL: https://issues.apache.org/jira/browse/HIVE-2415
 Project: Hive
  Issue Type: Bug
Reporter: He Yongqiang
Assignee: He Yongqiang
 Attachments: HIVE-2415.1.patch


 ALTER TABLE ... REPLACE COLUMNS allows adding a column with the same name as 
 a partition column, which introduces an inconsistency. 
 We should disallow this. 





[jira] [Commented] (HIVE-2404) Allow RCFile Reader to tolerate corruptions

2011-08-27 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13092390#comment-13092390
 ] 

jirapos...@reviews.apache.org commented on HIVE-2404:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1660/
---

(Updated 2011-08-27 23:13:24.160233)


Review request for Yongqiang He and Paul Yang.


Changes
---

Added unit-test, also handled ChecksumException


Summary
---

Sometimes it is useful to tolerate corruptions during a query and return 
results based on the files that can be processed. A single corrupt block of 
data should not prevent reading the rest of the data.

We need a way to gracefully ignore errors while reading a RC File
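
As a rough sketch of the behaviour being asked for, the loop below skips a unit of data that fails to read and carries on with the rest. It is not the RCFile.java change under review: BlockSource, fromArray, and the tolerateCorruptions flag are hypothetical names, and whether the real patch skips a block, a split, or the rest of a file is not stated here. The sketch catches java.io.IOException, which Hadoop's ChecksumException extends.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical model of a corruption-tolerant read loop; not RCFile.Reader.
class TolerantReaderSketch {

    // Stand-in for a reader that yields one block of data at a time.
    interface BlockSource {
        boolean hasNext();

        String next() throws IOException; // throws when the block is unreadable
    }

    // Test source: any entry equal to "CORRUPT" fails on read.
    static BlockSource fromArray(final String... blocks) {
        return new BlockSource() {
            private int i = 0;

            public boolean hasNext() {
                return i < blocks.length;
            }

            public String next() throws IOException {
                String block = blocks[i++];
                if ("CORRUPT".equals(block)) {
                    throw new IOException("bad block at index " + (i - 1));
                }
                return block;
            }
        };
    }

    // Reads every block; with tolerateCorruptions set, unreadable blocks are
    // logged and skipped instead of failing the whole read.
    static List<String> readAll(BlockSource source, boolean tolerateCorruptions)
            throws IOException {
        List<String> rows = new ArrayList<>();
        while (source.hasNext()) {
            try {
                rows.add(source.next());
            } catch (IOException e) {
                if (!tolerateCorruptions) {
                    throw e;
                }
                System.err.println("Skipping unreadable block: " + e.getMessage());
            }
        }
        return rows;
    }

    public static void main(String[] args) throws IOException {
        System.out.println(readAll(fromArray("r1", "CORRUPT", "r2"), true));
    }
}
```

Keeping the tolerant path behind an explicit flag means the default remains a hard failure, so silently dropped data is something a caller opts into.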


This addresses bug HIVE-2404.
https://issues.apache.org/jira/browse/HIVE-2404


Diffs (updated)
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/io/RCFile.java 1161660 
  trunk/ql/src/test/org/apache/hadoop/hive/ql/io/TestRCFile.java 1161660 

Diff: https://reviews.apache.org/r/1660/diff


Testing
---

Manual testing with a corrupt RC file


Thanks,

Ramkumar



 Allow RCFile Reader to tolerate corruptions
 ---

 Key: HIVE-2404
 URL: https://issues.apache.org/jira/browse/HIVE-2404
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.7.1
Reporter: Ramkumar Vadali
Assignee: Ramkumar Vadali
Priority: Minor
 Attachments: toleratecorruptions.patch


 Sometimes it is useful to tolerate corruptions during a query and return 
 results based on the files that can be processed. A single corrupt block of 
 data should not prevent reading the rest of the data.
 We need a way to gracefully ignore errors while reading a RC File




