[jira] [Commented] (HIVE-2246) Dedupe tables' column schemas from partitions in the metastore db

2011-08-05 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13080216#comment-13080216
 ] 

jirapos...@reviews.apache.org commented on HIVE-2246:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1183/
---

(Updated 2011-08-05 20:48:05.144312)


Review request for hive, Ning Zhang and Paul Yang.


Changes
---

-On alter table, only change the column descriptor if the columns have changed.
-Fix desc table partition... to use the partition's column schema, not the 
table's


Summary
---

This patch tries to make minimal changes to the API while keeping migration 
short and somewhat easy to revert.

The new schema can be described as follows:
- CDS is a table corresponding to Column Descriptor objects.  Currently, it 
only stores a CD_ID.
- COLUMNS_V2 is a table corresponding to MFieldSchema objects, or columns.  A 
Column Descriptor holds a list of columns.  COLUMNS_V2 has a foreign key to the 
CD_ID to which it belongs.
- SDS was modified to reference a Column Descriptor. So SDS now has a foreign 
key to a CD_ID which describes its columns.

During migration, we create Column Descriptors for tables in a straightforward 
manner: their columns are now just wrapped inside a column descriptor.  The SDS 
of partitions use their parent table's column descriptor, since currently a 
partition and its table share the same list of columns.

When altering or adding a partition, give it it's parent table's column 
descriptor IF the columns they describe are the same.  Otherwise, create a new 
column descriptor for its columns.

When adding or altering a table, create a new column descriptor every time.

Whenever you drop a storage descriptor (e.g, when dropping tables or 
partitions), check to see if the related column descriptor has any other 
references in the table.  That is, check to see if any other storage 
descriptors point to that column descriptor.  If none do, then delete that 
column descriptor.  This check is in place so we don't have unreferenced column 
descriptors and columns hanging around after schema evolution for tables.


This addresses bug HIVE-2246.
https://issues.apache.org/jira/browse/HIVE-2246


Diffs (updated)
-

  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 
1153927 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java 
1153927 
  
trunk/metastore/src/model/org/apache/hadoop/hive/metastore/model/MStorageDescriptor.java
 1153927 
  trunk/metastore/src/model/package.jdo 1153927 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java 1153927 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/MetaDataFormatUtils.java 
1153927 

Diff: https://reviews.apache.org/r/1183/diff


Testing
---

Passes facebook's regression testing and all existing test cases.  In one 
instance, before migration, the overhead involved with storage descriptors and 
columns was ~11 GB.  After migration, the overhead was ~1.5 GB.


Thanks,

Sohan



 Dedupe tables' column schemas from partitions in the metastore db
 -

 Key: HIVE-2246
 URL: https://issues.apache.org/jira/browse/HIVE-2246
 Project: Hive
  Issue Type: Improvement
  Components: Metastore
Reporter: Sohan Jain
Assignee: Sohan Jain
 Attachments: HIVE-2246.2.patch, HIVE-2246.3.patch


 Note: this patch proposes a schema change, and is therefore incompatible with 
 the current metastore.
 We can re-organize the JDO models to reduce space usage to keep the metastore 
 scalable for the future.  Currently, partitions are the fastest growing 
 objects in the metastore, and the metastore keeps a separate copy of the 
 columns list for each partition.  We can normalize the metastore db by 
 decoupling Columns from Storage Descriptors and not storing duplicate lists 
 of the columns for each partition. 
 An idea is to create an additional level of indirection with a Column 
 Descriptor that has a list of columns.  A table has a reference to its 
 latest Column Descriptor (note: a table may have more than one Column 
 Descriptor in the case of schema evolution).  Partitions and Indexes can 
 reference the same Column Descriptors as their parent table.
 Currently, the COLUMNS table in the metastore has roughly (number of 
 partitions + number of tables) * (average number of columns pertable) rows.  
 We can reduce this to (number of tables) * (average number of columns per 
 table) rows, while incurring a small cost proportional to the number of 
 tables to store the Column Descriptors.
 Please see the latest review board for additional 

[jira] [Commented] (HIVE-2334) DESCRIBE TABLE causes NPE when hive.cli.print.header=true

2011-08-05 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13080217#comment-13080217
 ] 

jirapos...@reviews.apache.org commented on HIVE-2334:
-



bq.  On 2011-08-05 20:38:11, Carl Steinbach wrote:
bq.   cli/src/java/org/apache/hadoop/hive/cli/CliDriver.java, line 235
bq.   https://reviews.apache.org/r/1300/diff/1/?file=30859#file30859line235
bq.  
bq.   Might want to consider using StringTokenizer or StreamTokenizer here.

This is how it was in the original code.  All of this can actually be done 
quite a bit better.  I'm happy to switch to tokenizer; the patch is a bit 
schizophrenic about refactoring/improving.  I didn't change this since it's not 
directly related to what I was trying to test.


bq.  On 2011-08-05 20:38:11, Carl Steinbach wrote:
bq.   cli/src/test/org/apache/hadoop/hive/cli/TestCliDriverMethods.java, line 
37
bq.   https://reviews.apache.org/r/1300/diff/1/?file=30860#file30860line37
bq.  
bq.   There's already some very limited test coverage for the 
hive.cli.print.header feature in print_header.q. Why not extend this testcase 
instead of adding a new unit test?

Because this is an actual unit test against a specific regression, which has 
value separate from the print_header.q integration test.  I can add additional 
content to print_header.q, since this test is easier to identify what's gone 
wrong and runs in about 0.2 seconds, this one seems more useful.  


bq.  On 2011-08-05 20:38:11, Carl Steinbach wrote:
bq.   ivy/libraries.properties, line 47
bq.   https://reviews.apache.org/r/1300/diff/1/?file=30861#file30861line47
bq.  
bq.   We need to manage this dependency with Ivy. The Hive build currently 
runs against hadoop-0.20.1, which does not include mockito-all-1.8.2.jar

I'm sorry; I don't understand.  This is being brought in by Ivy?  As part of 
HIVE-2171, I had mentioned we need to make sure testing related jars don't get 
included during binary/package, but that should be done in a different JIRA.


bq.  On 2011-08-05 20:38:11, Carl Steinbach wrote:
bq.   cli/src/test/org/apache/hadoop/hive/cli/TestCliDriverMethods.java, line 1
bq.   https://reviews.apache.org/r/1300/diff/1/?file=30860#file30860line1
bq.  
bq.   cli/build.xml overrides the ant test target with a no-op, so this 
test is actually not getting run.

I'll update cli/build.xml to not be a no-op, unless there's some reason to?


- Jakob


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1300/#review1308
---


On 2011-08-05 01:22:01, Jakob Homan wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1300/
bq.  ---
bq.  
bq.  (Updated 2011-08-05 01:22:01)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Commands that don't return a schema cause NPE when print headers is on.
bq.  
bq.  
bq.  This addresses bug HIVE-2334.
bq.  https://issues.apache.org/jira/browse/HIVE-2334
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.cli/src/java/org/apache/hadoop/hive/cli/CliDriver.java 9fa7bc6 
bq.cli/src/test/org/apache/hadoop/hive/cli/TestCliDriverMethods.java 
PRE-CREATION 
bq.ivy/libraries.properties af856bd 
bq.  
bq.  Diff: https://reviews.apache.org/r/1300/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  New unit tests (both positive and negative) and verification on manual 
cluster.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Jakob
bq.  
bq.



 DESCRIBE TABLE causes NPE when hive.cli.print.header=true
 -

 Key: HIVE-2334
 URL: https://issues.apache.org/jira/browse/HIVE-2334
 Project: Hive
  Issue Type: Bug
  Components: CLI
Affects Versions: 0.7.1
Reporter: Carl Steinbach
Assignee: Jakob Homan
 Attachments: h2334.patch




--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2246) Dedupe tables' column schemas from partitions in the metastore db

2011-08-05 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13080218#comment-13080218
 ] 

jirapos...@reviews.apache.org commented on HIVE-2246:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1183/
---

(Updated 2011-08-05 20:49:19.127572)


Review request for hive, Ning Zhang and Paul Yang.


Changes
---

-Forgot to add a few files.  NOTE: this is only a temporary diff; I need to add 
derby and postgres migration scripts.


Summary
---

This patch tries to make minimal changes to the API while keeping migration 
short and somewhat easy to revert.

The new schema can be described as follows:
- CDS is a table corresponding to Column Descriptor objects.  Currently, it 
only stores a CD_ID.
- COLUMNS_V2 is a table corresponding to MFieldSchema objects, or columns.  A 
Column Descriptor holds a list of columns.  COLUMNS_V2 has a foreign key to the 
CD_ID to which it belongs.
- SDS was modified to reference a Column Descriptor. So SDS now has a foreign 
key to a CD_ID which describes its columns.

During migration, we create Column Descriptors for tables in a straightforward 
manner: their columns are now just wrapped inside a column descriptor.  The SDS 
of partitions use their parent table's column descriptor, since currently a 
partition and its table share the same list of columns.

When altering or adding a partition, give it it's parent table's column 
descriptor IF the columns they describe are the same.  Otherwise, create a new 
column descriptor for its columns.

When adding or altering a table, create a new column descriptor every time.

Whenever you drop a storage descriptor (e.g, when dropping tables or 
partitions), check to see if the related column descriptor has any other 
references in the table.  That is, check to see if any other storage 
descriptors point to that column descriptor.  If none do, then delete that 
column descriptor.  This check is in place so we don't have unreferenced column 
descriptors and columns hanging around after schema evolution for tables.


This addresses bug HIVE-2246.
https://issues.apache.org/jira/browse/HIVE-2246


Diffs (updated)
-

  trunk/metastore/scripts/upgrade/mysql/008-HIVE-2246.mysql.sql PRE-CREATION 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 
1153927 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java 
1153927 
  
trunk/metastore/src/model/org/apache/hadoop/hive/metastore/model/MColumnDescriptor.java
 PRE-CREATION 
  
trunk/metastore/src/model/org/apache/hadoop/hive/metastore/model/MStorageDescriptor.java
 1153927 
  trunk/metastore/src/model/package.jdo 1153927 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java 1153927 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/MetaDataFormatUtils.java 
1153927 

Diff: https://reviews.apache.org/r/1183/diff


Testing
---

Passes facebook's regression testing and all existing test cases.  In one 
instance, before migration, the overhead involved with storage descriptors and 
columns was ~11 GB.  After migration, the overhead was ~1.5 GB.


Thanks,

Sohan



 Dedupe tables' column schemas from partitions in the metastore db
 -

 Key: HIVE-2246
 URL: https://issues.apache.org/jira/browse/HIVE-2246
 Project: Hive
  Issue Type: Improvement
  Components: Metastore
Reporter: Sohan Jain
Assignee: Sohan Jain
 Attachments: HIVE-2246.2.patch, HIVE-2246.3.patch


 Note: this patch proposes a schema change, and is therefore incompatible with 
 the current metastore.
 We can re-organize the JDO models to reduce space usage to keep the metastore 
 scalable for the future.  Currently, partitions are the fastest growing 
 objects in the metastore, and the metastore keeps a separate copy of the 
 columns list for each partition.  We can normalize the metastore db by 
 decoupling Columns from Storage Descriptors and not storing duplicate lists 
 of the columns for each partition. 
 An idea is to create an additional level of indirection with a Column 
 Descriptor that has a list of columns.  A table has a reference to its 
 latest Column Descriptor (note: a table may have more than one Column 
 Descriptor in the case of schema evolution).  Partitions and Indexes can 
 reference the same Column Descriptors as their parent table.
 Currently, the COLUMNS table in the metastore has roughly (number of 
 partitions + number of tables) * (average number of columns pertable) rows.  
 We can reduce this to (number of tables) * (average number of columns per 
 table) rows, while incurring a 

[jira] [Commented] (HIVE-1694) Accelerate GROUP BY execution using indexes

2011-08-05 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13080235#comment-13080235
 ] 

jirapos...@reviews.apache.org commented on HIVE-1694:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1194/#review1303
---



ql/src/java/org/apache/hadoop/hive/ql/index/AggregateIndexHandler.java
https://reviews.apache.org/r/1194/#comment2955

Can't you just look up AGGREGATES in the map?



ql/src/java/org/apache/hadoop/hive/ql/index/AggregateIndexHandler.java
https://reviews.apache.org/r/1194/#comment2953

Add a helper method to avoid duplicating the code in the else block below.




ql/src/java/org/apache/hadoop/hive/ql/index/AggregateIndexHandler.java
https://reviews.apache.org/r/1194/#comment2954

Can't you just look up AGGREGATES in the map?



ql/src/java/org/apache/hadoop/hive/ql/index/AggregateIndexHandler.java
https://reviews.apache.org/r/1194/#comment2956

See recent changes in corresponding CompactIndexHandler code for 
HIVEOPTINDEXFILTER; need the same here (or better, factor out common code here 
and elsewhere).

On a related note, you may be able to use the same technique instead of 
isQueryInsertToTable; this would be preferable since it's nice to be able to 
use the index rewrite in cases where it's a normal INSERT table with index 
being used for GROUP BY on SELECT from some other table.




ql/src/java/org/apache/hadoop/hive/ql/optimizer/IndexUtils.java
https://reviews.apache.org/r/1194/#comment2957

@params here don't match actual params



ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyProcFactory.java
https://reviews.apache.org/r/1194/#comment2958

Shouldn't this be the same as COUNT(*)?




ql/src/test/queries/clientpositive/ql_rewrite_gbtoidx.q
https://reviews.apache.org/r/1194/#comment2980

Besides EXPLAIN, you should include a few queries against a non-empty table 
verifying that you get the correct results both with and without the 
optimization applied.  Remember to include an ORDER BY for test determinism.




ql/src/test/queries/clientpositive/ql_rewrite_gbtoidx.q
https://reviews.apache.org/r/1194/#comment2978

Isn't this set redundant?


- John


On 2011-08-03 10:31:42, Prajakta Kalmegh wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1194/
bq.  ---
bq.  
bq.  (Updated 2011-08-03 10:31:42)
bq.  
bq.  
bq.  Review request for hive and John Sichi.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  This patch has defined a new AggregateIndexHandler which is used to 
optimize the query plan for groupby queries. 
bq.  
bq.  
bq.  This addresses bug HIVE-1694.
bq.  https://issues.apache.org/jira/browse/HIVE-1694
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.common/src/java/org/apache/hadoop/hive/conf/HiveConf.java b46976f 
bq.ql/src/java/org/apache/hadoop/hive/ql/index/AggregateIndexHandler.java 
PRE-CREATION 
bq.ql/src/java/org/apache/hadoop/hive/ql/index/HiveIndex.java 591c9ff 
bq.ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java a57f9cf 
bq.ql/src/java/org/apache/hadoop/hive/ql/optimizer/IndexUtils.java 
PRE-CREATION 
bq.ql/src/java/org/apache/hadoop/hive/ql/optimizer/Optimizer.java 590d69a 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyCtx.java 
PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyProcFactory.java
 PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteGBUsingIndex.java 
PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteParseContextGenerator.java
 PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteQueryUsingAggregateIndex.java
 PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteQueryUsingAggregateIndexCtx.java
 PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/index/IndexWhereProcessor.java
 8295687 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/index/IndexWhereTaskDispatcher.java
 699519b 
bq.ql/src/test/queries/clientpositive/ql_rewrite_gbtoidx.q PRE-CREATION 
bq.ql/src/test/results/clientpositive/ql_rewrite_gbtoidx.q.out PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1194/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Prajakta
bq.  
bq.



 Accelerate GROUP BY execution using indexes
 ---

 Key: HIVE-1694
 URL: 

[jira] [Commented] (HIVE-2272) add TIMESTAMP data type

2011-08-05 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13080273#comment-13080273
 ] 

jirapos...@reviews.apache.org commented on HIVE-2272:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1135/#review1277
---



trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
https://reviews.apache.org/r/1135/#comment2918

I think we should put default timezone to be UTC and we internally in 
facebook have PST as default.



trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java
https://reviews.apache.org/r/1135/#comment2922

Only loading it in client side is sufficient?


- Siying


On 2011-07-28 21:59:38, Franklin Hu wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1135/
bq.  ---
bq.  
bq.  (Updated 2011-07-28 21:59:38)
bq.  
bq.  
bq.  Review request for hive, Yongqiang He, Ning Zhang, and Siying Dong.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Adds TIMESTAMP type to serde2 with both string (LazySimple) and binary 
(LazyBinary) serialization.
bq.  Supports SQL style jdbc timestamps of the format with nanosecond precision
bq.  -MM-DD HH:MM:SS[.fff...]
bq.  
bq.  
bq.  This addresses bug HIVE-2272.
bq.  https://issues.apache.org/jira/browse/HIVE-2272
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1152003 
bq.trunk/conf/hive-default.xml 1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FunctionRegistry.java 
1152003 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java 
1152003 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java 
1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ErrorMsg.java 1152003 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/TypeCheckProcFactory.java 
1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDate.java 1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDateAdd.java 1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDateDiff.java 1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDateSub.java 1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDayOfMonth.java 
1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFHour.java 1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFMinute.java 1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFMonth.java 1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFSecond.java 1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToBoolean.java 
1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToByte.java 1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToDouble.java 1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToFloat.java 1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToInteger.java 
1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToLong.java 1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToShort.java 1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToString.java 1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFUnixTimeStamp.java 
1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFWeekOfYear.java 
1152003 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFYear.java 1152003 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFAverage.java 
1152003 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFContextNGrams.java
 1152003 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFCorrelation.java
 1152003 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFCovariance.java
 1152003 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFCovarianceSample.java
 1152003 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFHistogramNumeric.java
 1152003 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFPercentileApprox.java
 1152003 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFStd.java 
1152003 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFStdSample.java
 1152003 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFSum.java 
1152003 
bq.

[jira] [Commented] (HIVE-2246) Dedupe tables' column schemas from partitions in the metastore db

2011-08-05 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13080327#comment-13080327
 ] 

jirapos...@reviews.apache.org commented on HIVE-2246:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1183/
---

(Updated 2011-08-06 01:40:49.118616)


Review request for hive, Ning Zhang and Paul Yang.


Changes
---

-made listStorageDescriptors.. into one transaction
-renamed dropStorageDescriptorCleanly to make it's functionality clearer
-indents  typo


Summary
---

This patch tries to make minimal changes to the API while keeping migration 
short and somewhat easy to revert.

The new schema can be described as follows:
- CDS is a table corresponding to Column Descriptor objects.  Currently, it 
only stores a CD_ID.
- COLUMNS_V2 is a table corresponding to MFieldSchema objects, or columns.  A 
Column Descriptor holds a list of columns.  COLUMNS_V2 has a foreign key to the 
CD_ID to which it belongs.
- SDS was modified to reference a Column Descriptor. So SDS now has a foreign 
key to a CD_ID which describes its columns.

During migration, we create Column Descriptors for tables in a straightforward 
manner: their columns are now just wrapped inside a column descriptor.  The SDS 
of partitions use their parent table's column descriptor, since currently a 
partition and its table share the same list of columns.

When altering or adding a partition, give it it's parent table's column 
descriptor IF the columns they describe are the same.  Otherwise, create a new 
column descriptor for its columns.

When adding or altering a table, create a new column descriptor every time.

Whenever you drop a storage descriptor (e.g, when dropping tables or 
partitions), check to see if the related column descriptor has any other 
references in the table.  That is, check to see if any other storage 
descriptors point to that column descriptor.  If none do, then delete that 
column descriptor.  This check is in place so we don't have unreferenced column 
descriptors and columns hanging around after schema evolution for tables.


This addresses bug HIVE-2246.
https://issues.apache.org/jira/browse/HIVE-2246


Diffs (updated)
-

  trunk/metastore/scripts/upgrade/mysql/008-HIVE-2246.mysql.sql PRE-CREATION 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 
1153927 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java 
1153927 
  
trunk/metastore/src/model/org/apache/hadoop/hive/metastore/model/MColumnDescriptor.java
 PRE-CREATION 
  
trunk/metastore/src/model/org/apache/hadoop/hive/metastore/model/MStorageDescriptor.java
 1153927 
  trunk/metastore/src/model/package.jdo 1153927 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java 1153927 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/MetaDataFormatUtils.java 
1153927 

Diff: https://reviews.apache.org/r/1183/diff


Testing
---

Passes facebook's regression testing and all existing test cases.  In one 
instance, before migration, the overhead involved with storage descriptors and 
columns was ~11 GB.  After migration, the overhead was ~1.5 GB.


Thanks,

Sohan



 Dedupe tables' column schemas from partitions in the metastore db
 -

 Key: HIVE-2246
 URL: https://issues.apache.org/jira/browse/HIVE-2246
 Project: Hive
  Issue Type: Improvement
  Components: Metastore
Reporter: Sohan Jain
Assignee: Sohan Jain
 Attachments: HIVE-2246.2.patch, HIVE-2246.3.patch, HIVE-2246.4.patch


 Note: this patch proposes a schema change, and is therefore incompatible with 
 the current metastore.
 We can re-organize the JDO models to reduce space usage to keep the metastore 
 scalable for the future.  Currently, partitions are the fastest growing 
 objects in the metastore, and the metastore keeps a separate copy of the 
 columns list for each partition.  We can normalize the metastore db by 
 decoupling Columns from Storage Descriptors and not storing duplicate lists 
 of the columns for each partition. 
 An idea is to create an additional level of indirection with a Column 
 Descriptor that has a list of columns.  A table has a reference to its 
 latest Column Descriptor (note: a table may have more than one Column 
 Descriptor in the case of schema evolution).  Partitions and Indexes can 
 reference the same Column Descriptors as their parent table.
 Currently, the COLUMNS table in the metastore has roughly (number of 
 partitions + number of tables) * (average number of columns pertable) rows.  
 We can reduce this to (number of tables) * (average number of columns 

[jira] [Commented] (HIVE-2246) Dedupe tables' column schemas from partitions in the metastore db

2011-08-05 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13080328#comment-13080328
 ] 

jirapos...@reviews.apache.org commented on HIVE-2246:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1183/#review1313
---



trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java
https://reviews.apache.org/r/1183/#comment2984

should read 1-N actually


- Sohan


On 2011-08-06 01:40:49, Sohan Jain wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1183/
bq.  ---
bq.  
bq.  (Updated 2011-08-06 01:40:49)
bq.  
bq.  
bq.  Review request for hive, Ning Zhang and Paul Yang.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  This patch tries to make minimal changes to the API while keeping 
migration short and somewhat easy to revert.
bq.  
bq.  The new schema can be described as follows:
bq.  - CDS is a table corresponding to Column Descriptor objects.  Currently, 
it only stores a CD_ID.
bq.  - COLUMNS_V2 is a table corresponding to MFieldSchema objects, or columns. 
 A Column Descriptor holds a list of columns.  COLUMNS_V2 has a foreign key to 
the CD_ID to which it belongs.
bq.  - SDS was modified to reference a Column Descriptor. So SDS now has a 
foreign key to a CD_ID which describes its columns.
bq.  
bq.  During migration, we create Column Descriptors for tables in a 
straightforward manner: their columns are now just wrapped inside a column 
descriptor.  The SDS of partitions use their parent table's column descriptor, 
since currently a partition and its table share the same list of columns.
bq.  
bq.  When altering or adding a partition, give it it's parent table's column 
descriptor IF the columns they describe are the same.  Otherwise, create a new 
column descriptor for its columns.
bq.  
bq.  When adding or altering a table, create a new column descriptor every time.
bq.  
bq.  Whenever you drop a storage descriptor (e.g, when dropping tables or 
partitions), check to see if the related column descriptor has any other 
references in the table.  That is, check to see if any other storage 
descriptors point to that column descriptor.  If none do, then delete that 
column descriptor.  This check is in place so we don't have unreferenced column 
descriptors and columns hanging around after schema evolution for tables.
bq.  
bq.  
bq.  This addresses bug HIVE-2246.
bq.  https://issues.apache.org/jira/browse/HIVE-2246
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/metastore/scripts/upgrade/mysql/008-HIVE-2246.mysql.sql 
PRE-CREATION 
bq.
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 
1153927 
bq.
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java 
1153927 
bq.
trunk/metastore/src/model/org/apache/hadoop/hive/metastore/model/MColumnDescriptor.java
 PRE-CREATION 
bq.
trunk/metastore/src/model/org/apache/hadoop/hive/metastore/model/MStorageDescriptor.java
 1153927 
bq.trunk/metastore/src/model/package.jdo 1153927 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java 1153927 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/MetaDataFormatUtils.java 
1153927 
bq.  
bq.  Diff: https://reviews.apache.org/r/1183/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Passes facebook's regression testing and all existing test cases.  In one 
instance, before migration, the overhead involved with storage descriptors and 
columns was ~11 GB.  After migration, the overhead was ~1.5 GB.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Sohan
bq.  
bq.



 Dedupe tables' column schemas from partitions in the metastore db
 -

 Key: HIVE-2246
 URL: https://issues.apache.org/jira/browse/HIVE-2246
 Project: Hive
  Issue Type: Improvement
  Components: Metastore
Reporter: Sohan Jain
Assignee: Sohan Jain
 Attachments: HIVE-2246.2.patch, HIVE-2246.3.patch, HIVE-2246.4.patch


 Note: this patch proposes a schema change, and is therefore incompatible with 
 the current metastore.
 We can re-organize the JDO models to reduce space usage to keep the metastore 
 scalable for the future.  Currently, partitions are the fastest growing 
 objects in the metastore, and the metastore keeps a separate copy of the 
 columns list for each partition.  We can normalize the metastore db by 
 decoupling Columns from Storage Descriptors and not storing duplicate lists 
 of the columns for each partition. 
 An 

[jira] [Commented] (HIVE-2319) Calling alter_table after changing partition comment throws an exception

2011-08-04 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13079470#comment-13079470
 ] 

jirapos...@reviews.apache.org commented on HIVE-2319:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1213/
---

(Updated 2011-08-04 17:05:08.413032)


Review request for hive and Paul Yang.


Changes
---

Removed a change in TestMetaStoreEventListener.java that somehow snuck in this 
diff.


Summary
---

Altering a table's partition key comments raises an InvalidOperationException. 
The partition key name and type should not be mutable, but the comment should 
be able to get changed.


This addresses bug HIVE-2319.
https://issues.apache.org/jira/browse/HIVE-2319


Diffs (updated)
-

  
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveAlterHandler.java 
1153927 
  
trunk/metastore/src/test/org/apache/hadoop/hive/metastore/TestHiveMetaStore.java
 1153927 

Diff: https://reviews.apache.org/r/1213/diff


Testing
---

Added some test cases to HiveMetaStore that pass.


Thanks,

Sohan



 Calling alter_table after changing partition comment throws an exception
 

 Key: HIVE-2319
 URL: https://issues.apache.org/jira/browse/HIVE-2319
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Reporter: Sohan Jain
Assignee: Sohan Jain
 Attachments: HIVE-2319.2.patch, HIVE-2319.3.patch


 Altering a table's partition key comments raises an 
 InvalidOperationException.  The partition key name and type should not be 
 mutable, but the comment should be able to get changed.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2346) Add hooks to run when execution fails.

2011-08-04 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13079534#comment-13079534
 ] 

jirapos...@reviews.apache.org commented on HIVE-2346:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1295/
---

Review request for hive and Ning Zhang.


Summary
---

I added a new type of hook, which will be run when a job fails.


This addresses bug HIVE-2346.
https://issues.apache.org/jira/browse/HIVE-2346


Diffs
-

  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1153966 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java 1153966 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/hooks/HookContext.java 1153966 

Diff: https://reviews.apache.org/r/1295/diff


Testing
---

I ran the TestCliDriver and TestNegativeCliDriver test suites and verified they 
passed.

In addition, I created a sample hook, which simply logged that it was run.  I 
verified it was run on a failure, but not when a job succeeded.


Thanks,

Kevin



 Add hooks to run when execution fails.
 --

 Key: HIVE-2346
 URL: https://issues.apache.org/jira/browse/HIVE-2346
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong
 Attachments: HIVE-2346.1.patch.txt


 Currently, when a query fails, the Post Execution Hooks are not run.
 Adding hooks to be run when a query fails could allow for better logging etc.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2347) Make Hadoop Job ID available after task finishes executing

2011-08-04 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13079549#comment-13079549
 ] 

jirapos...@reviews.apache.org commented on HIVE-2347:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1296/
---

Review request for hive and Ning Zhang.


Summary
---

I added a field for the Hadoop Job ID to the Task class.  This will make it 
accessible to the Driver and hence to the hooks for logging/debugging purposes. 
 By including it in the Task, we only need to check that the type of the task 
is MAPRED, before getting the job ID.

I considered adding it to several places:

as separate fields in ExecDriver and BlockMergeTask: this would require 
duplicating code, require conditions to determine the type of a task and 
casting to either ExecDriver or BlockMergeTask in order to get the JobID from 
them

in the MapRedWork: this would require modifying a field in MapRedWork in the 
execute function, and I could not find a precedent for this


This addresses bug HIVE-2347.
https://issues.apache.org/jira/browse/HIVE-2347


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/ExecDriver.java 1153966 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Task.java 1153966 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/io/rcfile/merge/BlockMergeTask.java 
1153966 

Diff: https://reviews.apache.org/r/1296/diff


Testing
---

Ran the TestCliDriver and TestNegativeCliDriver test suites and verified they 
passed.

Also, created a sample post exec hook which simply logged the JobID for every 
map reduce task, and verified it.


Thanks,

Kevin



 Make Hadoop Job ID available after task finishes executing
 --

 Key: HIVE-2347
 URL: https://issues.apache.org/jira/browse/HIVE-2347
 Project: Hive
  Issue Type: Improvement
Reporter: Kevin Wilfong
Assignee: Kevin Wilfong

 After Map Reduce tasks finish the execute method (ExecDriver and 
 BlockMergeTask) the Hadoop Job ID is inaccessible to the Driver, and hence 
 the hooks it runs.  Expose this information could help to improve logging, 
 debugging, etc.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2138) Exception when no splits returned from index

2011-08-04 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13079630#comment-13079630
 ] 

jirapos...@reviews.apache.org commented on HIVE-2138:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1255/
---

(Updated 2011-08-04 21:56:31.633797)


Review request for hive, John Sichi and Yongqiang He.


Changes
---

removed stale testcase


Summary
---

return an empty array of input splits when we get back no results from the 
hiveIndexResult


This addresses bug HIVE-2138.
https://issues.apache.org/jira/browse/HIVE-2138


Diffs (updated)
-

  ql/src/java/org/apache/hadoop/hive/ql/index/HiveIndexedInputFormat.java 
1a1ecd7 
  ql/src/test/queries/clientpositive/index_auto_empty.q PRE-CREATION 
  ql/src/test/results/clientpositive/index_auto_empty.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1255/diff


Testing
---

added index_auto_test_if_used.q to the test suite


Thanks,

Syed



 Exception when no splits returned from index
 

 Key: HIVE-2138
 URL: https://issues.apache.org/jira/browse/HIVE-2138
 Project: Hive
  Issue Type: Bug
  Components: Indexing
Affects Versions: 0.8.0
Reporter: Russell Melick
Assignee: Syed S. Albiz
 Attachments: HIVE-2138.1.patch, HIVE-2138.2.patch, HIVE-2138.3.patch, 
 HIVE-2138.4.patch, index_auto_test_if_used.q


 Running a query that uses indexing but doesn't return any results give an 
 exception.
 {code} java.lang.IllegalArgumentException: Can not create a Path from an 
 empty string
 at org.apache.hadoop.fs.Path.checkPathArg(Path.java:82)
 at org.apache.hadoop.fs.Path.init(Path.java:90)
 at org.apache.hadoop.util.StringUtils.stringToPath(StringUtils.java:224)
 at 
 org.apache.hadoop.mapred.FileInputFormat.setInputPaths(FileInputFormat.java:282)
 at 
 org.apache.hadoop.hive.ql.index.HiveIndexedInputFormat.getSplits(HiveIndexedInputFormat.java:123)
  {code}
 This could potentially be fixed by creating a new empty file to use for the 
 splits.
 Once this is fixed, the index_auto_test_if_used.q can be used.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2110) Hive Client is indefenitely waiting for reading from Socket

2011-08-04 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13079706#comment-13079706
 ] 

jirapos...@reviews.apache.org commented on HIVE-2110:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1299/
---

Review request for hive and Carl Steinbach.


Summary
---

Pass the Login timeout to thrift/socket layer when connecting to the hive 
server.


This addresses bug HIVE-2110.
https://issues.apache.org/jira/browse/HIVE-2110


Diffs
-

  jdbc/src/java/org/apache/hadoop/hive/jdbc/HiveConnection.java 7c5df83 
  jdbc/src/java/org/apache/hadoop/hive/jdbc/HiveDriver.java c61425f 

Diff: https://reviews.apache.org/r/1299/diff


Testing
---

simple repro to verify the timeout. JDBC tests.


Thanks,

Prasad



 Hive Client is indefenitely waiting for reading from Socket
 ---

 Key: HIVE-2110
 URL: https://issues.apache.org/jira/browse/HIVE-2110
 Project: Hive
  Issue Type: Bug
  Components: JDBC
Affects Versions: 0.5.0
 Environment: Hadoop 0.20.1, Hive0.5.0 and SUSE Linux Enterprise 
 Server 10 SP2 (i586) - Kernel 2.6.16.60-0.21-smp (5).
Reporter: Chinna Rao Lalam
Assignee: Prasad Mujumdar

 Hive Client is indefenitely waiting for reading from Socket. Thread dump i  
 added below.
 Cause is:
  
   In the HiveClient, when client socket is created, the read timeout is 
 mentioned is 0. So the socket will indefinetly wait when the machine where 
 Hive Server is running is shutdown or network is unplugged. The same may 
 not happen if the HiveServer alone is killed or gracefully shutdown. At this 
 time, client will get connection reset exception. 
 Code in HiveConnection
 ---
 {noformat}
 transport = new TSocket(host, port);
 TProtocol protocol = new TBinaryProtocol(transport); 
 client = new HiveClient(protocol);
 {noformat}
 In the Client side, they send the query and wait for the response 
 send_execute(query,id); recv_execute(); // place where client waiting is 
 initiated
 Thread dump:
 {noformat}
 main prio=10 tid=0x40111000 nid=0x3641 runnable [0x7f0d73f29000]
   java.lang.Thread.State: RUNNABLE
   at java.net.SocketInputStream.socketRead0(Native Method)
   at java.net.SocketInputStream.read(SocketInputStream.java:129)
   at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
   at java.io.BufferedInputStream.read1(BufferedInputStream.java:258)
   at java.io.BufferedInputStream.read(BufferedInputStream.java:317) 
   locked 0x7f0d5d3f0828 (a java.io.BufferedInputStream)
   at 
 org.apache.thrift.transport.TIOStreamTransport.read(TIOStreamTransport.java:125)
   at org.apache.thrift.transport.TTransport.readAll(TTransport.java:84)
   at 
 org.apache.thrift.protocol.TBinaryProtocol.readAll(TBinaryProtocol.java:314)
   at 
 org.apache.thrift.protocol.TBinaryProtocol.readI32(TBinaryProtocol.java:262)
   at 
 org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol.java:192)
   at 
 org.apache.hadoop.hive.service.ThriftHive$Client.recv_execute(ThriftHive.java:130)
   at 
 org.apache.hadoop.hive.service.ThriftHive$Client.execute(ThriftHive.java:109) 
   locked 0x7f0d5d3f0878 (a org.apache.thrift.transport.TSocket)
   at 
 org.apache.hadoop.hive.jdbc.HiveStatement.executeQuery(HiveStatement.java:218)
   at 
 org.apache.hadoop.hive.jdbc.HiveStatement.execute(HiveStatement.java:154)
 {noformat}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2343) stats not updated for non load table desc operations

2011-08-03 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13079004#comment-13079004
 ] 

jirapos...@reviews.apache.org commented on HIVE-2343:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1279/
---

Review request for Ning Zhang.


Summary
---

table stats were not being updated for non load table desc plans due to NPE.


This addresses bug hive-2343.
https://issues.apache.org/jira/browse/hive-2343


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/StatsTask.java 1153601 
  trunk/ql/src/test/queries/clientpositive/stats16.q PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/stats16.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1279/diff


Testing
---

all clientpositive/stats* unit tests pass


Thanks,

Franklin



 stats not updated for non load table desc operations
 --

 Key: HIVE-2343
 URL: https://issues.apache.org/jira/browse/HIVE-2343
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Franklin Hu
Assignee: Franklin Hu
 Fix For: 0.8.0

 Attachments: hive-2343.1.patch


 Bug introduced in HIVE-306 so that stats are updated only for LoadTableDesc 
 operations. For other operations (analyze table), null ptr is thrown and 
 stats are not updated.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2337) Predicate pushdown erroneously conservative with outer joins

2011-08-03 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13079024#comment-13079024
 ] 

jirapos...@reviews.apache.org commented on HIVE-2337:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1275/
---

(Updated 2011-08-03 21:02:11.625203)


Review request for hive.


Changes
---

Added test cases, fixed comment


Summary
---

Initial patch


This addresses bug HIVE-2337.
https://issues.apache.org/jira/browse/HIVE-2337


Diffs (updated)
-

  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java
 1153598 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join5.q.out
 PRE-CREATION 

Diff: https://reviews.apache.org/r/1275/diff


Testing
---


Thanks,

Charles



 Predicate pushdown erroneously conservative with outer joins
 

 Key: HIVE-2337
 URL: https://issues.apache.org/jira/browse/HIVE-2337
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Charles Chen
Assignee: Charles Chen
 Attachments: HIVE-2337v1.patch, HIVE-2337v2.patch


 The predicate pushdown filter is not applying left associativity of joins 
 correctly in determining possible aliases for pushing predicates.
 In hive.ql.ppd.OpProcFactory.JoinPPD.getQualifiedAliases, the criteria for 
 pushing aliases is specified as:
 {noformat}
 /**
  * Figures out the aliases for whom it is safe to push predicates based on
  * ANSI SQL semantics For inner join, all predicates for all aliases can 
 be
  * pushed For full outer join, none of the predicates can be pushed as 
 that
  * would limit the number of rows for join For left outer join, all the
  * predicates on the left side aliases can be pushed up For right outer
  * join, all the predicates on the right side aliases can be pushed up 
 Joins
  * chain containing both left and right outer joins are treated as full
  * outer join. [...]
  *
  * @param op
  *  Join Operator
  * @param rr
  *  Row resolver
  * @return set of qualified aliases
  */
 {noformat}
 Since hive joins are left associative, something like a RIGHT OUTER JOIN b 
 LEFT OUTER JOIN c INNER JOIN d should be interpreted as ((a RIGHT OUTER 
 JOIN b) LEFT OUTER JOIN c) INNER JOIN d, so there would be cases where joins 
 with both left and right outer joins can have aliases that can be pushed.  
 Here, aliases b and d are eligible to be pushed up while the current criteria 
 provide that none are eligible.
 Using:
 {noformat}
 create table t1 (id int, key string, value string);
 create table t2 (id int, key string, value string);
 create table t3 (id int, key string, value string);
 create table t4 (id int, key string, value string);
 {noformat}
 For example, the query
 {noformat}
 explain select * from t1 full outer join t2 on t1.id=t2.id join t3 on 
 t2.id=t3.id where t3.id=20; 
 {noformat}
 currently gives
 {noformat}
 STAGE DEPENDENCIES:
   Stage-1 is a root stage
   Stage-0 is a root stage
 STAGE PLANS:
   Stage: Stage-1
 Map Reduce
   Alias - Map Operator Tree:
 t1 
   TableScan
 alias: t1
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: id
 type: int
   tag: 0
   value expressions:
 expr: id
 type: int
 expr: key
 type: string
 expr: value
 type: string
 t2 
   TableScan
 alias: t2
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: id
 type: int
   tag: 1
   value expressions:
 expr: id
 type: int
 expr: key
 type: string
 expr: value
 type: string
 t3 
   TableScan
 alias: t3
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: id
  

[jira] [Commented] (HIVE-2337) Predicate pushdown erroneously conservative with outer joins

2011-08-03 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13079023#comment-13079023
 ] 

jirapos...@reviews.apache.org commented on HIVE-2337:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1275/
---

(Updated 2011-08-03 21:03:23.393902)


Review request for hive.


Summary (updated)
---

https://issues.apache.org/jira/browse/HIVE-2337


This addresses bug HIVE-2337.
https://issues.apache.org/jira/browse/HIVE-2337


Diffs
-

  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java
 1153598 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join5.q.out
 PRE-CREATION 

Diff: https://reviews.apache.org/r/1275/diff


Testing
---


Thanks,

Charles



 Predicate pushdown erroneously conservative with outer joins
 

 Key: HIVE-2337
 URL: https://issues.apache.org/jira/browse/HIVE-2337
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Charles Chen
Assignee: Charles Chen
 Attachments: HIVE-2337v1.patch, HIVE-2337v2.patch


 The predicate pushdown filter is not applying left associativity of joins 
 correctly in determining possible aliases for pushing predicates.
 In hive.ql.ppd.OpProcFactory.JoinPPD.getQualifiedAliases, the criteria for 
 pushing aliases is specified as:
 {noformat}
 /**
  * Figures out the aliases for whom it is safe to push predicates based on
  * ANSI SQL semantics For inner join, all predicates for all aliases can 
 be
  * pushed For full outer join, none of the predicates can be pushed as 
 that
  * would limit the number of rows for join For left outer join, all the
  * predicates on the left side aliases can be pushed up For right outer
  * join, all the predicates on the right side aliases can be pushed up 
 Joins
  * chain containing both left and right outer joins are treated as full
  * outer join. [...]
  *
  * @param op
  *  Join Operator
  * @param rr
  *  Row resolver
  * @return set of qualified aliases
  */
 {noformat}
 Since hive joins are left associative, something like a RIGHT OUTER JOIN b 
 LEFT OUTER JOIN c INNER JOIN d should be interpreted as ((a RIGHT OUTER 
 JOIN b) LEFT OUTER JOIN c) INNER JOIN d, so there would be cases where joins 
 with both left and right outer joins can have aliases that can be pushed.  
 Here, aliases b and d are eligible to be pushed up while the current criteria 
 provide that none are eligible.
 Using:
 {noformat}
 create table t1 (id int, key string, value string);
 create table t2 (id int, key string, value string);
 create table t3 (id int, key string, value string);
 create table t4 (id int, key string, value string);
 {noformat}
 For example, the query
 {noformat}
 explain select * from t1 full outer join t2 on t1.id=t2.id join t3 on 
 t2.id=t3.id where t3.id=20; 
 {noformat}
 currently gives
 {noformat}
 STAGE DEPENDENCIES:
   Stage-1 is a root stage
   Stage-0 is a root stage
 STAGE PLANS:
   Stage: Stage-1
 Map Reduce
   Alias - Map Operator Tree:
 t1 
   TableScan
 alias: t1
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: id
 type: int
   tag: 0
   value expressions:
 expr: id
 type: int
 expr: key
 type: string
 expr: value
 type: string
 t2 
   TableScan
 alias: t2
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: id
 type: int
   tag: 1
   value expressions:
 expr: id
 type: int
 expr: key
 type: string
 expr: value
 type: string
 t3 
   TableScan
 alias: t3
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: id
 type: int

[jira] [Commented] (HIVE-2337) Predicate pushdown erroneously conservative with outer joins

2011-08-03 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13079030#comment-13079030
 ] 

jirapos...@reviews.apache.org commented on HIVE-2337:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1275/
---

(Updated 2011-08-03 21:13:25.076077)


Review request for hive.


Changes
---

Fix testcase


Summary
---

https://issues.apache.org/jira/browse/HIVE-2337


This addresses bug HIVE-2337.
https://issues.apache.org/jira/browse/HIVE-2337


Diffs (updated)
-

  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java
 1153598 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/ppd_outer_join5.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/ppd_outer_join5.q.out
 PRE-CREATION 

Diff: https://reviews.apache.org/r/1275/diff


Testing
---


Thanks,

Charles



 Predicate pushdown erroneously conservative with outer joins
 

 Key: HIVE-2337
 URL: https://issues.apache.org/jira/browse/HIVE-2337
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Charles Chen
Assignee: Charles Chen
 Attachments: HIVE-2337v1.patch, HIVE-2337v2.patch


 The predicate pushdown filter is not applying left associativity of joins 
 correctly in determining possible aliases for pushing predicates.
 In hive.ql.ppd.OpProcFactory.JoinPPD.getQualifiedAliases, the criteria for 
 pushing aliases is specified as:
 {noformat}
 /**
  * Figures out the aliases for whom it is safe to push predicates based on
  * ANSI SQL semantics For inner join, all predicates for all aliases can 
 be
  * pushed For full outer join, none of the predicates can be pushed as 
 that
  * would limit the number of rows for join For left outer join, all the
  * predicates on the left side aliases can be pushed up For right outer
  * join, all the predicates on the right side aliases can be pushed up 
 Joins
  * chain containing both left and right outer joins are treated as full
  * outer join. [...]
  *
  * @param op
  *  Join Operator
  * @param rr
  *  Row resolver
  * @return set of qualified aliases
  */
 {noformat}
 Since hive joins are left associative, something like a RIGHT OUTER JOIN b 
 LEFT OUTER JOIN c INNER JOIN d should be interpreted as ((a RIGHT OUTER 
 JOIN b) LEFT OUTER JOIN c) INNER JOIN d, so there would be cases where joins 
 with both left and right outer joins can have aliases that can be pushed.  
 Here, aliases b and d are eligible to be pushed up while the current criteria 
 provide that none are eligible.
 Using:
 {noformat}
 create table t1 (id int, key string, value string);
 create table t2 (id int, key string, value string);
 create table t3 (id int, key string, value string);
 create table t4 (id int, key string, value string);
 {noformat}
 For example, the query
 {noformat}
 explain select * from t1 full outer join t2 on t1.id=t2.id join t3 on 
 t2.id=t3.id where t3.id=20; 
 {noformat}
 currently gives
 {noformat}
 STAGE DEPENDENCIES:
   Stage-1 is a root stage
   Stage-0 is a root stage
 STAGE PLANS:
   Stage: Stage-1
 Map Reduce
   Alias - Map Operator Tree:
 t1 
   TableScan
 alias: t1
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: id
 type: int
   tag: 0
   value expressions:
 expr: id
 type: int
 expr: key
 type: string
 expr: value
 type: string
 t2 
   TableScan
 alias: t2
 Reduce Output Operator
   key expressions:
 expr: id
 type: int
   sort order: +
   Map-reduce partition columns:
 expr: id
 type: int
   tag: 1
   value expressions:
 expr: id
 type: int
 expr: key
 type: string
 expr: value
 type: string
 t3 
   TableScan
 alias: t3
 Reduce Output Operator
   key expressions:
 expr: id

[jira] [Commented] (HIVE-2342) mirror.facebook.net is 404ing

2011-08-03 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13079047#comment-13079047
 ] 

jirapos...@reviews.apache.org commented on HIVE-2342:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1280/
---

Review request for hive and John Sichi.


Summary
---

Review for HIVE-2342.


This addresses bug HIVE-2342.
https://issues.apache.org/jira/browse/HIVE-2342


Diffs
-

  build.properties f85221f 
  ivy/ivysettings.xml 9e8dfe6 

Diff: https://reviews.apache.org/r/1280/diff


Testing
---


Thanks,

Carl



 mirror.facebook.net is 404ing
 -

 Key: HIVE-2342
 URL: https://issues.apache.org/jira/browse/HIVE-2342
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Reporter: Andrew Bayer
 Attachments: HIVE-2342.1.patch.txt


 http://mirror.facebook.net/ and everything under it is 404ing, which is 
 blocking any attempt to build Hive from working.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1734) Implement map_keys() and map_values() UDFs

2011-08-03 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13079060#comment-13079060
 ] 

jirapos...@reviews.apache.org commented on HIVE-1734:
-



bq.  On 2011-08-01 10:03:11, Amareshwari Sriramadasu wrote:
bq.   ql/src/test/results/clientpositive/udf_map_keys.q.out, line 40
bq.   https://reviews.apache.org/r/1082/diff/1/?file=22333#file22333line40
bq.  
bq.   Shouldn't the output be [a, b, c] here?

The set of key/value pairs in a Map is unordered, and the same property also 
applies if you look only at the set of keys, or the set of values. 
Unfortunately, Hive's type system does not have a 'set' type, so we're forced 
to return the result of map_keys() and map_values() as an array, which implies 
that the set is ordered when it really isn't. If you want to apply a specific 
ordering property to the output of map_keys() or map_values(), then you would 
have to do this with a sort(array) UDF (which is not yet implemented).


bq.  On 2011-08-01 10:03:11, Amareshwari Sriramadasu wrote:
bq.   ql/src/test/results/clientpositive/udf_map_values.q.out, line 40
bq.   https://reviews.apache.org/r/1082/diff/1/?file=22334#file22334line40
bq.  
bq.   Shouldn't the output be [1,2,3] instead of [2,1,3]?

No. See previous comment.


- Carl


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1082/#review1247
---


On 2011-07-12 00:40:11, Carl Steinbach wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1082/
bq.  ---
bq.  
bq.  (Updated 2011-07-12 00:40:11)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  This patch adds map_keys(map) and map_values(map) UDFs.
bq.  
bq.  
bq.  This addresses bug HIVE-1734.
bq.  https://issues.apache.org/jira/browse/HIVE-1734
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.ql/src/test/results/clientpositive/udf_map_values.q.out PRE-CREATION 
bq.ql/src/test/results/clientpositive/udf_map_keys.q.out PRE-CREATION 
bq.ql/src/test/queries/clientpositive/udf_map_values.q PRE-CREATION 
bq.ql/src/test/results/clientnegative/udf_map_keys_arg_num.q.out 
PRE-CREATION 
bq.ql/src/test/results/clientnegative/udf_map_keys_arg_type.q.out 
PRE-CREATION 
bq.ql/src/test/results/clientnegative/udf_map_values_arg_num.q.out 
PRE-CREATION 
bq.ql/src/test/results/clientnegative/udf_map_values_arg_type.q.out 
PRE-CREATION 
bq.ql/src/test/results/clientpositive/show_functions.q.out d4f6c98 
bq.ql/src/test/queries/clientnegative/udf_map_values_arg_num.q PRE-CREATION 
bq.ql/src/test/queries/clientnegative/udf_map_values_arg_type.q 
PRE-CREATION 
bq.ql/src/test/queries/clientpositive/udf_map_keys.q PRE-CREATION 
bq.ql/src/java/org/apache/hadoop/hive/ql/exec/FunctionRegistry.java 16a207e 
bq.ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFMapKeys.java 
PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFMapValues.java 
PRE-CREATION 
bq.ql/src/test/queries/clientnegative/udf_map_keys_arg_num.q PRE-CREATION 
bq.ql/src/test/queries/clientnegative/udf_map_keys_arg_type.q PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1082/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Carl
bq.  
bq.



 Implement map_keys() and map_values() UDFs
 --

 Key: HIVE-1734
 URL: https://issues.apache.org/jira/browse/HIVE-1734
 Project: Hive
  Issue Type: New Feature
  Components: UDF
Affects Versions: 0.6.0
Reporter: Carl Steinbach
Assignee: Carl Steinbach
 Attachments: HIVE-1734-mapudf.1.patch.txt, MapKeys.java, 
 MapValues.java


 Implement the following UDFs:
 array map_keys(map)
 and
 array map_values(map)
 map_keys() takes a map as input and returns an array consisting of the key 
 values in the supplied map.
 Similarly, map_values() takes a map as input and returns an array containing 
 the map value fields.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2338) Alter table always throws an unhelpful error on failure

2011-08-03 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13079130#comment-13079130
 ] 

jirapos...@reviews.apache.org commented on HIVE-2338:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1276/
---

(Updated 2011-08-03 23:59:06.774216)


Review request for hive and Paul Yang.


Changes
---

-remove unecessary comment


Summary
---

Every failure in an alter table function always return a MetaException. When 
altering tables and catching exceptions, we throw a MetaException in the 
finally part of a try-catch-finally block, which overrides any other 
exceptions thrown.

This patch moves the code to throw an error on a commit failure outside of the 
try-catch-finally block.  So any errors within the try block are thrown 
properly.  Only when success == false will we throw a MetaException.


This addresses bug HIVE-2338.
https://issues.apache.org/jira/browse/HIVE-2338


Diffs (updated)
-

  
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveAlterHandler.java 
1152020 

Diff: https://reviews.apache.org/r/1276/diff


Testing
---

Tested failing alter table metastore commands; they threw 
InvalidOperationExceptions as they should, instead of meaningless 
MetaExceptions.


Thanks,

Sohan



 Alter table always throws an unhelpful error on failure
 ---

 Key: HIVE-2338
 URL: https://issues.apache.org/jira/browse/HIVE-2338
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Reporter: Sohan Jain
Assignee: Sohan Jain
Priority: Minor
 Attachments: HIVE-2338.1.patch, HIVE-2338.2.patch


 Every failure in an alter table function always return a MetaException. When 
 altering tables and catching exceptions, we throw a MetaException in the 
 finally part of a try-catch-finally block, which overrides any other 
 exceptions thrown.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2171) Allow custom serdes to set field comments

2011-08-03 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13079136#comment-13079136
 ] 

jirapos...@reviews.apache.org commented on HIVE-2171:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/790/
---

(Updated 2011-08-04 00:12:21.542815)


Review request for hive.


Changes
---

Updated patch based on review comments.


Summary
---

HIVE-2171: Allow custom serdes to set field comments


This addresses bug HIVE-2171.
https://issues.apache.org/jira/browse/HIVE-2171


Diffs (updated)
-

  ivy/libraries.properties af856bd 
  metastore/src/java/org/apache/hadoop/hive/metastore/MetaStoreUtils.java 
c1fa4e5 
  serde/ivy.xml d6c836a 
  
serde/src/java/org/apache/hadoop/hive/serde2/lazy/objectinspector/LazyObjectInspectorFactory.java
 4850601 
  
serde/src/java/org/apache/hadoop/hive/serde2/lazy/objectinspector/LazySimpleStructObjectInspector.java
 e2fa9db 
  
serde/src/java/org/apache/hadoop/hive/serde2/lazybinary/objectinspector/LazyBinaryObjectInspectorFactory.java
 2947e49 
  
serde/src/java/org/apache/hadoop/hive/serde2/lazybinary/objectinspector/LazyBinaryStructObjectInspector.java
 3d5408f 
  
serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ColumnarStructObjectInspector.java
 881c3c1 
  
serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/MetadataListStructObjectInspector.java
 bd42a0c 
  
serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ObjectInspectorFactory.java
 0c8cc42 
  
serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ReflectionStructObjectInspector.java
 4a934c5 
  
serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/StandardStructObjectInspector.java
 3b26e45 
  serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/StructField.java 
62c3017 
  
serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/UnionStructObjectInspector.java
 76ff736 
  serde/src/test/org/apache/hadoop/hive/serde2/TestSerdeWithFieldComments.java 
PRE-CREATION 
  
serde/src/test/org/apache/hadoop/hive/serde2/objectinspector/TestStandardObjectInspectors.java
 f139ea5 

Diff: https://reviews.apache.org/r/790/diff


Testing
---

New unit test and refactor existing unit test.


Thanks,

Jakob



 Allow custom serdes to set field comments
 -

 Key: HIVE-2171
 URL: https://issues.apache.org/jira/browse/HIVE-2171
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.7.0
Reporter: Jakob Homan
Assignee: Jakob Homan
 Attachments: HIVE-2171-2.patch, HIVE-2171.patch


 Currently, while serde implementations can set a field's name, they can't set 
 its comment.  These are set in the metastore utils to {{(from 
 deserializer)}}.  For those serdes that can provide meaningful comments for a 
 field, they should be propagated to the table description.  These 
 serde-provided comments could be prepended to (from deserializer) if others 
 feel that's a meaningful distinction.  This change involves updating 
 {{StructField}} to support a (possibly null) comment field and then 
 propagating this change out to the myriad places {{StructField}} is thrown 
 around.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1735) Extend Explode UDTF to handle Maps

2011-08-03 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13079200#comment-13079200
 ] 

jirapos...@reviews.apache.org commented on HIVE-1735:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1256/#review1288
---

Ship it!


+1. Will commit if tests pass.

- Carl


On 2011-08-03 04:45:10, Amareshwari Sriramadasu wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1256/
bq.  ---
bq.  
bq.  (Updated 2011-08-03 04:45:10)
bq.  
bq.  
bq.  Review request for hive and Carl Steinbach.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Changes from last patch:
bq.  
bq.  Re-based to trunk.
bq.  Removed assert not null statements for list and map.
bq.  Updated explode_null.q to test NULL values for map also.
bq.  
bq.  
bq.  This addresses bug HIVE-1735.
bq.  https://issues.apache.org/jira/browse/HIVE-1735
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDTFExplode.java 
1151047 
bq.trunk/ql/src/test/queries/clientnegative/udtf_explode_not_supported1.q 
PRE-CREATION 
bq.trunk/ql/src/test/queries/clientnegative/udtf_explode_not_supported2.q 
PRE-CREATION 
bq.trunk/ql/src/test/queries/clientnegative/udtf_explode_not_supported3.q 
PRE-CREATION 
bq.trunk/ql/src/test/queries/clientnegative/udtf_explode_not_supported4.q 
PRE-CREATION 
bq.trunk/ql/src/test/queries/clientpositive/explode_null.q 1151047 
bq.trunk/ql/src/test/queries/clientpositive/udf_explode.q 1151047 
bq.trunk/ql/src/test/queries/clientpositive/udtf_explode.q 1151047 
bq.
trunk/ql/src/test/results/clientnegative/udtf_explode_not_supported1.q.out 
PRE-CREATION 
bq.
trunk/ql/src/test/results/clientnegative/udtf_explode_not_supported2.q.out 
PRE-CREATION 
bq.
trunk/ql/src/test/results/clientnegative/udtf_explode_not_supported3.q.out 
PRE-CREATION 
bq.
trunk/ql/src/test/results/clientnegative/udtf_explode_not_supported4.q.out 
PRE-CREATION 
bq.trunk/ql/src/test/results/clientpositive/explode_null.q.out 1151047 
bq.trunk/ql/src/test/results/clientpositive/udf_explode.q.out 1151047 
bq.trunk/ql/src/test/results/clientpositive/udtf_explode.q.out 1151047 
bq.  
bq.  Diff: https://reviews.apache.org/r/1256/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  All tests passed with the patch.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Amareshwari
bq.  
bq.



 Extend Explode UDTF to handle Maps
 --

 Key: HIVE-1735
 URL: https://issues.apache.org/jira/browse/HIVE-1735
 Project: Hive
  Issue Type: New Feature
  Components: UDF
Reporter: Carl Steinbach
Assignee: Jonathan Natkins
 Fix For: 0.8.0

 Attachments: HIVE-1735.1.patch, HIVE-1735.2.patch.txt, 
 HIVE-1735.2.patch.txt, HIVE-1735.3.patch.txt


 The explode() UDTF currently only accepts arrays as input. We should modify it
 so that it can also handle map inputs, in which case it will output two 
 columns
 corresponding to the key and value fields.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2331) Turn off compression when generating index intermediate results

2011-08-02 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13078366#comment-13078366
 ] 

jirapos...@reviews.apache.org commented on HIVE-2331:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1254/
---

(Updated 2011-08-02 18:58:32.937434)


Review request for hive and John Sichi.


Changes
---

Regenerated the testcase outputs since they were mixed up previously.


Summary
---

add a parameter to the hiveConf when generating the index query to ensure that 
the output is uncompressed


This addresses bug HIVE-2331.
https://issues.apache.org/jira/browse/HIVE-2331


Diffs (updated)
-

  ql/src/java/org/apache/hadoop/hive/ql/index/bitmap/BitmapIndexHandler.java 
c93c852 
  ql/src/java/org/apache/hadoop/hive/ql/index/compact/CompactIndexHandler.java 
f4b7c15 
  ql/src/test/queries/clientpositive/index_bitmap_compression.q PRE-CREATION 
  ql/src/test/queries/clientpositive/index_compression.q PRE-CREATION 
  ql/src/test/results/clientpositive/index_bitmap_compression.q.out 
PRE-CREATION 
  ql/src/test/results/clientpositive/index_compression.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1254/diff


Testing
---

added index_compression.q and index_bitmap_compression.q to test both index 
handlers for this case.


Thanks,

Syed



 Turn off compression when generating index intermediate results
 ---

 Key: HIVE-2331
 URL: https://issues.apache.org/jira/browse/HIVE-2331
 Project: Hive
  Issue Type: Bug
Reporter: Syed S. Albiz
Assignee: Syed S. Albiz
 Attachments: HIVE-2331.1.patch, HIVE-2331.2.patch


 HiveIndexResult is not compression-aware, so for any index to work 
 (regardless of compact/bitmap) we need to not compress the index intermediate 
 file when we generate it.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2286) ClassCastException when building index with security.authorization turned on

2011-08-02 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13078382#comment-13078382
 ] 

jirapos...@reviews.apache.org commented on HIVE-2286:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1137/
---

(Updated 2011-08-02 19:21:24.017675)


Review request for hive, John Sichi and Ning Zhang.


Changes
---

fixed merge conflicts on correct branch


Summary
---

Save the original HiveOperation/commandType when we generate the index builder 
task and restore it after we're done generating the task so that the 
authorization checks make the right decision when deciding what to do.


This addresses bug HIVE-2286.
https://issues.apache.org/jira/browse/HIVE-2286


Diffs (updated)
-

  cli/src/java/org/apache/hadoop/hive/cli/CliDriver.java 9fa7bc6 
  ql/src/java/org/apache/hadoop/hive/ql/Driver.java 602626f 
  ql/src/test/queries/clientpositive/index_auth.q PRE-CREATION 
  ql/src/test/results/clientnegative/alter_view_failure2.q.out 5915b4f 
  ql/src/test/results/clientnegative/alter_view_failure4.q.out 97d6b18 
  ql/src/test/results/clientnegative/create_or_replace_view1.q.out 97bfa21 
  ql/src/test/results/clientnegative/create_or_replace_view2.q.out 8edac34 
  ql/src/test/results/clientnegative/index_compact_entry_limit.q.out fcb2673 
  ql/src/test/results/clientnegative/index_compact_size_limit.q.out fcb2673 
  ql/src/test/results/clientpositive/create_or_replace_view.q.out fb72971 
  ql/src/test/results/clientpositive/create_view_partitioned.q.out 3b4dd10 
  ql/src/test/results/clientpositive/index_auth.q.out PRE-CREATION 
  ql/src/test/results/clientpositive/index_auto.q.out 8d65f98 
  ql/src/test/results/clientpositive/index_auto_file_format.q.out 194b35e 
  ql/src/test/results/clientpositive/index_auto_multiple.q.out 6b81fc3 
  ql/src/test/results/clientpositive/index_auto_partitioned.q.out b0635db 
  ql/src/test/results/clientpositive/index_auto_unused.q.out 3631bbc 
  ql/src/test/results/clientpositive/index_bitmap.q.out 8f41ce3 
  ql/src/test/results/clientpositive/index_bitmap1.q.out 9f638f5 
  ql/src/test/results/clientpositive/index_bitmap2.q.out e901477 
  ql/src/test/results/clientpositive/index_bitmap3.q.out 116c973 
  ql/src/test/results/clientpositive/index_bitmap_auto.q.out cc9d91e 
  ql/src/test/results/clientpositive/index_bitmap_auto_partitioned.q.out 
aa1f60c 
  ql/src/test/results/clientpositive/index_bitmap_rc.q.out 9bd3c98 
  ql/src/test/results/clientpositive/index_compact.q.out c339ec9 
  ql/src/test/results/clientpositive/index_compact_1.q.out 34ba3ca 
  ql/src/test/results/clientpositive/index_compact_2.q.out e8ce238 
  ql/src/test/results/clientpositive/index_compact_3.q.out d39556d 
  ql/src/test/results/clientpositive/index_creation.q.out 532f07e 
  ql/src/test/results/clientpositive/show_indexes_edge_cases.q.out 709bdc0 
  ql/src/test/results/clientpositive/show_indexes_syntax.q.out 2c2a4af 

Diff: https://reviews.apache.org/r/1137/diff


Testing
---

Added new testcase to TestCliDriver: index_auth.q


Thanks,

Syed



 ClassCastException when building index with security.authorization turned on
 

 Key: HIVE-2286
 URL: https://issues.apache.org/jira/browse/HIVE-2286
 Project: Hive
  Issue Type: Bug
Reporter: Syed S. Albiz
Assignee: Syed S. Albiz
 Attachments: HIVE-2286.1.patch, HIVE-2286.2.patch, HIVE-2286.6.patch, 
 HIVE-2286.7.patch, HIVE-2286.8.patch, HIVE-2286.9.patch


 When trying to build an index with authorization checks turned on, hive 
 issues the following ClassCastException:
 org.apache.hadoop.hive.ql.parse.DDLSemanticAnalyzer cannot be cast to
 org.apache.hadoop.hive.ql.parse.SemanticAnalyzer
  at
 org.apache.hadoop.hive.ql.Driver.doAuthorization(Driver.java:540)
  at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:431)
  at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:335)
  at org.apache.hadoop.hive.ql.Driver.run(Driver.java:848)
  at
 org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:224)
  at
 org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:358)
  at
 org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:293)
  at
 org.apache.hadoop.hive.cli.CliDriver.processReader(CliDriver.java:385)
  at
 org.apache.hadoop.hive.cli.CliDriver.processFile(CliDriver.java:392)
  at
 org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:567)
  at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
  at
 

[jira] [Commented] (HIVE-2020) Create a separate namespace for Hive variables

2011-08-02 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13078474#comment-13078474
 ] 

jirapos...@reviews.apache.org commented on HIVE-2020:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1204/#review1273
---


This patch doesn't appear to address most of the issues described in HIVE-2020, 
which covers adding a separate namespace for variables that is distinct from 
hiveconf and environment/system properties. This patch also needs a testcase.


cli/src/java/org/apache/hadoop/hive/cli/DefaultPreprocessor.java
https://reviews.apache.org/r/1204/#comment2906

Please extend/modify/use the code in 
org.apache.hadoop.hive.ql.parse.VariableSubstitution instead of adding this 
class.



cli/src/java/org/apache/hadoop/hive/cli/OptionsProcessor.java
https://reviews.apache.org/r/1204/#comment2905

I think these two option definitions can be combined, e.g:

.withArgName(key=value)
.withLongOpt(define)
.create(d);


- Carl


On 2011-07-27 21:07:40, Vaibhav Aggarwal wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1204/
bq.  ---
bq.  
bq.  (Updated 2011-07-27 21:07:40)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Create a separate option for Hive variable
bq.  
bq.  
bq.  This addresses bug HIVE-2020.
bq.  https://issues.apache.org/jira/browse/HIVE-2020
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.cli/src/java/org/apache/hadoop/hive/cli/CliDriver.java 9fa7bc6 
bq.cli/src/java/org/apache/hadoop/hive/cli/DefaultPreprocessor.java 
PRE-CREATION 
bq.cli/src/java/org/apache/hadoop/hive/cli/KeyValue.java PRE-CREATION 
bq.cli/src/java/org/apache/hadoop/hive/cli/OptionsProcessor.java 90084ed 
bq.cli/src/java/org/apache/hadoop/hive/cli/Preprocessor.java PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1204/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Vaibhav
bq.  
bq.



 Create a separate namespace for Hive variables
 --

 Key: HIVE-2020
 URL: https://issues.apache.org/jira/browse/HIVE-2020
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Reporter: Carl Steinbach
Assignee: Vaibhav Aggarwal
 Attachments: HIVE-2020.patch


 Support for variable substitution was added in HIVE-1096. However, variable 
 substitution was implemented by reusing the HiveConf namespace, so there is 
 no separation between Hive configuration properties and Hive variables.
 This ticket encompasses the following enhancements:
 * Create a separate namespace for managing Hive variables.
 * Add support for setting variables on the command line via '-hivevar x=y'
 * Add support for setting variables through the CLI via 'var x=y'
 * Add support for referencing variables in statements using either 
 '${hivevar:var_name}' or '${var_name}'
 * Provide a means for differentiating between hiveconf, hivevar, system, and 
 environment properties in the output of 'set -v'

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2335) Indexes are still automatically queried when out of sync with their source tables

2011-08-02 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13078497#comment-13078497
 ] 

jirapos...@reviews.apache.org commented on HIVE-2335:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1260/
---

Review request for hive, John Sichi, Yongqiang He, and Ning Zhang.


Summary
---

Check the timestamps of the tables/partitions to ensure that they were not 
written to since the index was last generated


This addresses bug HIVE-2335.
https://issues.apache.org/jira/browse/HIVE-2335


Diffs
-

  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/index/IndexWhereProcessor.java
 8295687 
  ql/src/test/queries/clientpositive/index_stale.q PRE-CREATION 
  ql/src/test/queries/clientpositive/index_stale_partitioned.q PRE-CREATION 
  ql/src/test/results/clientpositive/index_stale.q.out PRE-CREATION 
  ql/src/test/results/clientpositive/index_stale_partitioned.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1260/diff


Testing
---

added index_stale.q and index_stale_partitioned.q to the testcase suite


Thanks,

Syed



 Indexes are still automatically queried when out of sync with their source 
 tables
 -

 Key: HIVE-2335
 URL: https://issues.apache.org/jira/browse/HIVE-2335
 Project: Hive
  Issue Type: Bug
Reporter: Syed S. Albiz
Assignee: Syed S. Albiz
 Attachments: HIVE-2335.1.patch


 The automatic index usage does not check whether or not the indexes are still 
 up-to-date when generating the index queries. This can be addressed in two 
 stages, the first is to add a check before generating the index query to 
 ensure that the index is still valid. The next stage may be to add some sort 
 of mode where indexes are automatically updated on table writes.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2337) Predicate pushdown erroneously conservative with outer joins

2011-08-02 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13078551#comment-13078551
 ] 

jirapos...@reviews.apache.org commented on HIVE-2337:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1275/
---

Review request for hive.


Summary
---

Initial patch


This addresses bug HIVE-2337.
https://issues.apache.org/jira/browse/HIVE-2337


Diffs
-

  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java
 1153293 

Diff: https://reviews.apache.org/r/1275/diff


Testing
---


Thanks,

Charles



 Predicate pushdown erroneously conservative with outer joins
 

 Key: HIVE-2337
 URL: https://issues.apache.org/jira/browse/HIVE-2337
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Charles Chen
Assignee: Charles Chen
 Attachments: HIVE-2337v1.patch


 The predicate pushdown filter is not applying left associativity of joins 
 correctly in determining possible aliases for pushing predicates.
 In hive.ql.ppd.OpProcFactory.JoinPPD.getQualifiedAliases, the criteria for 
 pushing aliases is specified as:
 {noformat}
 /**
  * Figures out the aliases for whom it is safe to push predicates based on
  * ANSI SQL semantics For inner join, all predicates for all aliases can 
 be
  * pushed For full outer join, none of the predicates can be pushed as 
 that
  * would limit the number of rows for join For left outer join, all the
  * predicates on the left side aliases can be pushed up For right outer
  * join, all the predicates on the right side aliases can be pushed up 
 Joins
  * chain containing both left and right outer joins are treated as full
  * outer join. [...]
  *
  * @param op
  *  Join Operator
  * @param rr
  *  Row resolver
  * @return set of qualified aliases
  */
 {noformat}
 Since hive joins are left associative, something like a RIGHT OUTER JOIN b 
 LEFT OUTER JOIN c INNER JOIN d should be interpreted as ((a RIGHT OUTER 
 JOIN b) LEFT OUTER JOIN c) INNER JOIN d, so there would be cases where joins 
 with both left and right outer joins can have aliases that can be pushed.  
 Here, aliases a and d are eligible to be pushed up while the current criteria 
 provide that none are eligible.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1734) Implement map_keys() and map_values() UDFs

2011-08-01 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13073480#comment-13073480
 ] 

jirapos...@reviews.apache.org commented on HIVE-1734:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1082/#review1247
---



ql/src/test/results/clientpositive/udf_map_keys.q.out
https://reviews.apache.org/r/1082/#comment2850

Shouldn't the output be [a, b, c] here?



ql/src/test/results/clientpositive/udf_map_values.q.out
https://reviews.apache.org/r/1082/#comment2851

Shouldn't the output be [1,2,3] instead of [2,1,3]? 


- Amareshwari


On 2011-07-12 00:40:11, Carl Steinbach wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1082/
bq.  ---
bq.  
bq.  (Updated 2011-07-12 00:40:11)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  This patch adds map_keys(map) and map_values(map) UDFs.
bq.  
bq.  
bq.  This addresses bug HIVE-1734.
bq.  https://issues.apache.org/jira/browse/HIVE-1734
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.ql/src/test/results/clientpositive/udf_map_values.q.out PRE-CREATION 
bq.ql/src/test/results/clientpositive/udf_map_keys.q.out PRE-CREATION 
bq.ql/src/test/queries/clientpositive/udf_map_values.q PRE-CREATION 
bq.ql/src/test/results/clientnegative/udf_map_keys_arg_num.q.out 
PRE-CREATION 
bq.ql/src/test/results/clientnegative/udf_map_keys_arg_type.q.out 
PRE-CREATION 
bq.ql/src/test/results/clientnegative/udf_map_values_arg_num.q.out 
PRE-CREATION 
bq.ql/src/test/results/clientnegative/udf_map_values_arg_type.q.out 
PRE-CREATION 
bq.ql/src/test/results/clientpositive/show_functions.q.out d4f6c98 
bq.ql/src/test/queries/clientnegative/udf_map_values_arg_num.q PRE-CREATION 
bq.ql/src/test/queries/clientnegative/udf_map_values_arg_type.q 
PRE-CREATION 
bq.ql/src/test/queries/clientpositive/udf_map_keys.q PRE-CREATION 
bq.ql/src/java/org/apache/hadoop/hive/ql/exec/FunctionRegistry.java 16a207e 
bq.ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFMapKeys.java 
PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFMapValues.java 
PRE-CREATION 
bq.ql/src/test/queries/clientnegative/udf_map_keys_arg_num.q PRE-CREATION 
bq.ql/src/test/queries/clientnegative/udf_map_keys_arg_type.q PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1082/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Carl
bq.  
bq.



 Implement map_keys() and map_values() UDFs
 --

 Key: HIVE-1734
 URL: https://issues.apache.org/jira/browse/HIVE-1734
 Project: Hive
  Issue Type: New Feature
  Components: UDF
Affects Versions: 0.6.0
Reporter: Carl Steinbach
Assignee: Carl Steinbach
 Attachments: HIVE-1734-mapudf.1.patch.txt, MapKeys.java, 
 MapValues.java


 Implement the following UDFs:
 array map_keys(map)
 and
 array map_values(map)
 map_keys() takes a map as input and returns an array consisting of the key 
 values in the supplied map.
 Similarly, map_values() takes a map as input and returns an array containing 
 the map value fields.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2326) Turn off bitmap indexing when map-side aggregation is turned off

2011-08-01 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13073669#comment-13073669
 ] 

jirapos...@reviews.apache.org commented on HIVE-2326:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1232/#review1251
---



ql/src/java/org/apache/hadoop/hive/ql/index/bitmap/BitmapIndexHandler.java
https://reviews.apache.org/r/1232/#comment2863

Typo:  wihtout



ql/src/java/org/apache/hadoop/hive/ql/index/bitmap/BitmapIndexHandler.java
https://reviews.apache.org/r/1232/#comment2864

Throw the exception here instead of returning null.


- John


On 2011-07-31 00:26:44, Syed Albiz wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1232/
bq.  ---
bq.  
bq.  (Updated 2011-07-31 00:26:44)
bq.  
bq.  
bq.  Review request for hive, John Sichi and Ning Zhang.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  sets the correct object inspector for PARTIAL2, COMPLETE UDAF modes
bq.  
bq.  
bq.  This addresses bug HIVE-2326.
bq.  https://issues.apache.org/jira/browse/HIVE-2326
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.ql/src/java/org/apache/hadoop/hive/ql/index/TableBasedIndexHandler.java 
02ab78c 
bq.
ql/src/java/org/apache/hadoop/hive/ql/index/bitmap/BitmapIndexHandler.java 
61bbbf5 
bq.
ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFEWAHBitmap.java 
0a3df09 
bq.  
bq.  Diff: https://reviews.apache.org/r/1232/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Syed
bq.  
bq.



 Turn off bitmap indexing when map-side aggregation is turned off
 

 Key: HIVE-2326
 URL: https://issues.apache.org/jira/browse/HIVE-2326
 Project: Hive
  Issue Type: Bug
Reporter: Syed S. Albiz
Assignee: Syed S. Albiz
 Attachments: HIVE-2326.1.patch


 Simply adding the CLUSTER BY clause on the ROW_OFFSET does not work with a 
 GROUP BY clause, causing a SemanticException when trying to compile the the 
 index builder task. Based on conversation with John Sichi, for now we will 
 just turn off this feature.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2326) Turn off bitmap indexing when map-side aggregation is turned off

2011-08-01 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13073796#comment-13073796
 ] 

jirapos...@reviews.apache.org commented on HIVE-2326:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1232/
---

(Updated 2011-08-01 21:27:24.460533)


Review request for hive, John Sichi and Ning Zhang.


Changes
---

removed index_bitmap_auto_no_map_aggr.q from the clientpositive suite


Summary
---

sets the correct object inspector for PARTIAL2, COMPLETE UDAF modes


This addresses bug HIVE-2326.
https://issues.apache.org/jira/browse/HIVE-2326


Diffs (updated)
-

  ql/src/java/org/apache/hadoop/hive/ql/index/TableBasedIndexHandler.java 
02ab78c 
  ql/src/java/org/apache/hadoop/hive/ql/index/bitmap/BitmapIndexHandler.java 
61bbbf5 
  ql/src/java/org/apache/hadoop/hive/ql/index/compact/CompactIndexHandler.java 
7c91946 
  ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFEWAHBitmap.java 
0a3df09 
  ql/src/test/queries/clientnegative/index_bitmap_no_map_aggr.q PRE-CREATION 
  ql/src/test/results/clientnegative/index_bitmap_no_map_aggr.q.out 
PRE-CREATION 

Diff: https://reviews.apache.org/r/1232/diff


Testing
---


Thanks,

Syed



 Turn off bitmap indexing when map-side aggregation is turned off
 

 Key: HIVE-2326
 URL: https://issues.apache.org/jira/browse/HIVE-2326
 Project: Hive
  Issue Type: Bug
Reporter: Syed S. Albiz
Assignee: Syed S. Albiz
 Attachments: HIVE-2326.1.patch, HIVE-2326.2.patch, HIVE-2326.3.patch


 Simply adding the CLUSTER BY clause on the ROW_OFFSET does not work with a 
 GROUP BY clause, causing a SemanticException when trying to compile the the 
 index builder task. Based on conversation with John Sichi, for now we will 
 just turn off this feature.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2286) ClassCastException when building index with security.authorization turned on

2011-08-01 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13073987#comment-13073987
 ] 

jirapos...@reviews.apache.org commented on HIVE-2286:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1137/
---

(Updated 2011-08-01 23:23:46.966525)


Review request for hive, John Sichi and Ning Zhang.


Changes
---

regenerated patch to avoid noise/clutter in testcase output


Summary
---

Save the original HiveOperation/commandType when we generate the index builder 
task and restore it after we're done generating the task so that the 
authorization checks make the right decision when deciding what to do.


This addresses bug HIVE-2286.
https://issues.apache.org/jira/browse/HIVE-2286


Diffs (updated)
-

  cli/src/java/org/apache/hadoop/hive/cli/CliDriver.java 9fa7bc6 
  ql/src/java/org/apache/hadoop/hive/ql/Driver.java b278ffe 
  ql/src/test/queries/clientpositive/index_auth.q PRE-CREATION 
  ql/src/test/results/clientnegative/alter_view_failure2.q.out 5915b4f 
  ql/src/test/results/clientnegative/alter_view_failure4.q.out 97d6b18 
  ql/src/test/results/clientnegative/create_or_replace_view1.q.out 97bfa21 
  ql/src/test/results/clientnegative/create_or_replace_view2.q.out 8edac34 
  ql/src/test/results/clientnegative/index_compact_entry_limit.q.out fcb2673 
  ql/src/test/results/clientnegative/index_compact_size_limit.q.out fcb2673 
  ql/src/test/results/clientpositive/create_or_replace_view.q.out fb72971 
  ql/src/test/results/clientpositive/create_view_partitioned.q.out 3b4dd10 
  ql/src/test/results/clientpositive/index_auth.q.out PRE-CREATION 
  ql/src/test/results/clientpositive/index_auto.q.out 8d65f98 
  ql/src/test/results/clientpositive/index_auto_file_format.q.out 194b35e 
  ql/src/test/results/clientpositive/index_auto_multiple.q.out 6b81fc3 
  ql/src/test/results/clientpositive/index_auto_partitioned.q.out b0635db 
  ql/src/test/results/clientpositive/index_auto_unused.q.out 3631bbc 
  ql/src/test/results/clientpositive/index_bitmap.q.out 8f41ce3 
  ql/src/test/results/clientpositive/index_bitmap1.q.out 9f638f5 
  ql/src/test/results/clientpositive/index_bitmap2.q.out e901477 
  ql/src/test/results/clientpositive/index_bitmap3.q.out 116c973 
  ql/src/test/results/clientpositive/index_bitmap_auto.q.out cc9d91e 
  ql/src/test/results/clientpositive/index_bitmap_auto_partitioned.q.out 
9003eb4 
  ql/src/test/results/clientpositive/index_bitmap_rc.q.out 9bd3c98 
  ql/src/test/results/clientpositive/index_compact.q.out c339ec9 
  ql/src/test/results/clientpositive/index_compact_1.q.out 34ba3ca 
  ql/src/test/results/clientpositive/index_compact_2.q.out e8ce238 
  ql/src/test/results/clientpositive/index_compact_3.q.out d39556d 
  ql/src/test/results/clientpositive/index_creation.q.out 532f07e 
  ql/src/test/results/clientpositive/show_indexes_edge_cases.q.out 709bdc0 
  ql/src/test/results/clientpositive/show_indexes_syntax.q.out 2c2a4af 

Diff: https://reviews.apache.org/r/1137/diff


Testing
---

Added new testcase to TestCliDriver: index_auth.q


Thanks,

Syed



 ClassCastException when building index with security.authorization turned on
 

 Key: HIVE-2286
 URL: https://issues.apache.org/jira/browse/HIVE-2286
 Project: Hive
  Issue Type: Bug
Reporter: Syed S. Albiz
Assignee: Syed S. Albiz
 Attachments: HIVE-2286.1.patch, HIVE-2286.2.patch, HIVE-2286.6.patch


 When trying to build an index with authorization checks turned on, hive 
 issues the following ClassCastException:
 org.apache.hadoop.hive.ql.parse.DDLSemanticAnalyzer cannot be cast to
 org.apache.hadoop.hive.ql.parse.SemanticAnalyzer
  at
 org.apache.hadoop.hive.ql.Driver.doAuthorization(Driver.java:540)
  at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:431)
  at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:335)
  at org.apache.hadoop.hive.ql.Driver.run(Driver.java:848)
  at
 org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:224)
  at
 org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:358)
  at
 org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:293)
  at
 org.apache.hadoop.hive.cli.CliDriver.processReader(CliDriver.java:385)
  at
 org.apache.hadoop.hive.cli.CliDriver.processFile(CliDriver.java:392)
  at
 org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:567)
  at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
  at
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.jav
 a:39)
  at

[jira] [Commented] (HIVE-1694) Accelerate GROUP BY execution using indexes

2011-08-01 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13075964#comment-13075964
 ] 

jirapos...@reviews.apache.org commented on HIVE-1694:
-



bq.  On 2011-07-28 21:40:30, John Sichi wrote:
bq.   ql/src/java/org/apache/hadoop/hive/ql/index/AggregateIndexHandler.java, 
line 61
bq.   https://reviews.apache.org/r/1194/diff/1/?file=27052#file27052line61
bq.  
bq.   Please run ant checkstyle and fix all the formatting discrepancies 
it reports for your new files.
bq.  
bq.  
bq.  Prajakta Kalmegh wrote:
bq.  Done! The code is still having checkstyle formatting errors only for 
places where we have used LinkedHashMap, HashMap and ArrayList. The error 
states Declaring variables, return values or parameters of type 'HashMap' is 
not allowed.

Best practice is to only use interfaces (Map/List) except at the point of 
instantiation where you select a concrete class.  Hive violates this in a 
number of places, and sometimes that forces you to violate it in new code too; 
but otherwise, please follow this one.


bq.  On 2011-07-28 21:40:30, John Sichi wrote:
bq.   ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java, 
line 603
bq.   https://reviews.apache.org/r/1194/diff/1/?file=27062#file27062line603
bq.  
bq.   Not sure why this new constructor is needed...after using it, all 
you do is get the table out of it.
bq.  
bq.  Prajakta Kalmegh wrote:
bq.  The only other constructor option for tableSpec needs the ASTNode as 
one of its parameters. Since we need to construct a new tableSpec using only 
the index table name, and we do not have a ASTNode for this, I need this 
constructor. If you have any other way in mind, please let me know. That would 
be helpful.

I'm asking why you even need to construct a new tableSpec instance.  All you do 
with it is reference ts.tableHandle.  And to create that tableHandle, you can 
just do db.getTable(tableName).  So I don't see the purpose of the tableSpec 
instance.


- John


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1194/#review1212
---


On 2011-07-26 14:44:01, Prajakta Kalmegh wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1194/
bq.  ---
bq.  
bq.  (Updated 2011-07-26 14:44:01)
bq.  
bq.  
bq.  Review request for hive and John Sichi.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  This patch has defined a new AggregateIndexHandler which is used to 
optimize the query plan for groupby queries. 
bq.  
bq.  
bq.  This addresses bug HIVE-1694.
bq.  https://issues.apache.org/jira/browse/HIVE-1694
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.common/src/java/org/apache/hadoop/hive/conf/HiveConf.java b46976f 
bq.ql/src/java/org/apache/hadoop/hive/ql/index/AggregateIndexHandler.java 
PRE-CREATION 
bq.ql/src/java/org/apache/hadoop/hive/ql/index/HiveIndex.java 591c9ff 
bq.ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java 2ca63b3 
bq.ql/src/java/org/apache/hadoop/hive/ql/optimizer/Optimizer.java 590d69a 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyCtx.java 
PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyProcFactory.java
 PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteGBUsingIndex.java 
PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteParseContextGenerator.java
 PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteQueryUsingAggregateIndex.java
 PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteQueryUsingAggregateIndexCtx.java
 PRE-CREATION 
bq.ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java 
77a6dc6 
bq.ql/src/test/queries/clientpositive/ql_rewrite_gbtoidx.q PRE-CREATION 
bq.ql/src/test/results/clientpositive/ql_rewrite_gbtoidx.q.out PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1194/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Prajakta
bq.  
bq.



 Accelerate GROUP BY execution using indexes
 ---

 Key: HIVE-1694
 URL: https://issues.apache.org/jira/browse/HIVE-1694
 Project: Hive
  Issue Type: New Feature
  Components: Indexing, Query Processor
Affects Versions: 0.7.0
Reporter: Nikhil Deshpande
Assignee: Prajakta Kalmegh
 Attachments: HIVE-1694.1.patch.txt, HIVE-1694.2.patch.txt, 
 HIVE-1694.3.patch.txt, 

[jira] [Commented] (HIVE-1850) alter table set serdeproperties bypasses regexps checks (leaves table in a non-recoverable state?)

2011-07-31 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13073434#comment-13073434
 ] 

jirapos...@reviews.apache.org commented on HIVE-1850:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1233/
---

Review request for hive.


Summary
---

alter table set serdeproperties bypasses regexps checks (leaves table in a 
non-recoverable state?)


This addresses bug HIVE-1850.
https://issues.apache.org/jira/browse/HIVE-1850


Diffs
-

  trunk/ql/src/test/results/clientnegative/create_table_wrong_regex.q.out 
PRE-CREATION 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/MetaStoreUtils.java 
1151047 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Table.java 1151047 
  trunk/ql/src/test/queries/clientnegative/alter_table_wrong_regex.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/create_table_wrong_regex.q 
PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/alter_table_wrong_regex.q.out 
PRE-CREATION 

Diff: https://reviews.apache.org/r/1233/diff


Testing
---

All tests pass with the patch.


Thanks,

Amareshwari



 alter table set serdeproperties bypasses regexps checks (leaves table in a 
 non-recoverable state?)
 --

 Key: HIVE-1850
 URL: https://issues.apache.org/jira/browse/HIVE-1850
 Project: Hive
  Issue Type: Bug
  Components: Serializers/Deserializers
Affects Versions: 0.7.0
 Environment: Trunk build from a few days ago, but seen once before 
 with older version as well.
Reporter: Terje Marthinussen
Assignee: Amareshwari Sriramadasu
 Fix For: 0.8.0

 Attachments: patch-1850.txt


 {code}
 create table aa ( test STRING )
   ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe'
   WITH SERDEPROPERTIES (input.regex = [^\\](.*), output.format.string = 
 $1s);
 {code}
 This will fail. Great!
 {code}
 create table aa ( test STRING )
   ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe'
   WITH SERDEPROPERTIES (input.regex = (.*), output.format.string = 
 $1s);
 {code}
 Works, no problem there.
 {code}
 alter table aa set serdeproperties (input.regex = [^\\](.*), 
 output.format.string = $1s);
 {code}
 Wups... I can set that without any problems!
 {code}
 alter table aa set serdeproperties (input.regex = (.*), 
 output.format.string = $1s);
 FAILED: Hive Internal Error: java.util.regex.PatternSyntaxException(Unclosed 
 character class near index 7
 [^\](.*)
^)
 java.util.regex.PatternSyntaxException: Unclosed character class near index 7
 [^\](.*)
^
   at java.util.regex.Pattern.error(Pattern.java:1713)
   at java.util.regex.Pattern.clazz(Pattern.java:2254)
   at java.util.regex.Pattern.sequence(Pattern.java:1818)
   at java.util.regex.Pattern.expr(Pattern.java:1752)
   at java.util.regex.Pattern.compile(Pattern.java:1460)
   at java.util.regex.Pattern.init(Pattern.java:1133)
   at java.util.regex.Pattern.compile(Pattern.java:847)
   at 
 org.apache.hadoop.hive.contrib.serde2.RegexSerDe.initialize(RegexSerDe.java:101)
   at 
 org.apache.hadoop.hive.metastore.MetaStoreUtils.getDeserializer(MetaStoreUtils.java:199)
   at 
 org.apache.hadoop.hive.ql.metadata.Table.getDeserializer(Table.java:253)
   at org.apache.hadoop.hive.ql.metadata.Table.getCols(Table.java:484)
   at 
 org.apache.hadoop.hive.ql.metadata.Table.checkValidity(Table.java:161)
   at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:803)
   at 
 org.apache.hadoop.hive.ql.parse.DDLSemanticAnalyzer.analyzeAlterTableSerdeProps(DDLSemanticAnalyzer.java:558)
   at 
 org.apache.hadoop.hive.ql.parse.DDLSemanticAnalyzer.analyzeInternal(DDLSemanticAnalyzer.java:232)
   at 
 org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:238)
   at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:335)
   at org.apache.hadoop.hive.ql.Driver.run(Driver.java:686)
   at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:142)
   at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:216)
   at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:370)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 

[jira] [Commented] (HIVE-2326) Turn off bitmap indexing when map-side aggregation is turned off

2011-07-30 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13073271#comment-13073271
 ] 

jirapos...@reviews.apache.org commented on HIVE-2326:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1232/
---

Review request for hive, John Sichi and Ning Zhang.


Summary
---

sets the correct object inspector for PARTIAL2, COMPLETE UDAF modes


This addresses bug HIVE-2326.
https://issues.apache.org/jira/browse/HIVE-2326


Diffs
-

  ql/src/java/org/apache/hadoop/hive/ql/index/TableBasedIndexHandler.java 
02ab78c 
  ql/src/java/org/apache/hadoop/hive/ql/index/bitmap/BitmapIndexHandler.java 
61bbbf5 
  ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFEWAHBitmap.java 
0a3df09 

Diff: https://reviews.apache.org/r/1232/diff


Testing
---


Thanks,

Syed



 Turn off bitmap indexing when map-side aggregation is turned off
 

 Key: HIVE-2326
 URL: https://issues.apache.org/jira/browse/HIVE-2326
 Project: Hive
  Issue Type: Bug
Reporter: Syed S. Albiz
Assignee: Syed S. Albiz
 Attachments: HIVE-2326.1.patch


 Simply adding the CLUSTER BY clause on the ROW_OFFSET does not work with a 
 GROUP BY clause, causing a SemanticException when trying to compile the the 
 index builder task. Based on conversation with John Sichi, for now we will 
 just turn off this feature.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2303) files with control-A,B are not delimited correctly.

2011-07-29 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072785#comment-13072785
 ] 

jirapos...@reviews.apache.org commented on HIVE-2303:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1219/
---

Review request for hive.


Summary
---

files with control-A,B are not delimited correctly.


This addresses bug HIVE-2303.
https://issues.apache.org/jira/browse/HIVE-2303


Diffs
-

  trunk/data/files/in7.txt PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/PlanUtils.java 1151047 
  trunk/ql/src/test/queries/clientpositive/delimiter.q PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/combine2.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/delimiter.q.out PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/filter_join_breaktask.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/input23.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/input42.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/input_part7.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/input_part9.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/louter_join_ppr.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/outer_join_ppr.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/pcr.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/rand_partitionpruner1.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/rand_partitionpruner3.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/regexp_extract.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/router_join_ppr.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/sample10.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/sample6.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/sample8.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/sample9.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/transform_ppr1.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/transform_ppr2.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/udf_explode.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/udf_reflect.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/udtf_explode.q.out 1151047 
  trunk/ql/src/test/results/clientpositive/union_ppr.q.out 1151047 
  trunk/ql/src/test/results/compiler/plan/cast1.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/groupby2.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/groupby3.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/groupby4.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/groupby5.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/groupby6.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/input20.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/input8.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/input_part1.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/input_testxpath.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/input_testxpath2.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/join4.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/join5.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/join6.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/join7.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/join8.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/sample1.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/udf1.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/udf4.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/udf6.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/udf_case.q.xml 1151047 
  trunk/ql/src/test/results/compiler/plan/udf_when.q.xml 1151047 

Diff: https://reviews.apache.org/r/1219/diff


Testing
---

All tests passed with patch


Thanks,

Amareshwari



 files with control-A,B are not delimited correctly.
 ---

 Key: HIVE-2303
 URL: https://issues.apache.org/jira/browse/HIVE-2303
 Project: Hive
  Issue Type: Bug
Reporter: Amareshwari Sriramadasu
Assignee: Amareshwari Sriramadasu
 Attachments: patch-2303.txt


 The following is from one of our users:
  
 create external table impressions (imp string, msg string)
   row format delimited
 fields terminated by '\t'
 lines terminated by '\n'
   stored as textfile 
   location '/xxx';
  
 Some strings in my data contains Control-A, Control-B etc as internal 
 delimiters.  If I do a
  
 Select * from impressions limit 10;
  
 All fields were able to print correctly.  However if I do a
  
 Select * from 

[jira] [Commented] (HIVE-2305) UNION ALL on different types throws runtime exception

2011-07-29 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072961#comment-13072961
 ] 

jirapos...@reviews.apache.org commented on HIVE-2305:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1223/
---

Review request for hive.


Summary
---

UNION ALL on columns of different types is semantically allowed but throws a 
runtime exception due to bugs in getting the correct object inspector after the 
parent MR input file is change and in setting the TypeInfo for the union row 
schema.


This addresses bug hive-2305.
https://issues.apache.org/jira/browse/hive-2305


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/ColumnInfo.java 1151937 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FunctionRegistry.java 
1151937 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/MapOperator.java 1151937 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/UnionOperator.java 1151937 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 
1151937 
  trunk/ql/src/test/queries/clientpositive/union24.q PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/union24.q.out PRE-CREATION 
  trunk/ql/src/test/results/compiler/plan/union.q.xml 1151937 

Diff: https://reviews.apache.org/r/1223/diff


Testing
---

unit tests pass


Thanks,

Franklin



 UNION ALL on different types throws runtime exception
 -

 Key: HIVE-2305
 URL: https://issues.apache.org/jira/browse/HIVE-2305
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.7.1
Reporter: Franklin Hu
Assignee: Franklin Hu
 Attachments: hive-2305.1.patch, hive-2305.2.patch, hive-2305.3.patch


 Ex:
 SELECT * (SELECT 123 FROM ... UNION ALL SELECT '123' FROM ..) t;
 Unioning columns of different types currently throws runtime exceptions.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2080) Few code improvements in the ql and serde packages.

2011-07-28 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072196#comment-13072196
 ] 

jirapos...@reviews.apache.org commented on HIVE-2080:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1144/
---

(Updated 2011-07-28 06:11:18.101615)


Review request for hive.


Changes
---

Patch rebased


Summary
---

Few code improvements in the ql and serde packages.
1) Little performance Improvements 
2) Null checks to avoid NPEs
3) Effective varaible management.


This addresses bug HIVE-2080.
https://issues.apache.org/jira/browse/HIVE-2080


Diffs (updated)
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/CommonJoinOperator.java 
1151731 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FileSinkOperator.java 
1151731 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FilterOperator.java 1151731 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/GroupByOperator.java 1151731 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Operator.java 1151731 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/ReduceSinkOperator.java 
1151731 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/SelectOperator.java 1151731 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/TableScanOperator.java 
1151731 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/TaskFactory.java 1151731 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/UnionOperator.java 1151731 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ASTNode.java 1151731 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java 
1151731 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ParseContext.java 1151731 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 
1151731 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/dynamic_type/DynamicSerDeField.java
 1151731 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/dynamic_type/DynamicSerDeFieldType.java
 1151731 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/dynamic_type/DynamicSerDeFunction.java
 1151731 

Diff: https://reviews.apache.org/r/1144/diff


Testing
---

All unit test passed


Thanks,

chinna



 Few code improvements in the ql and serde packages.
 ---

 Key: HIVE-2080
 URL: https://issues.apache.org/jira/browse/HIVE-2080
 Project: Hive
  Issue Type: Bug
  Components: Query Processor, Serializers/Deserializers
Affects Versions: 0.7.0
 Environment: Hadoop 0.20.1, Hive0.7.0 and SUSE Linux Enterprise 
 Server 10 SP2 (i586) - Kernel 2.6.16.60-0.21-smp (5).
Reporter: Chinna Rao Lalam
Assignee: Chinna Rao Lalam
 Attachments: HIVE-2080.1.Patch, HIVE-2080.2.Patch, HIVE-2080.Patch


 Few code improvements in the ql and serde packages.
 1) Little performance Improvements 
 2) Null checks to avoid NPEs
 3) Effective varaible management.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2319) Calling alter_table after changing partition comment throws an exception

2011-07-28 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072215#comment-13072215
 ] 

jirapos...@reviews.apache.org commented on HIVE-2319:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1213/
---

Review request for hive and Paul Yang.


Summary
---

Altering a table's partition key comments raises an InvalidOperationException. 
The partition key name and type should not be mutable, but the comment should 
be able to get changed.


This addresses bug HIVE-2319.
https://issues.apache.org/jira/browse/HIVE-2319


Diffs
-

  
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveAlterHandler.java 
1151219 
  
trunk/metastore/src/test/org/apache/hadoop/hive/metastore/TestHiveMetaStore.java
 1151219 

Diff: https://reviews.apache.org/r/1213/diff


Testing
---

Added some test cases to HiveMetaStore that pass.


Thanks,

Sohan



 Calling alter_table after changing partition comment throws an exception
 

 Key: HIVE-2319
 URL: https://issues.apache.org/jira/browse/HIVE-2319
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Reporter: Sohan Jain

 Altering a table's partition key comments raises an 
 InvalidOperationException.  The partition key name and type should not be 
 mutable, but the comment should be able to get changed.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2319) Calling alter_table after changing partition comment throws an exception

2011-07-28 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072219#comment-13072219
 ] 

jirapos...@reviews.apache.org commented on HIVE-2319:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1213/
---

(Updated 2011-07-28 07:06:24.236980)


Review request for hive and Paul Yang.


Changes
---

Fixed some of the test case logic.


Summary
---

Altering a table's partition key comments raises an InvalidOperationException. 
The partition key name and type should not be mutable, but the comment should 
be able to get changed.


This addresses bug HIVE-2319.
https://issues.apache.org/jira/browse/HIVE-2319


Diffs (updated)
-

  
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveAlterHandler.java 
1151219 
  
trunk/metastore/src/test/org/apache/hadoop/hive/metastore/TestHiveMetaStore.java
 1151219 

Diff: https://reviews.apache.org/r/1213/diff


Testing
---

Added some test cases to HiveMetaStore that pass.


Thanks,

Sohan



 Calling alter_table after changing partition comment throws an exception
 

 Key: HIVE-2319
 URL: https://issues.apache.org/jira/browse/HIVE-2319
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Reporter: Sohan Jain

 Altering a table's partition key comments raises an 
 InvalidOperationException.  The partition key name and type should not be 
 mutable, but the comment should be able to get changed.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2272) add TIMESTAMP data type

2011-07-28 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072494#comment-13072494
 ] 

jirapos...@reviews.apache.org commented on HIVE-2272:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1135/
---

(Updated 2011-07-28 19:17:59.868991)


Review request for hive.


Changes
---

Decided to store timestamps as UTC on disk and add offsets (localize) if user 
needs through UDF. This simplifies use cases (compared to MySQL) where users 
from different time zones querying the same data would receive different 
results.

Will file a follow up JIRA ticket to add table level timezone defaults. This 
would allow a user to import existing data that is already offset from UTC and 
write new data to the table with that offset.

Summary:

-Added config var for default timezone
-Added GenericUDF for converting from UTC to localized timezone (either 
default, or supplied from list of Java timezones)
Ex: utc_to_localized_timestamp(timestamp) OR 
utc_to_localized_timestamp(timestamp, America/Chicago)
-Fixed DateDiff bug I introduced


Summary
---

Adds TIMESTAMP type to serde2 with both string (LazySimple) and binary 
(LazyBinary) serialization.
Supports SQL style jdbc timestamps of the format with nanosecond precision
-MM-DD HH:MM:SS[.fff...]


This addresses bug HIVE-2272.
https://issues.apache.org/jira/browse/HIVE-2272


Diffs (updated)
-

  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1151571 
  trunk/conf/hive-default.xml 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FunctionRegistry.java 
1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java 
1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java 
1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ErrorMsg.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/TypeCheckProcFactory.java 
1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDate.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDateAdd.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDateDiff.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDateSub.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDayOfMonth.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFHour.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFMinute.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFMonth.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFSecond.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToBoolean.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToByte.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToDouble.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToFloat.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToInteger.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToLong.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToShort.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToString.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFUnixTimeStamp.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFWeekOfYear.java 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFYear.java 1151571 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFAverage.java 
1151571 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFContextNGrams.java
 1151571 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFCorrelation.java
 1151571 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFCovariance.java
 1151571 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFCovarianceSample.java
 1151571 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFHistogramNumeric.java
 1151571 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFPercentileApprox.java
 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFStd.java 
1151571 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFStdSample.java
 1151571 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFSum.java 
1151571 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFVariance.java
 1151571 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFVarianceSample.java
 1151571 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFnGrams.java 
1151571 
  

[jira] [Commented] (HIVE-1694) Accelerate GROUP BY execution using indexes

2011-07-28 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072550#comment-13072550
 ] 

jirapos...@reviews.apache.org commented on HIVE-1694:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1194/#review1212
---



ql/src/java/org/apache/hadoop/hive/ql/index/AggregateIndexHandler.java
https://reviews.apache.org/r/1194/#comment2711

Please run ant checkstyle and fix all the formatting discrepancies it 
reports for your new files.




ql/src/java/org/apache/hadoop/hive/ql/index/AggregateIndexHandler.java
https://reviews.apache.org/r/1194/#comment2695

Don't you need to reuse the compact implementation here so that the index 
can be used for WHERE (not just GROUP BY)?




ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java
https://reviews.apache.org/r/1194/#comment2696

This method is redundant now.



ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyCtx.java
https://reviews.apache.org/r/1194/#comment2698

I can't think of a case where it would be worse.



ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyProcFactory.java
https://reviews.apache.org/r/1194/#comment2699

Actually group-by is now preserved in all cases.



ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteGBUsingIndex.java
https://reviews.apache.org/r/1194/#comment2700

Please use HTML bullet syntax for Javadoc (otherwise it all gets run 
together into one line when rendered).




ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteGBUsingIndex.java
https://reviews.apache.org/r/1194/#comment2701

indentation



ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteQueryUsingAggregateIndex.java
https://reviews.apache.org/r/1194/#comment2703

Shouldn't this be BIGINT?

Also, I think you're supposed to use a TypeInfoFactory for this purpose.



ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteQueryUsingAggregateIndex.java
https://reviews.apache.org/r/1194/#comment2702

indentation



ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteQueryUsingAggregateIndex.java
https://reviews.apache.org/r/1194/#comment2704

typo:  Repace



ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java
https://reviews.apache.org/r/1194/#comment2707

Not sure why this new constructor is needed...after using it, all you do is 
get the table out of it.



ql/src/test/queries/clientpositive/ql_rewrite_gbtoidx.q
https://reviews.apache.org/r/1194/#comment2709

This should *not* be using the index, since the index is built on 
count(l_shipdate), and l_shipdate may contain nulls, whereas the query is 
referencing count(1), which is insensitive to nulls.



ql/src/test/queries/clientpositive/ql_rewrite_gbtoidx.q
https://reviews.apache.org/r/1194/#comment2710

Need additional tests to verify all the cases where the optimization should 
*not* be used:

* when configuration disables it
* when index partitions do not cover table partitions (I still don't see 
the code for this case)
* ... all the other conditions checked for in the code ...



- John


On 2011-07-26 14:44:01, Prajakta Kalmegh wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1194/
bq.  ---
bq.  
bq.  (Updated 2011-07-26 14:44:01)
bq.  
bq.  
bq.  Review request for hive and John Sichi.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  This patch has defined a new AggregateIndexHandler which is used to 
optimize the query plan for groupby queries. 
bq.  
bq.  
bq.  This addresses bug HIVE-1694.
bq.  https://issues.apache.org/jira/browse/HIVE-1694
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.common/src/java/org/apache/hadoop/hive/conf/HiveConf.java b46976f 
bq.ql/src/java/org/apache/hadoop/hive/ql/index/AggregateIndexHandler.java 
PRE-CREATION 
bq.ql/src/java/org/apache/hadoop/hive/ql/index/HiveIndex.java 591c9ff 
bq.ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java 2ca63b3 
bq.ql/src/java/org/apache/hadoop/hive/ql/optimizer/Optimizer.java 590d69a 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyCtx.java 
PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyProcFactory.java
 PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteGBUsingIndex.java 
PRE-CREATION 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteParseContextGenerator.java
 PRE-CREATION 
bq.

[jira] [Commented] (HIVE-2272) add TIMESTAMP data type

2011-07-28 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072553#comment-13072553
 ] 

jirapos...@reviews.apache.org commented on HIVE-2272:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1135/
---

(Updated 2011-07-28 21:59:38.146393)


Review request for hive, Yongqiang He, Ning Zhang, and Siying Dong.


Changes
---

Add corresponding UDF for converting to UTC from a localized timestamp


Summary
---

Adds TIMESTAMP type to serde2 with both string (LazySimple) and binary 
(LazyBinary) serialization.
Supports SQL style jdbc timestamps of the format with nanosecond precision
-MM-DD HH:MM:SS[.fff...]


This addresses bug HIVE-2272.
https://issues.apache.org/jira/browse/HIVE-2272


Diffs (updated)
-

  trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1152003 
  trunk/conf/hive-default.xml 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FunctionRegistry.java 
1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java 
1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java 
1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ErrorMsg.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/TypeCheckProcFactory.java 
1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDate.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDateAdd.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDateDiff.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDateSub.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDayOfMonth.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFHour.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFMinute.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFMonth.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFSecond.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToBoolean.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToByte.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToDouble.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToFloat.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToInteger.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToLong.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToShort.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToString.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFUnixTimeStamp.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFWeekOfYear.java 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFYear.java 1152003 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFAverage.java 
1152003 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFContextNGrams.java
 1152003 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFCorrelation.java
 1152003 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFCovariance.java
 1152003 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFCovarianceSample.java
 1152003 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFHistogramNumeric.java
 1152003 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFPercentileApprox.java
 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFStd.java 
1152003 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFStdSample.java
 1152003 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFSum.java 
1152003 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFVariance.java
 1152003 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFVarianceSample.java
 1152003 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFnGrams.java 
1152003 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFFromUtcTimestamp.java
 PRE-CREATION 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFTimestamp.java
 PRE-CREATION 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFToUtcTimestamp.java
 PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/invalid_t_create3.q 1152003 
  trunk/ql/src/test/queries/clientpositive/timestamp_1.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/timestamp_2.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/timestamp_3.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/timestamp_comparison.q PRE-CREATION 
  

[jira] [Commented] (HIVE-2020) Create a separate namespace for Hive variables

2011-07-27 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072011#comment-13072011
 ] 

jirapos...@reviews.apache.org commented on HIVE-2020:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1204/
---

Review request for hive.


Summary
---

Create a separate option for Hive variable


This addresses bug HIVE-2020.
https://issues.apache.org/jira/browse/HIVE-2020


Diffs
-

  cli/src/java/org/apache/hadoop/hive/cli/CliDriver.java 9fa7bc6 
  cli/src/java/org/apache/hadoop/hive/cli/DefaultPreprocessor.java PRE-CREATION 
  cli/src/java/org/apache/hadoop/hive/cli/KeyValue.java PRE-CREATION 
  cli/src/java/org/apache/hadoop/hive/cli/OptionsProcessor.java 90084ed 
  cli/src/java/org/apache/hadoop/hive/cli/Preprocessor.java PRE-CREATION 

Diff: https://reviews.apache.org/r/1204/diff


Testing
---


Thanks,

Vaibhav



 Create a separate namespace for Hive variables
 --

 Key: HIVE-2020
 URL: https://issues.apache.org/jira/browse/HIVE-2020
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Reporter: Carl Steinbach
Assignee: Vaibhav Aggarwal

 Support for variable substitution was added in HIVE-1096. However, variable 
 substitution was implemented by reusing the HiveConf namespace, so there is 
 no separation between Hive configuration properties and Hive variables.
 This ticket encompasses the following enhancements:
 * Create a separate namespace for managing Hive variables.
 * Add support for setting variables on the command line via '-hivevar x=y'
 * Add support for setting variables through the CLI via 'var x=y'
 * Add support for referencing variables in statements using either 
 '${hivevar:var_name}' or '${var_name}'
 * Provide a means for differentiating between hiveconf, hivevar, system, and 
 environment properties in the output of 'set -v'

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2318) Support multiple file systems

2011-07-27 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072036#comment-13072036
 ] 

jirapos...@reviews.apache.org commented on HIVE-2318:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1210/
---

Review request for hive.


Summary
---

Support multiple file systems


This addresses bug HIVE-2318.
https://issues.apache.org/jira/browse/HIVE-2318


Diffs
-

  ql/src/java/org/apache/hadoop/hive/ql/Context.java 1a1ffc5 
  ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java c031f40 
  ql/src/java/org/apache/hadoop/hive/ql/exec/FileSinkOperator.java c9e61a1 
  ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java c6f6755 
  ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java a57f9cf 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/GenMRFileSink1.java 092a5cd 
  ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java 337a8eb 
  ql/src/java/org/apache/hadoop/hive/ql/parse/ImportSemanticAnalyzer.java 
3001575 
  ql/src/java/org/apache/hadoop/hive/ql/parse/LoadSemanticAnalyzer.java 6f311c6 
  ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java e1febc3 
  ql/src/java/org/apache/hadoop/hive/ql/plan/ConditionalResolverMergeFiles.java 
5c09789 
  ql/src/java/org/apache/hadoop/hive/ql/plan/DynamicPartitionCtx.java 29cfe9d 

Diff: https://reviews.apache.org/r/1210/diff


Testing
---


Thanks,

Vaibhav



 Support multiple file systems
 -

 Key: HIVE-2318
 URL: https://issues.apache.org/jira/browse/HIVE-2318
 Project: Hive
  Issue Type: New Feature
Reporter: Vaibhav Aggarwal
Assignee: Vaibhav Aggarwal

 Currently some of the Hive tasks like MoveTask, ConditionalMergeResolver 
 assume that the data is being copied or moved on the same file system.
 These operators file if the source table is in one filesystem (like HDFS) and 
 destination table is in another file system (like s3).
 This patch aims at:
 1. Support moving data between different file systems.
 2. Add support for file systems which do not support 'move' operation like s3.
 3. Remove redundant operations like moving data from and to the same location.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1694) Accelerate GROUP BY execution using indexes

2011-07-26 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13071130#comment-13071130
 ] 

jirapos...@reviews.apache.org commented on HIVE-1694:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1194/
---

Review request for hive and John Sichi.


Summary
---

This patch has defined a new AggregateIndexHandler which is used to optimize 
the query plan for groupby queries. 


This addresses bug HIVE-1694.
https://issues.apache.org/jira/browse/HIVE-1694


Diffs
-

  common/src/java/org/apache/hadoop/hive/conf/HiveConf.java b46976f 
  ql/src/java/org/apache/hadoop/hive/ql/index/AggregateIndexHandler.java 
PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/index/HiveIndex.java 591c9ff 
  ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java 2ca63b3 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/Optimizer.java 590d69a 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyCtx.java 
PRE-CREATION 
  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteCanApplyProcFactory.java
 PRE-CREATION 
  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteGBUsingIndex.java 
PRE-CREATION 
  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteParseContextGenerator.java
 PRE-CREATION 
  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteQueryUsingAggregateIndex.java
 PRE-CREATION 
  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/index/RewriteQueryUsingAggregateIndexCtx.java
 PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java 77a6dc6 
  ql/src/test/queries/clientpositive/ql_rewrite_gbtoidx.q PRE-CREATION 
  ql/src/test/results/clientpositive/ql_rewrite_gbtoidx.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1194/diff


Testing
---


Thanks,

Prajakta



 Accelerate GROUP BY execution using indexes
 ---

 Key: HIVE-1694
 URL: https://issues.apache.org/jira/browse/HIVE-1694
 Project: Hive
  Issue Type: New Feature
  Components: Indexing, Query Processor
Affects Versions: 0.7.0
Reporter: Nikhil Deshpande
Assignee: Prajakta Kalmegh
 Attachments: HIVE-1694.1.patch.txt, HIVE-1694.2.patch.txt, 
 HIVE-1694.3.patch.txt, HIVE-1694_2010-10-28.diff, demo_q1.hql, demo_q2.hql


 The index building patch (Hive-417) is checked into trunk, this JIRA issue 
 tracks supporting indexes in Hive compiler  execution engine for SELECT 
 queries.
 This is in ref. to John's comment at
 https://issues.apache.org/jira/browse/HIVE-417?focusedCommentId=12884869page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#action_12884869
 on creating separate JIRA issue for tracking index usage in optimizer  query 
 execution.
 The aim of this effort is to use indexes to accelerate query execution (for 
 certain class of queries). E.g.
 - Filters and range scans (already being worked on by He Yongqiang as part of 
 HIVE-417?)
 - Joins (index based joins)
 - Group By, Order By and other misc cases
 The proposal is multi-step:
 1. Building index based operators, compiler and execution engine changes
 2. Optimizer enhancements (e.g. cost-based optimizer to compare and choose 
 between index scans, full table scans etc.)
 This JIRA initially focuses on the first step. This JIRA is expected to hold 
 the information about index based plans  operator implementations for above 
 mentioned cases. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2286) ClassCastException when building index with security.authorization turned on

2011-07-26 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13071315#comment-13071315
 ] 

jirapos...@reviews.apache.org commented on HIVE-2286:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1137/#review1188
---



ql/src/java/org/apache/hadoop/hive/ql/Driver.java
https://reviews.apache.org/r/1137/#comment2597

java.util.Stack is deprecated since it adds unnecessary synchronization.  
We don't have a replacement yet (HIVE-1626) so we've just been using ArrayList.

Also, instead of typecasting to/from Object, use a static inner class for 
holding the record of state variables.


- John


On 2011-07-25 23:03:22, Syed Albiz wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1137/
bq.  ---
bq.  
bq.  (Updated 2011-07-25 23:03:22)
bq.  
bq.  
bq.  Review request for hive, John Sichi and Ning Zhang.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Save the original HiveOperation/commandType when we generate the index 
builder task and restore it after we're done generating the task so that the 
authorization checks make the right decision when deciding what to do.
bq.  
bq.  
bq.  This addresses bug HIVE-2286.
bq.  https://issues.apache.org/jira/browse/HIVE-2286
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.ql/src/java/org/apache/hadoop/hive/ql/Driver.java b278ffe 
bq.ql/src/test/queries/clientpositive/index_auth.q PRE-CREATION 
bq.ql/src/test/results/clientnegative/index_compact_entry_limit.q.out 
fcb2673 
bq.ql/src/test/results/clientnegative/index_compact_size_limit.q.out 
fcb2673 
bq.ql/src/test/results/clientpositive/index_auth.q.out PRE-CREATION 
bq.ql/src/test/results/clientpositive/index_auto.q.out 8d65f98 
bq.ql/src/test/results/clientpositive/index_auto_file_format.q.out 194b35e 
bq.ql/src/test/results/clientpositive/index_auto_multiple.q.out 6b81fc3 
bq.ql/src/test/results/clientpositive/index_auto_partitioned.q.out b0635db 
bq.ql/src/test/results/clientpositive/index_auto_unused.q.out 3631bbc 
bq.ql/src/test/results/clientpositive/index_bitmap.q.out 8f41ce3 
bq.ql/src/test/results/clientpositive/index_bitmap1.q.out 9f638f5 
bq.ql/src/test/results/clientpositive/index_bitmap2.q.out e901477 
bq.ql/src/test/results/clientpositive/index_bitmap3.q.out 116c973 
bq.ql/src/test/results/clientpositive/index_bitmap_auto.q.out cc9d91e 
bq.ql/src/test/results/clientpositive/index_bitmap_auto_partitioned.q.out 
9003eb4 
bq.ql/src/test/results/clientpositive/index_bitmap_rc.q.out 9bd3c98 
bq.ql/src/test/results/clientpositive/index_compact.q.out c339ec9 
bq.ql/src/test/results/clientpositive/index_compact_1.q.out 34ba3ca 
bq.ql/src/test/results/clientpositive/index_compact_2.q.out e8ce238 
bq.ql/src/test/results/clientpositive/index_compact_3.q.out d39556d 
bq.ql/src/test/results/clientpositive/index_creation.q.out 532f07e 
bq.  
bq.  Diff: https://reviews.apache.org/r/1137/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Added new testcase to TestCliDriver: index_auth.q
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Syed
bq.  
bq.



 ClassCastException when building index with security.authorization turned on
 

 Key: HIVE-2286
 URL: https://issues.apache.org/jira/browse/HIVE-2286
 Project: Hive
  Issue Type: Bug
Reporter: Syed S. Albiz
Assignee: Syed S. Albiz
 Attachments: HIVE-2286.1.patch, HIVE-2286.2.patch


 When trying to build an index with authorization checks turned on, hive 
 issues the following ClassCastException:
 org.apache.hadoop.hive.ql.parse.DDLSemanticAnalyzer cannot be cast to
 org.apache.hadoop.hive.ql.parse.SemanticAnalyzer
  at
 org.apache.hadoop.hive.ql.Driver.doAuthorization(Driver.java:540)
  at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:431)
  at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:335)
  at org.apache.hadoop.hive.ql.Driver.run(Driver.java:848)
  at
 org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:224)
  at
 org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:358)
  at
 org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:293)
  at
 org.apache.hadoop.hive.cli.CliDriver.processReader(CliDriver.java:385)
  at
 org.apache.hadoop.hive.cli.CliDriver.processFile(CliDriver.java:392)
  at
 

[jira] [Commented] (HIVE-2272) add TIMESTAMP data type

2011-07-26 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13071368#comment-13071368
 ] 

jirapos...@reviews.apache.org commented on HIVE-2272:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1135/
---

(Updated 2011-07-26 21:11:35.218104)


Review request for hive.


Changes
---

Rebase


Summary
---

Adds TIMESTAMP type to serde2 with both string (LazySimple) and binary 
(LazyBinary) serialization.
Supports SQL style jdbc timestamps of the format with nanosecond precision
-MM-DD HH:MM:SS[.fff...]


This addresses bug HIVE-2272.
https://issues.apache.org/jira/browse/HIVE-2272


Diffs (updated)
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FunctionRegistry.java 
1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java 
1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ErrorMsg.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/TypeCheckProcFactory.java 
1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDate.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDateAdd.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDateDiff.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDateSub.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFDayOfMonth.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFHour.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFMinute.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFMonth.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFSecond.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToBoolean.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToByte.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToDouble.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToFloat.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToInteger.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToLong.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToShort.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToString.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFUnixTimeStamp.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFWeekOfYear.java 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFYear.java 1151189 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFAverage.java 
1151189 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFContextNGrams.java
 1151189 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFCorrelation.java
 1151189 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFCovariance.java
 1151189 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFCovarianceSample.java
 1151189 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFHistogramNumeric.java
 1151189 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFPercentileApprox.java
 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFStd.java 
1151189 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFStdSample.java
 1151189 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFSum.java 
1151189 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFVariance.java
 1151189 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFVarianceSample.java
 1151189 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFnGrams.java 
1151189 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFTimestamp.java
 PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/invalid_t_create3.q 1151189 
  trunk/ql/src/test/queries/clientpositive/timestamp_1.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/timestamp_2.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/timestamp_3.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/timestamp_comparison.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/timestamp_udf.q PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/invalid_create_tbl1.q.out 1151189 
  trunk/ql/src/test/results/clientnegative/invalid_t_alter1.q.out 1151189 
  trunk/ql/src/test/results/clientnegative/invalid_t_alter2.q.out 1151189 
  trunk/ql/src/test/results/clientnegative/invalid_t_create1.q.out 1151189 
  trunk/ql/src/test/results/clientnegative/invalid_t_create2.q.out 1151189 
  trunk/ql/src/test/results/clientnegative/invalid_t_transform.q.out 

[jira] [Commented] (HIVE-2286) ClassCastException when building index with security.authorization turned on

2011-07-26 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13071428#comment-13071428
 ] 

jirapos...@reviews.apache.org commented on HIVE-2286:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1137/
---

(Updated 2011-07-26 23:28:13.279889)


Review request for hive, John Sichi and Ning Zhang.


Changes
---

refactor patch to dump query state into an inner class rather than a Stack.


Summary
---

Save the original HiveOperation/commandType when we generate the index builder 
task and restore it after we're done generating the task so that the 
authorization checks make the right decision when deciding what to do.


This addresses bug HIVE-2286.
https://issues.apache.org/jira/browse/HIVE-2286


Diffs (updated)
-

  ql/src/java/org/apache/hadoop/hive/ql/Driver.java b278ffe 
  ql/src/test/queries/clientpositive/index_auth.q PRE-CREATION 
  ql/src/test/results/clientnegative/addpart1.q.out f4da8f1 
  ql/src/test/results/clientnegative/alter_concatenate_indexed_table.q.out 
8ae1f9d 
  ql/src/test/results/clientnegative/alter_non_native.q.out 8be2c3b 
  ql/src/test/results/clientnegative/alter_view_failure.q.out 9954b66 
  ql/src/test/results/clientnegative/alter_view_failure2.q.out 5915b4f 
  ql/src/test/results/clientnegative/alter_view_failure4.q.out 97d6b18 
  ql/src/test/results/clientnegative/alter_view_failure5.q.out 2291ca6 
  ql/src/test/results/clientnegative/alter_view_failure6.q.out 03b2bc3 
  ql/src/test/results/clientnegative/alter_view_failure7.q.out d0f958c 
  ql/src/test/results/clientnegative/alter_view_failure8.q.out 4420c57 
  ql/src/test/results/clientnegative/alter_view_failure9.q.out 67306d3 
  ql/src/test/results/clientnegative/altern1.q.out c52ca04 
  ql/src/test/results/clientnegative/analyze_view.q.out 99def40 
  ql/src/test/results/clientnegative/archive1.q.out 0927686 
  ql/src/test/results/clientnegative/archive2.q.out 25baefa 
  ql/src/test/results/clientnegative/authorization_fail_1.q.out ab1abe2 
  ql/src/test/results/clientnegative/authorization_fail_3.q.out cd7ceb1 
  ql/src/test/results/clientnegative/authorization_fail_4.q.out b05f9b7 
  ql/src/test/results/clientnegative/authorization_fail_5.q.out f5bdc6a 
  ql/src/test/results/clientnegative/authorization_fail_7.q.out a52fd1c 
  ql/src/test/results/clientnegative/authorization_part.q.out 625d60c 
  ql/src/test/results/clientnegative/column_rename1.q.out 7c30e4e 
  ql/src/test/results/clientnegative/column_rename2.q.out 0ca78f9 
  ql/src/test/results/clientnegative/column_rename4.q.out f14fd48 
  ql/src/test/results/clientnegative/create_or_replace_view1.q.out 97bfa21 
  ql/src/test/results/clientnegative/create_or_replace_view2.q.out 8edac34 
  ql/src/test/results/clientnegative/create_or_replace_view4.q.out 89dd5f5 
  ql/src/test/results/clientnegative/create_or_replace_view5.q.out a0aed59 
  ql/src/test/results/clientnegative/create_or_replace_view6.q.out df44e33 
  ql/src/test/results/clientnegative/create_or_replace_view7.q.out 9356dcc 
  ql/src/test/results/clientnegative/create_or_replace_view8.q.out 4161659 
  ql/src/test/results/clientnegative/create_view_failure1.q.out 43cded4 
  ql/src/test/results/clientnegative/create_view_failure2.q.out a038067 
  ql/src/test/results/clientnegative/create_view_failure4.q.out f968569 
  ql/src/test/results/clientnegative/database_create_already_exists.q.out 
08c04f9 
  ql/src/test/results/clientnegative/database_create_invalid_name.q.out 1e58089 
  ql/src/test/results/clientnegative/database_drop_does_not_exist.q.out 80c00cd 
  ql/src/test/results/clientnegative/database_drop_not_empty.q.out baa8f37 
  ql/src/test/results/clientnegative/database_drop_not_empty_restrict.q.out 
b297a99 
  ql/src/test/results/clientnegative/database_switch_does_not_exist.q.out 
8b5674d 
  ql/src/test/results/clientnegative/drop_partition_failure.q.out 8a7c63d 
  ql/src/test/results/clientnegative/drop_table_failure2.q.out 9b63102 
  ql/src/test/results/clientnegative/drop_view_failure1.q.out 61ec927 
  ql/src/test/results/clientnegative/dyn_part3.q.out 5f4df65 
  ql/src/test/results/clientnegative/exim_00_unsupported_schema.q.out 814b742 
  ql/src/test/results/clientnegative/exim_01_nonpart_over_loaded.q.out 0351bc1 
  ql/src/test/results/clientnegative/exim_02_all_part_over_overlap.q.out 
d40ff27 
  ql/src/test/results/clientnegative/exim_03_nonpart_noncompat_colschema.q.out 
adff0f8 
  ql/src/test/results/clientnegative/exim_04_nonpart_noncompat_colnumber.q.out 
b84e954 
  ql/src/test/results/clientnegative/exim_05_nonpart_noncompat_coltype.q.out 
96f8452 
  ql/src/test/results/clientnegative/exim_06_nonpart_noncompat_storage.q.out 
25deaa3 
  

[jira] [Commented] (HIVE-2246) Dedupe tables' column schemas from partitions in the metastore db

2011-07-25 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13070337#comment-13070337
 ] 

jirapos...@reviews.apache.org commented on HIVE-2246:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1183/#review1176
---



trunk/metastore/scripts/upgrade/mysql/008-HIVE-2246.mysql.sql
https://reviews.apache.org/r/1183/#comment2467

is the CHARSET (latin1) the same as SDS? This will require the user's 
comments to be in latin1 which prevents UTF chars.



trunk/metastore/scripts/upgrade/mysql/008-HIVE-2246.mysql.sql
https://reviews.apache.org/r/1183/#comment2466

can you also add migration script for derby? we support derby as a default 
metastore RDBMS as well. 



trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java
https://reviews.apache.org/r/1183/#comment2468

here do you check if the 'alter table' command changes the schema (columns 
definition)? If it just set a table property, then you don't need to create a 
new ColumnDescriptor right?

Also if a table's schema got changed, a new CD will be created, but the old 
partition will still have the old CDs. When we query the old partition, do we 
use the old partitons's CD or the table's CD? 

Also in the above case, when you run 'desc table partition 
old_partition', do you return the old partition's CD or the table's CD? 


- Ning


On 2011-07-22 05:30:29, Sohan Jain wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1183/
bq.  ---
bq.  
bq.  (Updated 2011-07-22 05:30:29)
bq.  
bq.  
bq.  Review request for hive, Ning Zhang and Paul Yang.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  This patch tries to make minimal changes to the API while keeping 
migration short and somewhat easy to revert.
bq.  
bq.  The new schema can be described as follows:
bq.  - CDS is a table corresponding to Column Descriptor objects.  Currently, 
it only stores a CD_ID.
bq.  - COLUMNS_V2 is a table corresponding to MFieldSchema objects, or columns. 
 A Column Descriptor holds a list of columns.  COLUMNS_V2 has a foreign key to 
the CD_ID to which it belongs.
bq.  - SDS was modified to reference a Column Descriptor. So SDS now has a 
foreign key to a CD_ID which describes its columns.
bq.  
bq.  During migration, we create Column Descriptors for tables in a 
straightforward manner: their columns are now just wrapped inside a column 
descriptor.  The SDS of partitions use their parent table's column descriptor, 
since currently a partition and its table share the same list of columns.
bq.  
bq.  When altering or adding a partition, give it it's parent table's column 
descriptor IF the columns they describe are the same.  Otherwise, create a new 
column descriptor for its columns.
bq.  
bq.  When adding or altering a table, create a new column descriptor every time.
bq.  
bq.  Whenever you drop a storage descriptor (e.g, when dropping tables or 
partitions), check to see if the related column descriptor has any other 
references in the table.  That is, check to see if any other storage 
descriptors point to that column descriptor.  If none do, then delete that 
column descriptor.  This check is in place so we don't have unreferenced column 
descriptors and columns hanging around after schema evolution for tables.
bq.  
bq.  
bq.  This addresses bug HIVE-2246.
bq.  https://issues.apache.org/jira/browse/HIVE-2246
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/metastore/scripts/upgrade/mysql/008-HIVE-2246.mysql.sql 
PRE-CREATION 
bq.
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java 
1148945 
bq.
trunk/metastore/src/model/org/apache/hadoop/hive/metastore/model/MColumnDescriptor.java
 PRE-CREATION 
bq.
trunk/metastore/src/model/org/apache/hadoop/hive/metastore/model/MStorageDescriptor.java
 1148945 
bq.trunk/metastore/src/model/package.jdo 1148945 
bq.  
bq.  Diff: https://reviews.apache.org/r/1183/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Passes facebook's regression testing and all existing test cases.  In one 
instance, before migration, the overhead involved with storage descriptors and 
columns was ~11 GB.  After migration, the overhead was ~1.5 GB.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Sohan
bq.  
bq.



 Dedupe tables' column schemas from partitions in the metastore db
 -

 Key: HIVE-2246
 URL: https://issues.apache.org/jira/browse/HIVE-2246
 Project: Hive
  Issue Type: Improvement
  

[jira] [Commented] (HIVE-2299) Optimize Hive query startup time for multiple partitions

2011-07-25 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13070768#comment-13070768
 ] 

jirapos...@reviews.apache.org commented on HIVE-2299:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1188/
---

Review request for hive.


Summary
---

Review request for HIVE-2299.


This addresses bug HIVE-2299.
https://issues.apache.org/jira/browse/HIVE-2299


Diffs
-

  ql/src/java/org/apache/hadoop/hive/ql/io/CombineHiveInputFormat.java 421140f 

Diff: https://reviews.apache.org/r/1188/diff


Testing
---


Thanks,

Carl



 Optimize Hive query startup time for multiple partitions
 

 Key: HIVE-2299
 URL: https://issues.apache.org/jira/browse/HIVE-2299
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Reporter: Vaibhav Aggarwal
Assignee: Vaibhav Aggarwal
 Attachments: HIVE-2299.patch


 Added an optimization to the way input splits are computed.
 Reduced an O(n^2) operation to O n operation.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2299) Optimize Hive query startup time for multiple partitions

2011-07-25 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13070777#comment-13070777
 ] 

jirapos...@reviews.apache.org commented on HIVE-2299:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1188/#review1180
---

Ship it!


+1. Will commit if tests pass.

- Carl


On 2011-07-25 21:22:09, Carl Steinbach wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1188/
bq.  ---
bq.  
bq.  (Updated 2011-07-25 21:22:09)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Review request for HIVE-2299.
bq.  
bq.  
bq.  This addresses bug HIVE-2299.
bq.  https://issues.apache.org/jira/browse/HIVE-2299
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.ql/src/java/org/apache/hadoop/hive/ql/io/CombineHiveInputFormat.java 
421140f 
bq.  
bq.  Diff: https://reviews.apache.org/r/1188/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Carl
bq.  
bq.



 Optimize Hive query startup time for multiple partitions
 

 Key: HIVE-2299
 URL: https://issues.apache.org/jira/browse/HIVE-2299
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Reporter: Vaibhav Aggarwal
Assignee: Vaibhav Aggarwal
 Attachments: HIVE-2299.patch


 Added an optimization to the way input splits are computed.
 Reduced an O(n^2) operation to O n operation.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2298) Fix UDAFPercentile to tolerate null percentiles

2011-07-25 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13070803#comment-13070803
 ] 

jirapos...@reviews.apache.org commented on HIVE-2298:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1189/
---

Review request for hive.


Summary
---

Review request for HIVE-2298.


This addresses bug HIVE-2298.
https://issues.apache.org/jira/browse/HIVE-2298


Diffs
-

  ql/src/java/org/apache/hadoop/hive/ql/udf/UDAFPercentile.java 92db544 

Diff: https://reviews.apache.org/r/1189/diff


Testing
---


Thanks,

Carl



 Fix UDAFPercentile to tolerate null percentiles
 ---

 Key: HIVE-2298
 URL: https://issues.apache.org/jira/browse/HIVE-2298
 Project: Hive
  Issue Type: Bug
  Components: UDF
Affects Versions: 0.7.0
Reporter: Vaibhav Aggarwal
Assignee: Vaibhav Aggarwal
 Attachments: HIVE-2298.patch


 UDAFPercentile when passed null percentile list will throw a null pointer 
 exception.
 Submitting a small fix for that.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2298) Fix UDAFPercentile to tolerate null percentiles

2011-07-25 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13070807#comment-13070807
 ] 

jirapos...@reviews.apache.org commented on HIVE-2298:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1189/#review1181
---



ql/src/java/org/apache/hadoop/hive/ql/udf/UDAFPercentile.java
https://reviews.apache.org/r/1189/#comment2483

Please fix the following checkstyle violations:

Line 238: File contains tab characters (this is the first instance).
Line 240: is longer than 100 characters.
Line 245: '}' should be on the same line.


- Carl


On 2011-07-25 21:53:12, Carl Steinbach wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1189/
bq.  ---
bq.  
bq.  (Updated 2011-07-25 21:53:12)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Review request for HIVE-2298.
bq.  
bq.  
bq.  This addresses bug HIVE-2298.
bq.  https://issues.apache.org/jira/browse/HIVE-2298
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.ql/src/java/org/apache/hadoop/hive/ql/udf/UDAFPercentile.java 92db544 
bq.  
bq.  Diff: https://reviews.apache.org/r/1189/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Carl
bq.  
bq.



 Fix UDAFPercentile to tolerate null percentiles
 ---

 Key: HIVE-2298
 URL: https://issues.apache.org/jira/browse/HIVE-2298
 Project: Hive
  Issue Type: Bug
  Components: UDF
Affects Versions: 0.7.0
Reporter: Vaibhav Aggarwal
Assignee: Vaibhav Aggarwal
 Attachments: HIVE-2298.patch


 UDAFPercentile when passed null percentile list will throw a null pointer 
 exception.
 Submitting a small fix for that.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2286) ClassCastException when building index with security.authorization turned on

2011-07-25 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13070847#comment-13070847
 ] 

jirapos...@reviews.apache.org commented on HIVE-2286:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1137/
---

(Updated 2011-07-25 23:03:22.871042)


Review request for hive, John Sichi and Ning Zhang.


Changes
---

Addressed comments, still need to regenerate a lot of testcase output since 
this will change the prehook/posthook messages for a lot of testcases.


Summary
---

Save the original HiveOperation/commandType when we generate the index builder 
task and restore it after we're done generating the task so that the 
authorization checks make the right decision when deciding what to do.


This addresses bug HIVE-2286.
https://issues.apache.org/jira/browse/HIVE-2286


Diffs (updated)
-

  ql/src/java/org/apache/hadoop/hive/ql/Driver.java b278ffe 
  ql/src/test/queries/clientpositive/index_auth.q PRE-CREATION 
  ql/src/test/results/clientnegative/index_compact_entry_limit.q.out fcb2673 
  ql/src/test/results/clientnegative/index_compact_size_limit.q.out fcb2673 
  ql/src/test/results/clientpositive/index_auth.q.out PRE-CREATION 
  ql/src/test/results/clientpositive/index_auto.q.out 8d65f98 
  ql/src/test/results/clientpositive/index_auto_file_format.q.out 194b35e 
  ql/src/test/results/clientpositive/index_auto_multiple.q.out 6b81fc3 
  ql/src/test/results/clientpositive/index_auto_partitioned.q.out b0635db 
  ql/src/test/results/clientpositive/index_auto_unused.q.out 3631bbc 
  ql/src/test/results/clientpositive/index_bitmap.q.out 8f41ce3 
  ql/src/test/results/clientpositive/index_bitmap1.q.out 9f638f5 
  ql/src/test/results/clientpositive/index_bitmap2.q.out e901477 
  ql/src/test/results/clientpositive/index_bitmap3.q.out 116c973 
  ql/src/test/results/clientpositive/index_bitmap_auto.q.out cc9d91e 
  ql/src/test/results/clientpositive/index_bitmap_auto_partitioned.q.out 
9003eb4 
  ql/src/test/results/clientpositive/index_bitmap_rc.q.out 9bd3c98 
  ql/src/test/results/clientpositive/index_compact.q.out c339ec9 
  ql/src/test/results/clientpositive/index_compact_1.q.out 34ba3ca 
  ql/src/test/results/clientpositive/index_compact_2.q.out e8ce238 
  ql/src/test/results/clientpositive/index_compact_3.q.out d39556d 
  ql/src/test/results/clientpositive/index_creation.q.out 532f07e 

Diff: https://reviews.apache.org/r/1137/diff


Testing
---

Added new testcase to TestCliDriver: index_auth.q


Thanks,

Syed



 ClassCastException when building index with security.authorization turned on
 

 Key: HIVE-2286
 URL: https://issues.apache.org/jira/browse/HIVE-2286
 Project: Hive
  Issue Type: Bug
Reporter: Syed S. Albiz
Assignee: Syed S. Albiz
 Attachments: HIVE-2286.1.patch, HIVE-2286.2.patch


 When trying to build an index with authorization checks turned on, hive 
 issues the following ClassCastException:
 org.apache.hadoop.hive.ql.parse.DDLSemanticAnalyzer cannot be cast to
 org.apache.hadoop.hive.ql.parse.SemanticAnalyzer
  at
 org.apache.hadoop.hive.ql.Driver.doAuthorization(Driver.java:540)
  at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:431)
  at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:335)
  at org.apache.hadoop.hive.ql.Driver.run(Driver.java:848)
  at
 org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:224)
  at
 org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:358)
  at
 org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:293)
  at
 org.apache.hadoop.hive.cli.CliDriver.processReader(CliDriver.java:385)
  at
 org.apache.hadoop.hive.cli.CliDriver.processFile(CliDriver.java:392)
  at
 org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:567)
  at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
  at
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.jav
 a:39)
  at
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessor
 Impl.java:25)
  at java.lang.reflect.Method.invoke(Method.java:597)
  at org.apache.hadoop.util.RunJar.main(RunJar.java:156)

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1078) CREATE VIEW followup: CREATE OR REPLACE

2011-07-22 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13069465#comment-13069465
 ] 

jirapos...@reviews.apache.org commented on HIVE-1078:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1058/
---

(Updated 2011-07-22 09:36:21.858818)


Review request for hive.


Changes
---

lol


Summary
---

https://issues.apache.org/jira/browse/HIVE-1078


This addresses bug HIVE-1078.
https://issues.apache.org/jira/browse/HIVE-1078


Diffs
-

  
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveAlterHandler.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CreateViewDesc.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view1.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view2.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view3.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view4.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view5.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view6.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view7.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view8.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/recursive_view.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_or_replace_view.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view1.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view2.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view3.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view4.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view5.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view6.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view7.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view8.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/recursive_view.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_or_replace_view.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_view.q.out
 1146902 

Diff: https://reviews.apache.org/r/1058/diff


Testing (updated)
---

Unit tests passed for r10 (HIVE-1078v9.patch)!!!


Thanks,

Charles



 CREATE VIEW followup:  CREATE OR REPLACE
 

 Key: HIVE-1078
 URL: https://issues.apache.org/jira/browse/HIVE-1078
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: John Sichi
Assignee: Charles Chen
 Attachments: HIVE-1078v3.patch, HIVE-1078v4.patch, HIVE-1078v5.patch, 
 HIVE-1078v6.patch, HIVE-1078v7.patch, HIVE-1078v8.patch, HIVE-1078v9.patch


 Currently, replacing a view requires
 DROP VIEW v;
 CREATE VIEW v AS new-definition;
 CREATE OR REPLACE would allow these to be combined into a single operation.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: 

[jira] [Commented] (HIVE-2282) Local mode needs to work well with block sampling

2011-07-22 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13069635#comment-13069635
 ] 

jirapos...@reviews.apache.org commented on HIVE-2282:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1132/
---

(Updated 2011-07-22 17:40:44.736466)


Review request for hive and Siying Dong.


Changes
---

I added the q.out file which I had forgotten for the new q file.

I also modified the test queries to select count(1) instead of selecting keys 
and values.


Summary
---

A query should run in local mode when block sampling is used and the sample is 
small enough.  The size of the sample is currently being estimated, as it is 
done to estimate the number of reducers.


This addresses bug HIVE-2282.
https://issues.apache.org/jira/browse/HIVE-2282


Diffs (updated)
-

  ql/src/java/org/apache/hadoop/hive/ql/exec/MapRedTask.java 53769a0 
  ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java cd3de76 
  ql/src/test/org/apache/hadoop/hive/ql/hooks/VerifyIsLocalModeHook.java 
PRE-CREATION 
  ql/src/test/queries/clientpositive/sample_islocalmode_hook.q PRE-CREATION 
  ql/src/test/results/clientpositive/sample_islocalmode_hook.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1132/diff


Testing
---

TestCliDriver TestNegativeCliDriver, manually tested


Thanks,

Kevin



 Local mode needs to work well with block sampling
 -

 Key: HIVE-2282
 URL: https://issues.apache.org/jira/browse/HIVE-2282
 Project: Hive
  Issue Type: Improvement
Reporter: Siying Dong
Assignee: Kevin Wilfong
 Attachments: HIVE-2282.1.patch.txt, HIVE-2282.2.patch.txt, 
 HIVE-2282.3.patch.txt


 Currently, if block sampling is enabled and large set of data are sampled to 
 a small set, local mode needs to be kicked in. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2292) Comment clause should immediately follow identifier field in CREATE DATABASE statement

2011-07-22 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13069675#comment-13069675
 ] 

jirapos...@reviews.apache.org commented on HIVE-2292:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1186/
---

Review request for hive.


Summary
---

Review for HIVE-2292.


This addresses bug HIVE-2292.
https://issues.apache.org/jira/browse/HIVE-2292


Diffs
-

  ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g 3a6d0a6 
  ql/src/test/queries/clientpositive/database_location.q ac8687f 
  ql/src/test/results/clientpositive/database_location.q.out a09f5a2 

Diff: https://reviews.apache.org/r/1186/diff


Testing
---


Thanks,

Carl



 Comment clause should immediately follow identifier field in CREATE DATABASE 
 statement
 --

 Key: HIVE-2292
 URL: https://issues.apache.org/jira/browse/HIVE-2292
 Project: Hive
  Issue Type: Bug
  Components: SQL
Reporter: Carl Steinbach
Assignee: Carl Steinbach
 Fix For: 0.8.0

 Attachments: HIVE-2292.1.patch.txt




--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2183) In Task class and its subclasses logger is initialized in constructor

2011-07-22 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13069854#comment-13069854
 ] 

jirapos...@reviews.apache.org commented on HIVE-2183:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1146/#review1174
---



trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java
https://reviews.apache.org/r/1146/#comment2463

Why is this NO_OF_THREADS_CAN_BE_LAUNCHED for sleeptime?




trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/MapJoinProcessor.java
https://reviews.apache.org/r/1146/#comment2462

I don't think this is an improvement; it's better to see the rule 
definition inline.  If this was suggested by an automatic code scanner, 
probably tell it to ignore this instance.


- John


On 2011-07-20 14:36:59, chinna wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1146/
bq.  ---
bq.  
bq.  (Updated 2011-07-20 14:36:59)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  In Task class and its subclasses logger is initialized in constructor. Log 
object no need to initialize every time in the constructor, Log object can make 
it as static object.
bq.  
bq.  
bq.  This addresses bug HIVE-2183.
bq.  https://issues.apache.org/jira/browse/HIVE-2183
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java 1145025 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/CopyTask.java 1145025 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/ExecDriver.java 1145025 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FetchTask.java 1145025 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FunctionRegistry.java 
1145025 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FunctionTask.java 
1145025 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/MapredLocalTask.java 
1145025 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/MoveTask.java 1145025 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/StatsTask.java 1145025 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Task.java 1145025 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/MapJoinProcessor.java 
1145025 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java 
1145025 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/FunctionSemanticAnalyzer.java 
1145025 
bq.  
bq.  Diff: https://reviews.apache.org/r/1146/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  All unit tests passed
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  chinna
bq.  
bq.



 In Task class and its subclasses logger is initialized in constructor
 -

 Key: HIVE-2183
 URL: https://issues.apache.org/jira/browse/HIVE-2183
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.5.0, 0.8.0
 Environment: Hadoop 0.20.1, Hive0.8.0 and SUSE Linux Enterprise 
 Server 10 SP2 (i586) - Kernel 2.6.16.60-0.21-smp (5)
Reporter: Chinna Rao Lalam
Assignee: Chinna Rao Lalam
Priority: Minor
 Attachments: HIVE-2183.1.patch, HIVE-2183.patch


 In Task class and its subclasses logger is initialized in constructor. Log 
 object no need to initialize every time in the constructor, Log object can 
 make it as static object.
 {noformat}
 Ex:
   public ExecDriver() {
 super();
 LOG = LogFactory.getLog(this.getClass().getName());
 console = new LogHelper(LOG);
 this.jobExecHelper = new HadoopJobExecHelper(job, console, this, this);
   }
 {noformat}
 Need to change like this
 {noformat}
 private static final Log LOG = LogFactory.getLog(ExecDriver.class);
 {noformat}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2236) Cli: Print Hadoop's CPU milliseconds

2011-07-21 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13069094#comment-13069094
 ] 

jirapos...@reviews.apache.org commented on HIVE-2236:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/948/
---

(Updated 2011-07-21 17:30:55.228025)


Review request for hive, Yongqiang He, Ning Zhang, and namit jain.


Changes
---

fix a bug


Summary
---

In hive CLI, print out CPU msec from Hadoop MapReduce coutners.


This addresses bug HIVE-2236.
https://issues.apache.org/jira/browse/HIVE-2236


Diffs (updated)
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java 1148623 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/MapRedStats.java PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/ExecDriver.java 1148623 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/HadoopJobExecHelper.java 
1148623 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java 1148623 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/session/SessionState.java 1148623 

Diff: https://reviews.apache.org/r/948/diff


Testing
---

run the updated codes against real clusters and make sure it printing is 
correct.


Thanks,

Siying



 Cli: Print Hadoop's CPU milliseconds
 

 Key: HIVE-2236
 URL: https://issues.apache.org/jira/browse/HIVE-2236
 Project: Hive
  Issue Type: New Feature
  Components: CLI
Reporter: Siying Dong
Assignee: Siying Dong
Priority: Minor
 Attachments: HIVE-2236.1.patch, HIVE-2236.2.patch, HIVE-2236.3.patch


 CPU Milliseonds information is available from Hadoop's framework. Printing it 
 out to Hive CLI when executing a job will help users to know more about their 
 jobs.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2247) ALTER TABLE RENAME PARTITION

2011-07-21 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13069146#comment-13069146
 ] 

jirapos...@reviews.apache.org commented on HIVE-2247:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1105/#review1156
---


Please try to add the new column in the middle first. If that works, we should 
do that way to make it consistent with alter_table() call. If that doesn't 
work, it's OK to add it to the end now.


trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java
https://reviews.apache.org/r/1105/#comment2385

Why we still need another function call rename_partition_core()? Can't we 
just modify alter_partition_core() to always use the same logic?


- Siying


On 2011-07-21 01:20:25, Weiyan Wang wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1105/
bq.  ---
bq.  
bq.  (Updated 2011-07-21 01:20:25)
bq.  
bq.  
bq.  Review request for Siying Dong.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Implement ALTER TABLE PARTITION RENAME function to rename a partition. 
bq.  Add HiveQL syntax ALTER TABLE bar PARTITION (k1='v1', k2='v2') RENAME TO 
PARTITION (k1='v3', k2='v4');
bq.  This is my first Hive diff, I just learn everything from existing codebase 
and may not have a good understanding on it. 
bq.  Feel free to inform me if I make something wrong. Thanks
bq.  
bq.  
bq.  This addresses bug HIVE-2247.
bq.  https://issues.apache.org/jira/browse/HIVE-2247
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/metastore/if/hive_metastore.thrift 1145366 
bq.trunk/metastore/src/gen/thrift/gen-cpp/ThriftHiveMetastore.h 1145366 
bq.trunk/metastore/src/gen/thrift/gen-cpp/ThriftHiveMetastore.cpp 1145366 
bq.
trunk/metastore/src/gen/thrift/gen-cpp/ThriftHiveMetastore_server.skeleton.cpp 
1145366 
bq.
trunk/metastore/src/gen/thrift/gen-javabean/org/apache/hadoop/hive/metastore/api/ThriftHiveMetastore.java
 1145366 
bq.
trunk/metastore/src/gen/thrift/gen-php/hive_metastore/ThriftHiveMetastore.php 
1145366 
bq.
trunk/metastore/src/gen/thrift/gen-py/hive_metastore/ThriftHiveMetastore-remote 
1145366 
bq.
trunk/metastore/src/gen/thrift/gen-py/hive_metastore/ThriftHiveMetastore.py 
1145366 
bq.trunk/metastore/src/gen/thrift/gen-rb/thrift_hive_metastore.rb 1145366 
bq.
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveAlterHandler.java 
1145366 
bq.
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 
1145366 
bq.
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStoreClient.java
 1145366 
bq.
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/IMetaStoreClient.java 
1145366 
bq.
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java 
1145366 
bq.trunk/metastore/src/java/org/apache/hadoop/hive/metastore/RawStore.java 
1145366 
bq.
trunk/metastore/src/test/org/apache/hadoop/hive/metastore/TestHiveMetaStore.java
 1145366 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java 1145366 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java 1145366 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java 
1145366 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g 1145366 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzerFactory.java 
1145366 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/AlterTableDesc.java 
1145366 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/DDLWork.java 1145366 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/HiveOperation.java 
1145366 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/RenamePartitionDesc.java 
PRE-CREATION 
bq.
trunk/ql/src/test/queries/clientnegative/alter_rename_partition_failure.q 
PRE-CREATION 
bq.
trunk/ql/src/test/queries/clientnegative/alter_rename_partition_failure2.q 
PRE-CREATION 
bq.
trunk/ql/src/test/queries/clientnegative/alter_rename_partition_failure3.q 
PRE-CREATION 
bq.trunk/ql/src/test/queries/clientpositive/alter_rename_partition.q 
PRE-CREATION 
bq.
trunk/ql/src/test/queries/clientpositive/alter_rename_partition_authorization.q 
PRE-CREATION 
bq.
trunk/ql/src/test/results/clientnegative/alter_rename_partition_failure.q.out 
PRE-CREATION 
bq.
trunk/ql/src/test/results/clientnegative/alter_rename_partition_failure2.q.out 
PRE-CREATION 
bq.
trunk/ql/src/test/results/clientnegative/alter_rename_partition_failure3.q.out 
PRE-CREATION 
bq.

[jira] [Commented] (HIVE-1078) CREATE VIEW followup: CREATE OR REPLACE

2011-07-21 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13069240#comment-13069240
 ] 

jirapos...@reviews.apache.org commented on HIVE-1078:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1058/
---

(Updated 2011-07-21 22:07:29.150219)


Review request for hive.


Changes
---

Fix failure of create_or_replace_view.q


Summary
---

https://issues.apache.org/jira/browse/HIVE-1078


This addresses bug HIVE-1078.
https://issues.apache.org/jira/browse/HIVE-1078


Diffs (updated)
-

  
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveAlterHandler.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CreateViewDesc.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view1.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view2.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view3.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view4.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view5.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view6.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view7.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view8.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/recursive_view.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_or_replace_view.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view1.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view2.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view3.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view4.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view5.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view6.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view7.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view8.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/recursive_view.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_or_replace_view.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_view.q.out
 1146902 

Diff: https://reviews.apache.org/r/1058/diff


Testing
---

Passes unit tests


Thanks,

Charles



 CREATE VIEW followup:  CREATE OR REPLACE
 

 Key: HIVE-1078
 URL: https://issues.apache.org/jira/browse/HIVE-1078
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: John Sichi
Assignee: Charles Chen
 Attachments: HIVE-1078v3.patch, HIVE-1078v4.patch, HIVE-1078v5.patch, 
 HIVE-1078v6.patch, HIVE-1078v7.patch, HIVE-1078v8.patch, HIVE-1078v9.patch


 Currently, replacing a view requires
 DROP VIEW v;
 CREATE VIEW v AS new-definition;
 CREATE OR REPLACE would allow these to be combined into a single operation.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: 

[jira] [Commented] (HIVE-2128) Automatic Indexing with multiple tables

2011-07-21 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13069299#comment-13069299
 ] 

jirapos...@reviews.apache.org commented on HIVE-2128:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1010/
---

(Updated 2011-07-21 23:52:23.929900)


Review request for hive and John Sichi.


Changes
---

Added order by to testcases. This revealed an existing bug where we would walk 
the entire operator tree for each task in the task tree in 
IndexWhereTaskDispatcher. I amended this to only walk the subset of the 
operator tree in the current task.


Summary
---

Grab the indexed tables during optimized query generation, grab the associated 
path URIs, and keep those around in the Configuration object. When the job is 
passed to ExecDriver, this data is extracted and used in HiveIndexedInputFormat 
to decide whether to use the index file or delegate to the parent 
(HiveInputFormat) class. Not sure if this is robust. 


This addresses bug HIVE-2128.
https://issues.apache.org/jira/browse/HIVE-2128


Diffs (updated)
-

  ql/src/test/results/clientpositive/index_bitmap_auto_partitioned.q.out 
4c9efd1 
  ql/src/test/results/clientpositive/index_auto_self_join.q.out PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/index/HiveIndexResult.java b9b586e 
  ql/src/java/org/apache/hadoop/hive/ql/index/HiveIndexedInputFormat.java 
f1ee95d 
  ql/src/java/org/apache/hadoop/hive/ql/index/bitmap/BitmapIndexHandler.java 
61bbbf5 
  ql/src/java/org/apache/hadoop/hive/ql/index/compact/CompactIndexHandler.java 
7c91946 
  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/index/IndexWhereProcessor.java
 dbc489f 
  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/index/IndexWhereTaskDispatcher.java
 da084f6 
  ql/src/java/org/apache/hadoop/hive/ql/plan/MapredWork.java a03a9a6 
  ql/src/test/queries/clientpositive/index_auto_mult_tables.q PRE-CREATION 
  ql/src/test/queries/clientpositive/index_auto_mult_tables_compact.q 
PRE-CREATION 
  ql/src/test/queries/clientpositive/index_auto_self_join.q PRE-CREATION 
  ql/src/test/results/clientpositive/index_auto_mult_tables.q.out PRE-CREATION 
  ql/src/test/results/clientpositive/index_auto_mult_tables_compact.q.out 
PRE-CREATION 

Diff: https://reviews.apache.org/r/1010/diff


Testing
---

added new testcase index_auto_mult_tables.q


Thanks,

Syed



 Automatic Indexing with multiple tables
 ---

 Key: HIVE-2128
 URL: https://issues.apache.org/jira/browse/HIVE-2128
 Project: Hive
  Issue Type: Improvement
  Components: Indexing
Affects Versions: 0.8.0
Reporter: Russell Melick
Assignee: Syed S. Albiz
 Attachments: HIVE-2128.1.patch, HIVE-2128.1.patch, HIVE-2128.2.patch, 
 HIVE-2128.4.patch, HIVE-2128.5.patch, HIVE-2128.6.patch


 Make automatic indexing work with jobs which access multiple tables.  We'll 
 probably need to modify the way that the index input format works in order to 
 associate index formats/files with specific tables.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2246) Dedupe tables' column schemas from partitions in the metastore db

2011-07-21 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13069389#comment-13069389
 ] 

jirapos...@reviews.apache.org commented on HIVE-2246:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1183/
---

Review request for hive, Ning Zhang and Paul Yang.


Summary
---

This patch tries to make minimal changes to the API while keeping migration 
short and somewhat easy to revert.

The new schema can be described as follows:
- CDS is a table corresponding to Column Descriptor objects.  Currently, it 
only stores a CD_ID.
- COLUMNS_V2 is a table corresponding to MFieldSchema objects, or columns.  A 
Column Descriptor holds a list of columns.  COLUMNS_V2 has a foreign key to the 
CD_ID to which it belongs.
- SDS was modified to reference a Column Descriptor. So SDS now has a foreign 
key to a CD_ID which describes its columns.

During migration, we create Column Descriptors for tables in a straightforward 
manner: their columns are now just wrapped inside a column descriptor.  The SDS 
of partitions use their parent table's column descriptor, since currently a 
partition and its table share the same list of columns.

When altering or adding a partition, give it it's parent table's column 
descriptor IF the columns they describe are the same.  Otherwise, create a new 
column descriptor for its columns.

When adding or altering a table, create a new column descriptor every time.

Whenever you drop a storage descriptor (e.g, when dropping tables or 
partitions), check to see if the related column descriptor has any other 
references in the table.  That is, check to see if any other storage 
descriptors point to that column descriptor.  If none do, then delete that 
column descriptor.  This check is in place so we don't have unreferenced column 
descriptors and columns hanging around after schema evolution for tables.


This addresses bug HIVE-2246.
https://issues.apache.org/jira/browse/HIVE-2246


Diffs
-

  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java 
1148945 
  
trunk/metastore/src/model/org/apache/hadoop/hive/metastore/model/MStorageDescriptor.java
 1148945 
  trunk/metastore/src/model/package.jdo 1148945 

Diff: https://reviews.apache.org/r/1183/diff


Testing
---

Passes facebook's regression testing and all existing test cases.  In one 
instance, before migration, the overhead involved with storage descriptors and 
columns was ~11 GB.  After migration, the overhead was ~1.5 GB.


Thanks,

Sohan



 Dedupe tables' column schemas from partitions in the metastore db
 -

 Key: HIVE-2246
 URL: https://issues.apache.org/jira/browse/HIVE-2246
 Project: Hive
  Issue Type: Improvement
  Components: Metastore
Reporter: Sohan Jain
Assignee: Sohan Jain
 Attachments: HIVE-2246.2.patch


 We can re-organize the JDO models to reduce space usage to keep the metastore 
 scalable for the future.  Currently, partitions are the fastest growing 
 objects in the metastore, and the metastore keeps a separate copy of the 
 columns list for each partition.  We can normalize the metastore db by 
 decoupling Columns from Storage Descriptors and not storing duplicate lists 
 of the columns for each partition. 
 An idea is to create an additional level of indirection with a Column 
 Descriptor that has a list of columns.  A table has a reference to its 
 latest Column Descriptor (note: a table may have more than one Column 
 Descriptor in the case of schema evolution).  Partitions and Indexes can 
 reference the same Column Descriptors as their parent table.
 Currently, the COLUMNS table in the metastore has roughly (number of 
 partitions + number of tables) * (average number of columns pertable) rows.  
 We can reduce this to (number of tables) * (average number of columns per 
 table) rows, while incurring a small cost proportional to the number of 
 tables to store the Column Descriptors.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2246) Dedupe tables' column schemas from partitions in the metastore db

2011-07-21 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13069392#comment-13069392
 ] 

jirapos...@reviews.apache.org commented on HIVE-2246:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1183/
---

(Updated 2011-07-22 05:30:29.026246)


Review request for hive, Ning Zhang and Paul Yang.


Changes
---

Adding some files I missed in the last diff.


Summary
---

This patch tries to make minimal changes to the API while keeping migration 
short and somewhat easy to revert.

The new schema can be described as follows:
- CDS is a table corresponding to Column Descriptor objects.  Currently, it 
only stores a CD_ID.
- COLUMNS_V2 is a table corresponding to MFieldSchema objects, or columns.  A 
Column Descriptor holds a list of columns.  COLUMNS_V2 has a foreign key to the 
CD_ID to which it belongs.
- SDS was modified to reference a Column Descriptor. So SDS now has a foreign 
key to a CD_ID which describes its columns.

During migration, we create Column Descriptors for tables in a straightforward 
manner: their columns are now just wrapped inside a column descriptor.  The SDS 
of partitions use their parent table's column descriptor, since currently a 
partition and its table share the same list of columns.

When altering or adding a partition, give it it's parent table's column 
descriptor IF the columns they describe are the same.  Otherwise, create a new 
column descriptor for its columns.

When adding or altering a table, create a new column descriptor every time.

Whenever you drop a storage descriptor (e.g, when dropping tables or 
partitions), check to see if the related column descriptor has any other 
references in the table.  That is, check to see if any other storage 
descriptors point to that column descriptor.  If none do, then delete that 
column descriptor.  This check is in place so we don't have unreferenced column 
descriptors and columns hanging around after schema evolution for tables.


This addresses bug HIVE-2246.
https://issues.apache.org/jira/browse/HIVE-2246


Diffs (updated)
-

  trunk/metastore/scripts/upgrade/mysql/008-HIVE-2246.mysql.sql PRE-CREATION 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java 
1148945 
  
trunk/metastore/src/model/org/apache/hadoop/hive/metastore/model/MColumnDescriptor.java
 PRE-CREATION 
  
trunk/metastore/src/model/org/apache/hadoop/hive/metastore/model/MStorageDescriptor.java
 1148945 
  trunk/metastore/src/model/package.jdo 1148945 

Diff: https://reviews.apache.org/r/1183/diff


Testing
---

Passes facebook's regression testing and all existing test cases.  In one 
instance, before migration, the overhead involved with storage descriptors and 
columns was ~11 GB.  After migration, the overhead was ~1.5 GB.


Thanks,

Sohan



 Dedupe tables' column schemas from partitions in the metastore db
 -

 Key: HIVE-2246
 URL: https://issues.apache.org/jira/browse/HIVE-2246
 Project: Hive
  Issue Type: Improvement
  Components: Metastore
Reporter: Sohan Jain
Assignee: Sohan Jain
 Attachments: HIVE-2246.2.patch, HIVE-2246.3.patch


 Note: this patch proposes a schema change, and is therefore incompatible with 
 the current metastore.
 We can re-organize the JDO models to reduce space usage to keep the metastore 
 scalable for the future.  Currently, partitions are the fastest growing 
 objects in the metastore, and the metastore keeps a separate copy of the 
 columns list for each partition.  We can normalize the metastore db by 
 decoupling Columns from Storage Descriptors and not storing duplicate lists 
 of the columns for each partition. 
 An idea is to create an additional level of indirection with a Column 
 Descriptor that has a list of columns.  A table has a reference to its 
 latest Column Descriptor (note: a table may have more than one Column 
 Descriptor in the case of schema evolution).  Partitions and Indexes can 
 reference the same Column Descriptors as their parent table.
 Currently, the COLUMNS table in the metastore has roughly (number of 
 partitions + number of tables) * (average number of columns pertable) rows.  
 We can reduce this to (number of tables) * (average number of columns per 
 table) rows, while incurring a small cost proportional to the number of 
 tables to store the Column Descriptors.
 Please see the latest review board for additional implementation details.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1078) CREATE VIEW followup: CREATE OR REPLACE

2011-07-20 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13068165#comment-13068165
 ] 

jirapos...@reviews.apache.org commented on HIVE-1078:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1058/#review1128
---



http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
https://reviews.apache.org/r/1058/#comment2356

Defer the db.getPartitions (which could be expensive) so that we don't do 
it unless we're sure that the partition keys are actually changing.



http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
https://reviews.apache.org/r/1058/#comment2357

Avoid usage of java.util.Stack.  Some old Hive code uses it but it's 
deprecated because it's synchronized for no good reason.



http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
https://reviews.apache.org/r/1058/#comment2358

add spaces around operators such as =


- John


On 2011-07-20 01:01:53, Charles Chen wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1058/
bq.  ---
bq.  
bq.  (Updated 2011-07-20 01:01:53)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  https://issues.apache.org/jira/browse/HIVE-1078
bq.  
bq.  
bq.  This addresses bug HIVE-1078.
bq.  https://issues.apache.org/jira/browse/HIVE-1078
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveAlterHandler.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CreateViewDesc.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view1.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view2.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view3.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view4.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_or_replace_view.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view1.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view2.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view3.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view4.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_or_replace_view.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_view.q.out
 1146902 
bq.  
bq.  Diff: https://reviews.apache.org/r/1058/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Passes unit tests
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Charles
bq.  
bq.



 CREATE VIEW followup:  CREATE OR REPLACE
 

 Key: HIVE-1078
 URL: https://issues.apache.org/jira/browse/HIVE-1078
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: John Sichi
Assignee: Charles Chen
 Attachments: HIVE-1078v3.patch, HIVE-1078v4.patch, HIVE-1078v5.patch, 
 HIVE-1078v6.patch, HIVE-1078v7.patch


 Currently, replacing a view requires
 DROP VIEW v;
 CREATE VIEW v AS new-definition;
 CREATE OR REPLACE would allow these to be combined into a single operation.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: 

[jira] [Commented] (HIVE-2236) Cli: Print Hadoop's CPU milliseconds

2011-07-20 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13068169#comment-13068169
 ] 

jirapos...@reviews.apache.org commented on HIVE-2236:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/948/
---

(Updated 2011-07-20 06:27:19.820431)


Review request for hive, Yongqiang He, Ning Zhang, and namit jain.


Changes
---

remove MapRedStats from DriverContext and add more counters to it.


Summary
---

In hive CLI, print out CPU msec from Hadoop MapReduce coutners.


This addresses bug HIVE-2236.
https://issues.apache.org/jira/browse/HIVE-2236


Diffs (updated)
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java 1148623 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/MapRedStats.java PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/ExecDriver.java 1148623 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/HadoopJobExecHelper.java 
1148623 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java 1148623 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/session/SessionState.java 1148623 

Diff: https://reviews.apache.org/r/948/diff


Testing
---

run the updated codes against real clusters and make sure it printing is 
correct.


Thanks,

Siying



 Cli: Print Hadoop's CPU milliseconds
 

 Key: HIVE-2236
 URL: https://issues.apache.org/jira/browse/HIVE-2236
 Project: Hive
  Issue Type: New Feature
  Components: CLI
Reporter: Siying Dong
Assignee: Siying Dong
Priority: Minor
 Attachments: HIVE-2236.1.patch, HIVE-2236.2.patch


 CPU Milliseonds information is available from Hadoop's framework. Printing it 
 out to Hive CLI when executing a job will help users to know more about their 
 jobs.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2080) Few code improvements in the ql and serde packages.

2011-07-20 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13068404#comment-13068404
 ] 

jirapos...@reviews.apache.org commented on HIVE-2080:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1144/
---

Review request for hive.


Summary
---

Few code improvements in the ql and serde packages.
1) Little performance Improvements 
2) Null checks to avoid NPEs
3) Effective varaible management.


This addresses bug HIVE-2080.
https://issues.apache.org/jira/browse/HIVE-2080


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/CommonJoinOperator.java 
1148179 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FileSinkOperator.java 
1148179 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FilterOperator.java 1148179 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/GroupByOperator.java 1148179 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Operator.java 1148179 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/ReduceSinkOperator.java 
1148179 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/SelectOperator.java 1148179 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/TableScanOperator.java 
1148179 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/TaskFactory.java 1148179 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/UnionOperator.java 1148179 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ASTNode.java 1148179 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java 
1148179 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/ParseContext.java 1148179 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 
1148179 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/dynamic_type/DynamicSerDeField.java
 1148179 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/dynamic_type/DynamicSerDeFieldType.java
 1148179 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/dynamic_type/DynamicSerDeFunction.java
 1148179 

Diff: https://reviews.apache.org/r/1144/diff


Testing
---

All unit test passed


Thanks,

chinna



 Few code improvements in the ql and serde packages.
 ---

 Key: HIVE-2080
 URL: https://issues.apache.org/jira/browse/HIVE-2080
 Project: Hive
  Issue Type: Bug
  Components: Query Processor, Serializers/Deserializers
Affects Versions: 0.7.0
 Environment: Hadoop 0.20.1, Hive0.7.0 and SUSE Linux Enterprise 
 Server 10 SP2 (i586) - Kernel 2.6.16.60-0.21-smp (5).
Reporter: Chinna Rao Lalam
Assignee: Chinna Rao Lalam
 Attachments: HIVE-2080.1.Patch, HIVE-2080.Patch


 Few code improvements in the ql and serde packages.
 1) Little performance Improvements 
 2) Null checks to avoid NPEs
 3) Effective varaible management.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2183) In Task class and its subclasses logger is initialized in constructor

2011-07-20 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13068408#comment-13068408
 ] 

jirapos...@reviews.apache.org commented on HIVE-2183:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1146/
---

Review request for hive.


Summary
---

In Task class and its subclasses logger is initialized in constructor. Log 
object no need to initialize every time in the constructor, Log object can make 
it as static object.


This addresses bug HIVE-2183.
https://issues.apache.org/jira/browse/HIVE-2183


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/Driver.java 1145025 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/CopyTask.java 1145025 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/ExecDriver.java 1145025 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FetchTask.java 1145025 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FunctionRegistry.java 
1145025 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FunctionTask.java 1145025 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/MapredLocalTask.java 1145025 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/MoveTask.java 1145025 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/StatsTask.java 1145025 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Task.java 1145025 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/MapJoinProcessor.java 
1145025 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java 
1145025 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/FunctionSemanticAnalyzer.java 
1145025 

Diff: https://reviews.apache.org/r/1146/diff


Testing
---

All unit tests passed


Thanks,

chinna



 In Task class and its subclasses logger is initialized in constructor
 -

 Key: HIVE-2183
 URL: https://issues.apache.org/jira/browse/HIVE-2183
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.5.0, 0.8.0
 Environment: Hadoop 0.20.1, Hive0.8.0 and SUSE Linux Enterprise 
 Server 10 SP2 (i586) - Kernel 2.6.16.60-0.21-smp (5)
Reporter: Chinna Rao Lalam
Assignee: Chinna Rao Lalam
Priority: Minor
 Attachments: HIVE-2183.1.patch, HIVE-2183.patch


 In Task class and its subclasses logger is initialized in constructor. Log 
 object no need to initialize every time in the constructor, Log object can 
 make it as static object.
 {noformat}
 Ex:
   public ExecDriver() {
 super();
 LOG = LogFactory.getLog(this.getClass().getName());
 console = new LogHelper(LOG);
 this.jobExecHelper = new HadoopJobExecHelper(job, console, this, this);
   }
 {noformat}
 Need to change like this
 {noformat}
 private static final Log LOG = LogFactory.getLog(ExecDriver.class);
 {noformat}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1078) CREATE VIEW followup: CREATE OR REPLACE

2011-07-20 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13068519#comment-13068519
 ] 

jirapos...@reviews.apache.org commented on HIVE-1078:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1058/
---

(Updated 2011-07-20 18:03:04.848086)


Review request for hive.


Changes
---

Add testcases, fixed issues in comments above (btw the previous revision passed 
unit tests)


Summary
---

https://issues.apache.org/jira/browse/HIVE-1078


This addresses bug HIVE-1078.
https://issues.apache.org/jira/browse/HIVE-1078


Diffs (updated)
-

  
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveAlterHandler.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CreateViewDesc.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view1.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view2.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view3.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view4.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view5.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view6.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view7.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view8.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/recursive_view.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_or_replace_view.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view1.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view2.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view3.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view4.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view5.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view6.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view7.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view8.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/recursive_view.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_or_replace_view.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_view.q.out
 1146902 

Diff: https://reviews.apache.org/r/1058/diff


Testing
---

Passes unit tests


Thanks,

Charles



 CREATE VIEW followup:  CREATE OR REPLACE
 

 Key: HIVE-1078
 URL: https://issues.apache.org/jira/browse/HIVE-1078
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: John Sichi
Assignee: Charles Chen
 Attachments: HIVE-1078v3.patch, HIVE-1078v4.patch, HIVE-1078v5.patch, 
 HIVE-1078v6.patch, HIVE-1078v7.patch, HIVE-1078v8.patch


 Currently, replacing a view requires
 DROP VIEW v;
 CREATE VIEW v AS new-definition;
 CREATE OR REPLACE would allow these to be combined into a single operation.

--
This message is automatically generated by JIRA.
For more information 

[jira] [Commented] (HIVE-2201) reduce name node calls in hive by creating temporary directories

2011-07-20 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13068715#comment-13068715
 ] 

jirapos...@reviews.apache.org commented on HIVE-2201:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/952/
---

(Updated 2011-07-20 23:31:54.007436)


Review request for hive, Yongqiang He, Ning Zhang, and namit jain.


Changes
---

1. change block merge task too
2. change the capital file name


Summary
---

reduce name node calls in hive by creating temporary directories


This addresses bug HIVE-2201.
https://issues.apache.org/jira/browse/HIVE-2201


Diffs (updated)
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/ExecDriver.java 1148905 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FileSinkOperator.java 
1148905 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java 1148905 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/io/RCFileOutputFormat.java 
1148905 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/io/rcfile/merge/BlockMergeTask.java 
1148905 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/io/rcfile/merge/RCFileMergeMapper.java
 1148905 

Diff: https://reviews.apache.org/r/952/diff


Testing
---


Thanks,

Siying



 reduce name node calls in hive by creating temporary directories
 

 Key: HIVE-2201
 URL: https://issues.apache.org/jira/browse/HIVE-2201
 Project: Hive
  Issue Type: Improvement
Reporter: Namit Jain
Assignee: Siying Dong
 Attachments: HIVE-2201.1.patch, HIVE-2201.2.patch, HIVE-2201.3.patch, 
 HIVE-2201.4.patch


 Currently, in Hive, when a file gets written by a FileSinkOperator,
 the sequence of operations is as follows:
 1. In tmp directory tmp1, create a tmp file _tmp_1
 2. At the end of the operator, move
 /tmp1/_tmp_1 to /tmp1/1
 3. Move directory /tmp1 to /tmp2
 4. For all files in /tmp2, remove all files starting with _tmp and
 duplicate files.
 Due to speculative execution, a lot of temporary files are created
 in /tmp1 (or /tmp2). This leads to a lot of name node calls,
 specially for large queries.
 The protocol above can be modified slightly:
 1. In tmp directory tmp1, create a tmp file _tmp_1
 2. At the end of the operator, move
 /tmp1/_tmp_1 to /tmp2/1
 3. Move directory /tmp2 to /tmp3
 4. For all files in /tmp3, remove all duplicate files.
 This should reduce the number of tmp files.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2247) ALTER TABLE RENAME PARTITION

2011-07-20 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13068753#comment-13068753
 ] 

jirapos...@reviews.apache.org commented on HIVE-2247:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1105/
---

(Updated 2011-07-21 01:20:25.242756)


Review request for Siying Dong.


Changes
---

Refactor the code, rename_partition shares the same thrift API as 
alter_partition, we do alter_partition when part_vals is empty, we do 
rename_partition when part_vals is given 


Summary
---

Implement ALTER TABLE PARTITION RENAME function to rename a partition. 
Add HiveQL syntax ALTER TABLE bar PARTITION (k1='v1', k2='v2') RENAME TO 
PARTITION (k1='v3', k2='v4');
This is my first Hive diff, I just learn everything from existing codebase and 
may not have a good understanding on it. 
Feel free to inform me if I make something wrong. Thanks


This addresses bug HIVE-2247.
https://issues.apache.org/jira/browse/HIVE-2247


Diffs (updated)
-

  trunk/metastore/if/hive_metastore.thrift 1145366 
  trunk/metastore/src/gen/thrift/gen-cpp/ThriftHiveMetastore.h 1145366 
  trunk/metastore/src/gen/thrift/gen-cpp/ThriftHiveMetastore.cpp 1145366 
  
trunk/metastore/src/gen/thrift/gen-cpp/ThriftHiveMetastore_server.skeleton.cpp 
1145366 
  
trunk/metastore/src/gen/thrift/gen-javabean/org/apache/hadoop/hive/metastore/api/ThriftHiveMetastore.java
 1145366 
  trunk/metastore/src/gen/thrift/gen-php/hive_metastore/ThriftHiveMetastore.php 
1145366 
  
trunk/metastore/src/gen/thrift/gen-py/hive_metastore/ThriftHiveMetastore-remote 
1145366 
  trunk/metastore/src/gen/thrift/gen-py/hive_metastore/ThriftHiveMetastore.py 
1145366 
  trunk/metastore/src/gen/thrift/gen-rb/thrift_hive_metastore.rb 1145366 
  
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveAlterHandler.java 
1145366 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 
1145366 
  
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStoreClient.java
 1145366 
  
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/IMetaStoreClient.java 
1145366 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java 
1145366 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/RawStore.java 
1145366 
  
trunk/metastore/src/test/org/apache/hadoop/hive/metastore/TestHiveMetaStore.java
 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java 
1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g 1145366 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzerFactory.java 
1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/AlterTableDesc.java 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/DDLWork.java 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/HiveOperation.java 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/RenamePartitionDesc.java 
PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/alter_rename_partition_failure.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/alter_rename_partition_failure2.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/alter_rename_partition_failure3.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/alter_rename_partition.q 
PRE-CREATION 
  
trunk/ql/src/test/queries/clientpositive/alter_rename_partition_authorization.q 
PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/alter_rename_partition_failure.q.out 
PRE-CREATION 
  
trunk/ql/src/test/results/clientnegative/alter_rename_partition_failure2.q.out 
PRE-CREATION 
  
trunk/ql/src/test/results/clientnegative/alter_rename_partition_failure3.q.out 
PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/alter_rename_partition.q.out 
PRE-CREATION 
  
trunk/ql/src/test/results/clientpositive/alter_rename_partition_authorization.q.out
 PRE-CREATION 

Diff: https://reviews.apache.org/r/1105/diff


Testing
---

Add a partition A in the table
Rename partition A to partition B
Show the partitions in the table, it returns partition B.
SELECT the data from partition A, it returns no results
SELECT the data from partition B, it returns the data originally stored in 
partition A


Thanks,

Weiyan



 ALTER TABLE RENAME PARTITION
 

 Key: HIVE-2247
 URL: https://issues.apache.org/jira/browse/HIVE-2247
 Project: Hive
  Issue Type: New Feature
Reporter: Siying Dong
Assignee: Weiyan Wang
 Attachments: HIVE-2247.3.patch.txt, 

[jira] [Commented] (HIVE-2296) bad compressed file names from insert into

2011-07-20 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13068794#comment-13068794
 ] 

jirapos...@reviews.apache.org commented on HIVE-2296:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1155/
---

Review request for hive and Siying Dong.


Summary
---

Fixes problem of bad compressed file names by stripping off the file format (ex 
.gz) and reappending it to the path later.


This addresses bug HIVE-2296.
https://issues.apache.org/jira/browse/HIVE-2296


Diffs
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java 1148973 
  trunk/ql/src/test/queries/clientpositive/insert_compressed.q PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/insert_compressed.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1155/diff


Testing
---

Unit tests pass


Thanks,

Franklin



 bad compressed file names from insert into
 --

 Key: HIVE-2296
 URL: https://issues.apache.org/jira/browse/HIVE-2296
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: Franklin Hu
Assignee: Franklin Hu
 Attachments: hive-2296.1.patch, hive-2296.2.patch


 When INSERT INTO is run on a table with compressed output 
 (hive.exec.compress.output=true) and existing files in the table, it may copy 
 the new files in bad file names:
 Before INSERT INTO:
 00_0.gz
 After INSERT INTO:
 00_0.gz
 00_0.gz_copy_1
 This causes corrupted output when doing a SELECT * on the table.
 Correct behavior should be to pick a valid filename such as:
 00_0_copy_1.gz

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1884) Potential risk of resource leaks in Hive

2011-07-19 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067638#comment-13067638
 ] 

jirapos...@reviews.apache.org commented on HIVE-1884:
-



bq.  On 2011-07-18 20:45:08, John Sichi wrote:
bq.   trunk/cli/src/java/org/apache/hadoop/hive/cli/CliDriver.java, line 392
bq.   https://reviews.apache.org/r/1098/diff/1/?file=25035#file25035line392
bq.  
bq.   Rename this to rc to avoid confusion with the processReader method 
name.

processReader variable name renamed to rc


bq.  On 2011-07-18 20:45:08, John Sichi wrote:
bq.   
trunk/contrib/src/java/org/apache/hadoop/hive/contrib/util/typedbytes/TypedBytesWritableInput.java,
 line 385
bq.   https://reviews.apache.org/r/1098/diff/1/?file=25036#file25036line385
bq.  
bq.   Fix indentation for the try block.

properly formatted the try block


- chinna


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1098/#review1101
---


On 2011-07-13 14:28:47, chinna wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1098/
bq.  ---
bq.  
bq.  (Updated 2011-07-13 14:28:47)
bq.  
bq.  
bq.  Review request for hive and John Sichi.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  There are couple of resource leaks.
bq.  For example,
bq.  In CliDriver.java, Method :- processReader() the buffered reader is not 
closed.
bq.  
bq.  Also there are risk(s) of resource(s) getting leaked , in such cases 
refactored the code to move closing of resources in finally block.
bq.  
bq.  
bq.  This addresses bug HIVE-1884.
bq.  https://issues.apache.org/jira/browse/HIVE-1884
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/cli/src/java/org/apache/hadoop/hive/cli/CliDriver.java 1145950 
bq.
trunk/contrib/src/java/org/apache/hadoop/hive/contrib/util/typedbytes/TypedBytesWritableInput.java
 1145950 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java 1145950 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/io/RCFileInputFormat.java 
1145950 
bq.  
bq.  Diff: https://reviews.apache.org/r/1098/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  All unit tests passed
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  chinna
bq.  
bq.



 Potential risk of resource leaks in Hive
 

 Key: HIVE-1884
 URL: https://issues.apache.org/jira/browse/HIVE-1884
 Project: Hive
  Issue Type: Bug
  Components: CLI, Metastore, Query Processor, Server Infrastructure
Affects Versions: 0.3.0, 0.4.0, 0.4.1, 0.5.0, 0.6.0
 Environment: Hive 0.6.0, Hadoop 0.20.1
 SUSE Linux Enterprise Server 11 (i586)
Reporter: Mohit Sikri
Assignee: Chinna Rao Lalam
 Attachments: HIVE-1884.1.PATCH, HIVE-1884.2.patch, HIVE-1884.3.patch, 
 HIVE-1884.4.patch, HIVE-1884.5.patch


 h3.There are couple of resource leaks.
 h4.For example,
 In CliDriver.java, Method :- processReader() the buffered reader is not 
 closed.
 h3.Also there are risk(s) of  resource(s) getting leaked , in such cases we 
 need to re factor the code to move closing of resources in finally block.
 h4. For Example :- 
 In Throttle.java   Method:- checkJobTracker() , the following code snippet 
 might cause resource leak.
 {code}
 InputStream in = url.openStream();
 in.read(buffer);
 in.close();
 {code}
 Ideally and as per the best coding practices it should be like below
 {code}
 InputStream in=null;
 try   {
 in = url.openStream();
 int numRead = in.read(buffer);
 }
 finally {
IOUtils.closeStream(in);
 }
 {code}
 Similar cases, were found in ExplainTask.java, DDLTask.java etc.Need to re 
 factor all such occurrences.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2086) Data loss with external table

2011-07-19 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067833#comment-13067833
 ] 

jirapos...@reviews.apache.org commented on HIVE-2086:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/604/
---

(Updated 2011-07-19 16:50:07.742470)


Review request for hive.


Changes
---

The bug was fixed as a part of HIVE-1218, and this diff adds some regression 
tests for this particular issue.


Summary
---

Review request for HIVE-2086.  The external parameters were being set, but then 
being zeroed out by a call to params.clear(), resulting in an external table 
that wasn't marked as external.


This addresses bug HIVE-2086.
https://issues.apache.org/jira/browse/HIVE-2086


Diffs (updated)
-

  build-common.xml b6807fa 
  data/files/ext_test/test.dat PRE-CREATION 
  ql/src/test/org/apache/hadoop/hive/ql/QTestUtil.java 6d742e5 
  ql/src/test/queries/clientpositive/create_like.q 3f8e58d 
  ql/src/test/results/clientpositive/create_like.q.out 118235e 

Diff: https://reviews.apache.org/r/604/diff


Testing
---

Automated tests added


Thanks,

Jonathan



 Data loss with external table
 -

 Key: HIVE-2086
 URL: https://issues.apache.org/jira/browse/HIVE-2086
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Affects Versions: 0.7.0
 Environment: Amazon  elastics mapreduce cluster
Reporter: Q Long
Assignee: Jonathan Natkins
 Attachments: HIVE-2086.1.patch, HIVE-2086.2.patch, create_like.q.out


 Data loss when using create external table like statement. 
 1) Set up an external table S, point to location L. Populate data in S.
 2) Create another external table T, using statement like this:
 create external table T like S location L
Make sure table T point to the same location as the original table S.
 3) Query table T, see the same set of data in S.
 4) drop table T.
 5) Query table S will return nothing, and location L is deleted. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2286) ClassCastException when building index with security.authorization turned on

2011-07-19 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067856#comment-13067856
 ] 

jirapos...@reviews.apache.org commented on HIVE-2286:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1137/
---

(Updated 2011-07-19 17:37:34.434426)


Review request for hive, John Sichi and Ning Zhang.


Changes
---

Add a set of helpers to save/restore query specific session state. As far as I 
know, there isn't really anything else command specific that needs to be done.
The other change here is to add the resetTaskIds flag = false which I think 
should have been the case all along if we wanted these queries to be properly 
re-entrant. I guess in the case of ALTER INDEX, where nesting was not a big 
issue this was less likely to break?


Summary
---

Save the original HiveOperation/commandType when we generate the index builder 
task and restore it after we're done generating the task so that the 
authorization checks make the right decision when deciding what to do.


This addresses bug HIVE-2286.
https://issues.apache.org/jira/browse/HIVE-2286


Diffs (updated)
-

  ql/src/test/results/clientpositive/index_bitmap.q.out 8f41ce3 
  ql/src/test/results/clientpositive/index_auto_unused.q.out 8fe3ac9 
  ql/src/test/results/clientpositive/index_auto_multiple.q.out 28add9f 
  ql/src/test/results/clientpositive/index_auto_partitioned.q.out b332a8b 
  ql/src/test/results/clientpositive/index_auto_file_format.q.out 8430296 
  ql/src/test/results/clientpositive/index_auto.q.out 222db34 
  ql/src/test/results/clientpositive/index_auth.q.out PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/Driver.java b278ffe 
  ql/src/java/org/apache/hadoop/hive/ql/index/bitmap/BitmapIndexHandler.java 
61bbbf5 
  ql/src/java/org/apache/hadoop/hive/ql/index/compact/CompactIndexHandler.java 
7c91946 
  ql/src/test/queries/clientpositive/index_auth.q PRE-CREATION 
  ql/src/test/results/clientnegative/index_compact_entry_limit.q.out fcb2673 
  ql/src/test/results/clientnegative/index_compact_size_limit.q.out fcb2673 
  ql/src/test/results/clientpositive/index_bitmap1.q.out 9f638f5 
  ql/src/test/results/clientpositive/index_bitmap2.q.out e901477 
  ql/src/test/results/clientpositive/index_bitmap3.q.out 8091359 
  ql/src/test/results/clientpositive/index_bitmap_auto.q.out e73877a 
  ql/src/test/results/clientpositive/index_bitmap_auto_partitioned.q.out 
4c9efd1 
  ql/src/test/results/clientpositive/index_bitmap_rc.q.out 9bd3c98 
  ql/src/test/results/clientpositive/index_compact.q.out c339ec9 
  ql/src/test/results/clientpositive/index_compact_1.q.out 34ba3ca 
  ql/src/test/results/clientpositive/index_compact_2.q.out e8ce238 
  ql/src/test/results/clientpositive/index_compact_3.q.out d39556d 
  ql/src/test/results/clientpositive/index_creation.q.out 532f07e 

Diff: https://reviews.apache.org/r/1137/diff


Testing
---

Added new testcase to TestCliDriver: index_auth.q


Thanks,

Syed



 ClassCastException when building index with security.authorization turned on
 

 Key: HIVE-2286
 URL: https://issues.apache.org/jira/browse/HIVE-2286
 Project: Hive
  Issue Type: Bug
Reporter: Syed S. Albiz
Assignee: Syed S. Albiz
 Attachments: HIVE-2286.1.patch, HIVE-2286.2.patch


 When trying to build an index with authorization checks turned on, hive 
 issues the following ClassCastException:
 org.apache.hadoop.hive.ql.parse.DDLSemanticAnalyzer cannot be cast to
 org.apache.hadoop.hive.ql.parse.SemanticAnalyzer
  at
 org.apache.hadoop.hive.ql.Driver.doAuthorization(Driver.java:540)
  at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:431)
  at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:335)
  at org.apache.hadoop.hive.ql.Driver.run(Driver.java:848)
  at
 org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:224)
  at
 org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:358)
  at
 org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:293)
  at
 org.apache.hadoop.hive.cli.CliDriver.processReader(CliDriver.java:385)
  at
 org.apache.hadoop.hive.cli.CliDriver.processFile(CliDriver.java:392)
  at
 org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:567)
  at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
  at
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.jav
 a:39)
  at
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessor
 Impl.java:25)
  at 

[jira] [Commented] (HIVE-2156) Improve error messages emitted during task execution

2011-07-19 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067858#comment-13067858
 ] 

jirapos...@reviews.apache.org commented on HIVE-2156:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/777/
---

(Updated 2011-07-19 17:42:08.908937)


Review request for hive and John Sichi.


Changes
---

Rebased patch against latest trunk


Summary
---

- Add local error messages to point to job logs and provide TaskIDs
- Add a timeout to the fetching of task logs and errors


This addresses bug HIVE-2156.
https://issues.apache.org/jira/browse/HIVE-2156


Diffs (updated)
-

  build-common.xml b6807fa 
  common/src/java/org/apache/hadoop/hive/conf/HiveConf.java d29cc77 
  conf/hive-default.xml 0ca7085 
  ql/build.xml 449b47a 
  ql/src/java/org/apache/hadoop/hive/ql/exec/HadoopJobExecHelper.java 40d2644 
  ql/src/java/org/apache/hadoop/hive/ql/exec/JobDebugger.java PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/exec/MapRedTask.java 53769a0 
  ql/src/java/org/apache/hadoop/hive/ql/exec/MapredLocalTask.java 691f038 
  ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java cd3de76 
  ql/src/test/queries/clientnegative/minimr_broken_pipe.q PRE-CREATION 
  ql/src/test/results/clientnegative/dyn_part3.q.out 5f4df65 
  ql/src/test/results/clientnegative/index_compact_entry_limit.q.out fcb2673 
  ql/src/test/results/clientnegative/index_compact_size_limit.q.out fcb2673 
  ql/src/test/results/clientnegative/minimr_broken_pipe.q.out PRE-CREATION 
  ql/src/test/results/clientnegative/script_broken_pipe1.q.out d33d2cc 
  ql/src/test/results/clientnegative/script_broken_pipe2.q.out afbaa44 
  ql/src/test/results/clientnegative/script_broken_pipe3.q.out fe8f757 
  ql/src/test/results/clientnegative/script_error.q.out c72d780 
  ql/src/test/results/clientnegative/udf_reflect_neg.q.out f2082a3 
  ql/src/test/results/clientnegative/udf_test_error.q.out 5fd9a00 
  ql/src/test/results/clientnegative/udf_test_error_reduce.q.out ddc5e5b 
  ql/src/test/templates/TestNegativeCliDriver.vm ec13f79 

Diff: https://reviews.apache.org/r/777/diff


Testing
---

Tested TestNegativeCliDriver in both local and miniMR mode


Thanks,

Syed



 Improve error messages emitted during task execution
 

 Key: HIVE-2156
 URL: https://issues.apache.org/jira/browse/HIVE-2156
 Project: Hive
  Issue Type: Improvement
Reporter: Syed S. Albiz
Assignee: Syed S. Albiz
 Attachments: HIVE-2156.1.patch, HIVE-2156.2.patch, HIVE-2156.4.patch


 Follow-up to HIVE-1731
 A number of issues were related to reporting errors from task execution and 
 surfacing these in a more useful form.
 Currently a cryptic message with Execution Error and a return code and 
 class name of the task is emitted.
 The most useful log messages here are emitted to the local logs, which can be 
 found through jobtracker. Having either a pointer to these logs as part of 
 the error message or the actual content would improve the usefulness 
 substantially. It may also warrant looking into how the underlying error 
 reporting through Hadoop is done and if more information can be propagated up 
 from there.
 Specific issues raised in  HIVE-1731:
 FAILED: Execution Error, return code 2 from 
 org.apache.hadoop.hive.ql.exec.MapRedTask
 * issue was in regexp_extract syntax
 FAILED: Execution Error, return code 1 from 
 org.apache.hadoop.hive.ql.exec.DDLTask
 * tried: desc table_does_not_exist;

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2128) Automatic Indexing with multiple tables

2011-07-19 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067878#comment-13067878
 ] 

jirapos...@reviews.apache.org commented on HIVE-2128:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1010/#review1112
---



ql/src/java/org/apache/hadoop/hive/ql/index/HiveIndexQueryContext.java
https://reviews.apache.org/r/1010/#comment2271

Why was this comment truncated?



ql/src/test/queries/clientpositive/index_auto_mult_tables.q
https://reviews.apache.org/r/1010/#comment2273

All of these SELECT statements need ORDER BY for determinism.


- John


On 2011-07-19 03:15:17, Syed Albiz wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1010/
bq.  ---
bq.  
bq.  (Updated 2011-07-19 03:15:17)
bq.  
bq.  
bq.  Review request for hive and John Sichi.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Grab the indexed tables during optimized query generation, grab the 
associated path URIs, and keep those around in the Configuration object. When 
the job is passed to ExecDriver, this data is extracted and used in 
HiveIndexedInputFormat to decide whether to use the index file or delegate to 
the parent (HiveInputFormat) class. Not sure if this is robust. 
bq.  
bq.  
bq.  This addresses bug HIVE-2128.
bq.  https://issues.apache.org/jira/browse/HIVE-2128
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.ql/src/test/results/clientpositive/index_auto_self_join.q.out 
PRE-CREATION 
bq.ql/src/test/results/clientpositive/index_auto_mult_tables_compact.q.out 
PRE-CREATION 
bq.ql/src/test/queries/clientpositive/index_auto_self_join.q PRE-CREATION 
bq.ql/src/test/results/clientpositive/index_auto_mult_tables.q.out 
PRE-CREATION 
bq.ql/src/java/org/apache/hadoop/hive/ql/plan/MapredWork.java a03a9a6 
bq.ql/src/test/queries/clientpositive/index_auto_mult_tables.q PRE-CREATION 
bq.ql/src/test/queries/clientpositive/index_auto_mult_tables_compact.q 
PRE-CREATION 
bq.ql/src/java/org/apache/hadoop/hive/ql/index/HiveIndexResult.java b9b586e 
bq.ql/src/java/org/apache/hadoop/hive/ql/index/HiveIndexedInputFormat.java 
f1ee95d 
bq.
ql/src/java/org/apache/hadoop/hive/ql/index/bitmap/BitmapIndexHandler.java 
61bbbf5 
bq.
ql/src/java/org/apache/hadoop/hive/ql/index/compact/CompactIndexHandler.java 
7c91946 
bq.
ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/index/IndexWhereProcessor.java
 dbc489f 
bq.ql/src/java/org/apache/hadoop/hive/ql/index/HiveIndexQueryContext.java 
617723e 
bq.  
bq.  Diff: https://reviews.apache.org/r/1010/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  added new testcase index_auto_mult_tables.q
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Syed
bq.  
bq.



 Automatic Indexing with multiple tables
 ---

 Key: HIVE-2128
 URL: https://issues.apache.org/jira/browse/HIVE-2128
 Project: Hive
  Issue Type: Improvement
  Components: Indexing
Affects Versions: 0.8.0
Reporter: Russell Melick
Assignee: Syed S. Albiz
 Attachments: HIVE-2128.1.patch, HIVE-2128.1.patch, HIVE-2128.2.patch, 
 HIVE-2128.4.patch, HIVE-2128.5.patch


 Make automatic indexing work with jobs which access multiple tables.  We'll 
 probably need to modify the way that the index input format works in order to 
 associate index formats/files with specific tables.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2139) Enables HiveServer to accept -hiveconf option

2011-07-19 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067933#comment-13067933
 ] 

jirapos...@reviews.apache.org commented on HIVE-2139:
-



bq.  On 2011-07-19 00:08:19, Carl Steinbach wrote:
bq.   metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java, 
line 3382
bq.   https://reviews.apache.org/r/958/diff/2/?file=26007#file26007line3382
bq.  
bq.   This should be $METASTORE_PORT instead of HIVE_PORT (see 
metastore.sh).

I missed that, thanks.


bq.  On 2011-07-19 00:08:19, Carl Steinbach wrote:
bq.   common/build.xml, line 30
bq.   https://reviews.apache.org/r/958/diff/2/?file=26001#file26001line30
bq.  
bq.   This target overrides the definition of 'compile' in 
build-common.xml, but neglects to include this action which copies over 
hive-log4j.properties:
bq.   
bq.   copy todir=${build.classes} failonerror=false
bq. fileset dir=${src.dir}/conf/
bq.   /copy
bq.   
bq.   If this is added in I think the missing hive-log4j.properties issue 
will be resolved.

Ok -- FYI I originally copied this section from cli/build.xml, not 
build-common.xml. (it's the same except for this missing copy)


- Patrick


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/958/#review1106
---


On 2011-07-15 23:51:55, Patrick Hunt wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/958/
bq.  ---
bq.  
bq.  (Updated 2011-07-15 23:51:55)
bq.  
bq.  
bq.  Review request for hive and Carl Steinbach.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  This patch updates HiveServer and HiveMetastore to add proper cli handling 
- similar to that used in CliDriver (ie GnuParser).
bq.  
bq.  There's a common HiveCli class that's used by both main classes.
bq.  
bq.  I've attempted to make the cli's backward compatible with the prior 
command line processing. Notice I've deprecated (via warnings, but the code 
still runs) if the old style CLI usage is used.
bq.  
bq.  commands such as the following now work as expected:
bq.  
bq.  bin/hive --service hiveserver -t 200 -p 12000 --hiveconf 
hive.root.logger=DEBUG,console
bq.  
bq.  as does the following which generates usage information:
bq.  
bq.  bin/hive --service hiveserver -h
bq.  
bq.  Note: HiveMetastore as not initializing log4j, I updated the code to do 
similar to HiveServer (otw the hiveconf hive.root.logger option above didn't 
work).
bq.  
bq.  
bq.  This addresses bug HIVE-2139.
bq.  https://issues.apache.org/jira/browse/HIVE-2139
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.ql/src/test/org/apache/hadoop/hive/ql/history/TestHiveHistory.java 
728958d 
bq.service/src/java/org/apache/hadoop/hive/service/HiveServer.java ea04be9 
bq.ql/src/java/org/apache/hadoop/hive/ql/exec/ExecDriver.java 090ecfc 
bq.ql/src/java/org/apache/hadoop/hive/ql/session/SessionState.java 61393c8 
bq.bin/ext/hiveserver.sh b5edce4 
bq.bin/ext/metastore.sh db15f6e 
bq.cli/src/java/org/apache/hadoop/hive/cli/CliDriver.java bd9f6b1 
bq.common/build.xml 0969e16 
bq.common/ivy.xml bee3bf0 
bq.common/src/java/org/apache/hadoop/hive/common/LogUtils.java PRE-CREATION 
bq.common/src/java/org/apache/hadoop/hive/common/cli/CommonCliOptions.java 
PRE-CREATION 
bq.hwi/src/java/org/apache/hadoop/hive/hwi/HWISessionItem.java 1bf8c97 
bq.metastore/ivy.xml 83218fc 
bq.metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 
e540c63 
bq.  
bq.  Diff: https://reviews.apache.org/r/958/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  I couldn't find any tests for these changes, so I verfied the changes 
manually.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Patrick
bq.  
bq.



 Enables HiveServer to accept -hiveconf option
 -

 Key: HIVE-2139
 URL: https://issues.apache.org/jira/browse/HIVE-2139
 Project: Hive
  Issue Type: Improvement
  Components: CLI
 Environment: Linux + CDH3u0 (Hive 0.7.0+27.1-2~lucid-cdh3)
Reporter: Kazuki Ohta
Assignee: Patrick Hunt
 Attachments: HIVE-2139.patch, HIVE-2139.patch


 Currently, I'm trying to test HiveHBaseIntegration on HiveServer. But it 
 doesn't seem to accept -hiveconf command.
 {code}
 hive --service hiveserver -hiveconf hbase.zookeeper.quorum=hdp0,hdp1,hdp2
 Starting Hive Thrift Server
 java.lang.NumberFormatException: For input string: -hiveconf
 at 
 java.lang.NumberFormatException.forInputString(NumberFormatException.java:48)
 

[jira] [Commented] (HIVE-2139) Enables HiveServer to accept -hiveconf option

2011-07-19 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067934#comment-13067934
 ] 

jirapos...@reviews.apache.org commented on HIVE-2139:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/958/
---

(Updated 2011-07-19 20:15:07.278811)


Review request for hive and Carl Steinbach.


Changes
---

Updated with Carl's comments.


Summary
---

This patch updates HiveServer and HiveMetastore to add proper cli handling - 
similar to that used in CliDriver (ie GnuParser).

There's a common HiveCli class that's used by both main classes.

I've attempted to make the cli's backward compatible with the prior command 
line processing. Notice I've deprecated (via warnings, but the code still 
runs) if the old style CLI usage is used.

commands such as the following now work as expected:

bin/hive --service hiveserver -t 200 -p 12000 --hiveconf 
hive.root.logger=DEBUG,console

as does the following which generates usage information:

bin/hive --service hiveserver -h

Note: HiveMetastore as not initializing log4j, I updated the code to do similar 
to HiveServer (otw the hiveconf hive.root.logger option above didn't work).


This addresses bug HIVE-2139.
https://issues.apache.org/jira/browse/HIVE-2139


Diffs (updated)
-

  common/build.xml 0969e16 
  common/ivy.xml bee3bf0 
  common/src/java/org/apache/hadoop/hive/common/LogUtils.java PRE-CREATION 
  common/src/java/org/apache/hadoop/hive/common/cli/CommonCliOptions.java 
PRE-CREATION 
  hwi/src/java/org/apache/hadoop/hive/hwi/HWISessionItem.java 1bf8c97 
  metastore/ivy.xml 83218fc 
  metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 
e540c63 
  ql/src/java/org/apache/hadoop/hive/ql/exec/ExecDriver.java 090ecfc 
  ql/src/java/org/apache/hadoop/hive/ql/session/SessionState.java 61393c8 
  ql/src/test/org/apache/hadoop/hive/ql/history/TestHiveHistory.java 728958d 
  service/src/java/org/apache/hadoop/hive/service/HiveServer.java ea04be9 
  cli/src/java/org/apache/hadoop/hive/cli/CliDriver.java bd9f6b1 
  bin/ext/hiveserver.sh b5edce4 
  bin/ext/metastore.sh db15f6e 

Diff: https://reviews.apache.org/r/958/diff


Testing
---

I couldn't find any tests for these changes, so I verfied the changes manually.


Thanks,

Patrick



 Enables HiveServer to accept -hiveconf option
 -

 Key: HIVE-2139
 URL: https://issues.apache.org/jira/browse/HIVE-2139
 Project: Hive
  Issue Type: Improvement
  Components: CLI
 Environment: Linux + CDH3u0 (Hive 0.7.0+27.1-2~lucid-cdh3)
Reporter: Kazuki Ohta
Assignee: Patrick Hunt
 Attachments: HIVE-2139.patch, HIVE-2139.patch


 Currently, I'm trying to test HiveHBaseIntegration on HiveServer. But it 
 doesn't seem to accept -hiveconf command.
 {code}
 hive --service hiveserver -hiveconf hbase.zookeeper.quorum=hdp0,hdp1,hdp2
 Starting Hive Thrift Server
 java.lang.NumberFormatException: For input string: -hiveconf
 at 
 java.lang.NumberFormatException.forInputString(NumberFormatException.java:48)
 at java.lang.Integer.parseInt(Integer.java:449)
 at java.lang.Integer.parseInt(Integer.java:499)
 at org.apache.hadoop.hive.service.HiveServer.main(HiveServer.java:382)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
 at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
 at java.lang.reflect.Method.invoke(Method.java:597)
 at org.apache.hadoop.util.RunJar.main(RunJar.java:186)
 {code}
 Therefore, you need to throw the query like set 
 hbase.zookeeper.quorum=hdp0,hdp1,hdp2 everytime. It's not convenient for 
 separating the configuration between server-side and client-side.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1078) CREATE VIEW followup: CREATE OR REPLACE

2011-07-19 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13068111#comment-13068111
 ] 

jirapos...@reviews.apache.org commented on HIVE-1078:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1058/
---

(Updated 2011-07-20 01:01:53.872666)


Review request for hive.


Changes
---

Fixed issues in above comment


Summary
---

https://issues.apache.org/jira/browse/HIVE-1078


This addresses bug HIVE-1078.
https://issues.apache.org/jira/browse/HIVE-1078


Diffs (updated)
-

  
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveAlterHandler.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CreateViewDesc.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view1.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view2.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view3.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view4.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_or_replace_view.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view1.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view2.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view3.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view4.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_or_replace_view.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_view.q.out
 1146902 

Diff: https://reviews.apache.org/r/1058/diff


Testing
---

Passes unit tests


Thanks,

Charles



 CREATE VIEW followup:  CREATE OR REPLACE
 

 Key: HIVE-1078
 URL: https://issues.apache.org/jira/browse/HIVE-1078
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: John Sichi
Assignee: Charles Chen
 Attachments: HIVE-1078v3.patch, HIVE-1078v4.patch, HIVE-1078v5.patch, 
 HIVE-1078v6.patch


 Currently, replacing a view requires
 DROP VIEW v;
 CREATE VIEW v AS new-definition;
 CREATE OR REPLACE would allow these to be combined into a single operation.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2209) Provide a way by which ObjectInspectorUtils.compare can be extended by the caller for comparing maps which are part of the object

2011-07-19 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13068128#comment-13068128
 ] 

jirapos...@reviews.apache.org commented on HIVE-2209:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/879/
---

(Updated 2011-07-20 02:25:36.169590)


Review request for hive and Yongqiang He.


Summary
---

Patch for HIVE-2209


This addresses bug HIVE-2209.
https://issues.apache.org/jira/browse/HIVE-2209


Diffs
-

  
serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/CrossMapEqualComparer.java
 PRE-CREATION 
  
serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/FullMapEqualComparer.java
 PRE-CREATION 
  
serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/MapEqualComparer.java
 PRE-CREATION 
  
serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ObjectInspectorUtils.java
 2b77072 
  
serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/SimpleMapEqualComparer.java
 PRE-CREATION 
  
serde/src/test/org/apache/hadoop/hive/serde2/objectinspector/TestCrossMapEqualComparer.java
 PRE-CREATION 
  
serde/src/test/org/apache/hadoop/hive/serde2/objectinspector/TestFullMapEqualComparer.java
 PRE-CREATION 
  
serde/src/test/org/apache/hadoop/hive/serde2/objectinspector/TestSimpleMapEqualComparer.java
 PRE-CREATION 

Diff: https://reviews.apache.org/r/879/diff


Testing
---

Tests added


Thanks,

Krishna



 Provide a way by which ObjectInspectorUtils.compare can be extended by the 
 caller for comparing maps which are part of the object
 -

 Key: HIVE-2209
 URL: https://issues.apache.org/jira/browse/HIVE-2209
 Project: Hive
  Issue Type: Improvement
Reporter: Krishna Kumar
Assignee: Krishna Kumar
Priority: Minor
 Attachments: HIVE-2209v0.patch, HIVE-2209v2.patch, HIVE2209v1.patch


 Now ObjectInspectorUtils.compare throws an exception if a map is contained 
 (recursively) within the objects being compared. Two obvious implementations 
 are
 - a simple map comparer which assumes keys of the first map can be used to 
 fetch values from the second
 - a 'cross-product' comparer which compares every pair of key-value pairs in 
 the two maps, and calls a match if and only if all pairs are matched
 Note that it would be difficult to provide a transitive 
 greater-than/less-than indication with maps so that is not in scope. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1078) CREATE VIEW followup: CREATE OR REPLACE

2011-07-18 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066820#comment-13066820
 ] 

jirapos...@reviews.apache.org commented on HIVE-1078:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1058/
---

(Updated 2011-07-18 07:01:02.376155)


Review request for hive.


Changes
---

Added back check: New view must contain all old partition columns


Summary
---

https://issues.apache.org/jira/browse/HIVE-1078


This addresses bug HIVE-1078.
https://issues.apache.org/jira/browse/HIVE-1078


Diffs (updated)
-

  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_view.q.out
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_or_replace_view.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view4.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CreateViewDesc.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view1.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view2.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view3.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view4.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_or_replace_view.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view1.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view2.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view3.q.out
 PRE-CREATION 

Diff: https://reviews.apache.org/r/1058/diff


Testing
---

Passes unit tests


Thanks,

Charles



 CREATE VIEW followup:  CREATE OR REPLACE
 

 Key: HIVE-1078
 URL: https://issues.apache.org/jira/browse/HIVE-1078
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: John Sichi
Assignee: Charles Chen
 Attachments: HIVE-1078v3.patch, HIVE-1078v4.patch, HIVE-1078v5.patch


 Currently, replacing a view requires
 DROP VIEW v;
 CREATE VIEW v AS new-definition;
 CREATE OR REPLACE would allow these to be combined into a single operation.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1078) CREATE VIEW followup: CREATE OR REPLACE

2011-07-18 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066818#comment-13066818
 ] 

jirapos...@reviews.apache.org commented on HIVE-1078:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1058/#review1089
---



http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
https://reviews.apache.org/r/1058/#comment2223

Oops the comment should be here


- Charles


On 2011-07-18 07:01:02, Charles Chen wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1058/
bq.  ---
bq.  
bq.  (Updated 2011-07-18 07:01:02)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  https://issues.apache.org/jira/browse/HIVE-1078
bq.  
bq.  
bq.  This addresses bug HIVE-1078.
bq.  https://issues.apache.org/jira/browse/HIVE-1078
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_view.q.out
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_or_replace_view.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view4.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CreateViewDesc.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view1.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view2.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view3.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view4.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_or_replace_view.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view1.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view2.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view3.q.out
 PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1058/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Passes unit tests
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Charles
bq.  
bq.



 CREATE VIEW followup:  CREATE OR REPLACE
 

 Key: HIVE-1078
 URL: https://issues.apache.org/jira/browse/HIVE-1078
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: John Sichi
Assignee: Charles Chen
 Attachments: HIVE-1078v3.patch, HIVE-1078v4.patch, HIVE-1078v5.patch


 Currently, replacing a view requires
 DROP VIEW v;
 CREATE VIEW v AS new-definition;
 CREATE OR REPLACE would allow these to be combined into a single operation.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1078) CREATE VIEW followup: CREATE OR REPLACE

2011-07-18 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066822#comment-13066822
 ] 

jirapos...@reviews.apache.org commented on HIVE-1078:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1058/#review1091
---



http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
https://reviews.apache.org/r/1058/#comment2225

Here ***


- Charles


On 2011-07-18 07:01:02, Charles Chen wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1058/
bq.  ---
bq.  
bq.  (Updated 2011-07-18 07:01:02)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  https://issues.apache.org/jira/browse/HIVE-1078
bq.  
bq.  
bq.  This addresses bug HIVE-1078.
bq.  https://issues.apache.org/jira/browse/HIVE-1078
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_view.q.out
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_or_replace_view.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view4.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CreateViewDesc.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view1.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view2.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view3.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view4.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_or_replace_view.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view1.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view2.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view3.q.out
 PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1058/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Passes unit tests
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Charles
bq.  
bq.



 CREATE VIEW followup:  CREATE OR REPLACE
 

 Key: HIVE-1078
 URL: https://issues.apache.org/jira/browse/HIVE-1078
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: John Sichi
Assignee: Charles Chen
 Attachments: HIVE-1078v3.patch, HIVE-1078v4.patch, HIVE-1078v5.patch


 Currently, replacing a view requires
 DROP VIEW v;
 CREATE VIEW v AS new-definition;
 CREATE OR REPLACE would allow these to be combined into a single operation.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1078) CREATE VIEW followup: CREATE OR REPLACE

2011-07-18 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066819#comment-13066819
 ] 

jirapos...@reviews.apache.org commented on HIVE-1078:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1058/#review1088
---



http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
https://reviews.apache.org/r/1058/#comment

Haha ok so now I remember why we need this:

(unit test clientnegative/create_or_replace_view1.q):
-- New view must contain all old partition columns.

drop view v;
create view v partitioned on (ds, hr) as select * from srcpart;
create or replace view v as select key, hr from srcpart;

Here, we don't explicitly change the partition columns, so the new view 
inherits the old columns, but those columns better still exist.


- Charles


On 2011-07-18 07:01:02, Charles Chen wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1058/
bq.  ---
bq.  
bq.  (Updated 2011-07-18 07:01:02)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  https://issues.apache.org/jira/browse/HIVE-1078
bq.  
bq.  
bq.  This addresses bug HIVE-1078.
bq.  https://issues.apache.org/jira/browse/HIVE-1078
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_view.q.out
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_or_replace_view.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view4.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CreateViewDesc.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view1.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view2.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view3.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view4.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_or_replace_view.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view1.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view2.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view3.q.out
 PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1058/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Passes unit tests
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Charles
bq.  
bq.



 CREATE VIEW followup:  CREATE OR REPLACE
 

 Key: HIVE-1078
 URL: https://issues.apache.org/jira/browse/HIVE-1078
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: John Sichi
Assignee: Charles Chen
 Attachments: HIVE-1078v3.patch, HIVE-1078v4.patch, HIVE-1078v5.patch


 Currently, replacing a view requires
 DROP VIEW v;
 CREATE VIEW v AS new-definition;
 CREATE OR REPLACE would allow these to be combined into a single operation.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1218) CREATE TABLE t LIKE some_view should create a new empty base table, but instead creates a copy of view

2011-07-18 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067159#comment-13067159
 ] 

jirapos...@reviews.apache.org commented on HIVE-1218:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1057/#review1096
---


r9 passes unit tests

- Charles


On 2011-07-16 22:51:22, Charles Chen wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1057/
bq.  ---
bq.  
bq.  (Updated 2011-07-16 22:51:22)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  https://issues.apache.org/jira/browse/HIVE-1218
bq.  
bq.  
bq.  This addresses bug HIVE-1218.
bq.  https://issues.apache.org/jira/browse/HIVE-1218
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
 1146883 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
 1146883 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CreateTableLikeDesc.java
 1146883 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_like.q
 1146883 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_like_view.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_like.q.out
 1146883 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_like_view.q.out
 PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1057/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Unit tests passed
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Charles
bq.  
bq.



 CREATE TABLE t LIKE some_view should create a new empty base table, but 
 instead creates a copy of view
 --

 Key: HIVE-1218
 URL: https://issues.apache.org/jira/browse/HIVE-1218
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Affects Versions: 0.6.0
Reporter: John Sichi
Assignee: Charles Chen
 Fix For: 0.8.0

 Attachments: HIVE-1218v0.patch, HIVE-1218v2.patch, HIVE-1218v3.patch, 
 HIVE-1218v4.patch, HIVE-1218v5.patch, HIVE-1218v6.patch, HIVE-1218v7.patch, 
 HIVE-1218v8.patch


 I think it should copy only the column definitions from the view metadata.  
 Currently it is copying the entire descriptor, resulting in a new view 
 instead of a new base table.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1078) CREATE VIEW followup: CREATE OR REPLACE

2011-07-18 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067160#comment-13067160
 ] 

jirapos...@reviews.apache.org commented on HIVE-1078:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1058/#review1097
---


r7 passes unit tests

- Charles


On 2011-07-18 07:01:02, Charles Chen wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1058/
bq.  ---
bq.  
bq.  (Updated 2011-07-18 07:01:02)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  https://issues.apache.org/jira/browse/HIVE-1078
bq.  
bq.  
bq.  This addresses bug HIVE-1078.
bq.  https://issues.apache.org/jira/browse/HIVE-1078
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_view.q.out
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_or_replace_view.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view4.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CreateViewDesc.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view1.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view2.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view3.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view4.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_or_replace_view.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view1.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view2.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view3.q.out
 PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1058/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Passes unit tests
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Charles
bq.  
bq.



 CREATE VIEW followup:  CREATE OR REPLACE
 

 Key: HIVE-1078
 URL: https://issues.apache.org/jira/browse/HIVE-1078
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: John Sichi
Assignee: Charles Chen
 Attachments: HIVE-1078v3.patch, HIVE-1078v4.patch, HIVE-1078v5.patch, 
 HIVE-1078v6.patch


 Currently, replacing a view requires
 DROP VIEW v;
 CREATE VIEW v AS new-definition;
 CREATE OR REPLACE would allow these to be combined into a single operation.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2272) add TIMESTAMP data type

2011-07-18 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067222#comment-13067222
 ] 

jirapos...@reviews.apache.org commented on HIVE-2272:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1135/
---

(Updated 2011-07-18 19:12:08.745755)


Review request for hive.


Summary (updated)
---

Adds TIMESTAMP type to serde2 with both string (LazySimple) and binary 
(LazyBinary) serialization.
Supports SQL style jdbc timestamps of the format with nanosecond precision
-MM-DD HH:MM:SS[.fff...]


This addresses bug HIVE-2272.
https://issues.apache.org/jira/browse/HIVE-2272


Diffs
-

  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/PrimitiveObjectInspectorFactory.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/PrimitiveObjectInspectorUtils.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/SettableTimestampObjectInspector.java
 PRE-CREATION 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/TimestampObjectInspector.java
 PRE-CREATION 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/WritableTimestampObjectInspector.java
 PRE-CREATION 
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazy/LazyFactory.java 
1146895 
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazy/LazyTimestamp.java 
PRE-CREATION 
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazy/LazyUtils.java 
1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazy/objectinspector/primitive/LazyPrimitiveObjectInspectorFactory.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazy/objectinspector/primitive/LazyTimestampObjectInspector.java
 PRE-CREATION 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazybinary/LazyBinaryFactory.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazybinary/LazyBinarySerDe.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazybinary/LazyBinaryTimestamp.java
 PRE-CREATION 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazybinary/LazyBinaryUtils.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ObjectInspectorConverters.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ObjectInspectorUtils.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/PrimitiveObjectInspector.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/JavaTimestampObjectInspector.java
 PRE-CREATION 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/PrimitiveObjectInspectorConverter.java
 1146895 
  trunk/ql/src/test/results/clientpositive/timestamp_3.q.out PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/timestamp_comparison.q.out 
PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/timestamp_udf.q.out PRE-CREATION 
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/SerDeUtils.java 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/binarysortable/BinarySortableSerDe.java
 1146895 
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/io/TimestampWritable.java 
PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/timestamp_2.q.out PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToFloat.java 1146895 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToInteger.java 1146895 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToLong.java 1146895 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToShort.java 1146895 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFToString.java 1146895 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFUnixTimeStamp.java 1146895 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFWeekOfYear.java 1146895 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFYear.java 1146895 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFAverage.java 
1146895 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFContextNGrams.java
 1146895 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFCorrelation.java
 1146895 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFCovariance.java
 1146895 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFCovarianceSample.java
 1146895 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFHistogramNumeric.java
 1146895 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFPercentileApprox.java
 1146895 
  

[jira] [Commented] (HIVE-2272) add TIMESTAMP data type

2011-07-18 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067223#comment-13067223
 ] 

jirapos...@reviews.apache.org commented on HIVE-2272:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1135/
---

(Updated 2011-07-18 19:13:31.962359)


Review request for hive.


Summary
---

Adds TIMESTAMP type to serde2 with both string (LazySimple) and binary 
(LazyBinary) serialization.
Supports SQL style jdbc timestamps of the format with nanosecond precision
-MM-DD HH:MM:SS[.fff...]


This addresses bug HIVE-2272.
https://issues.apache.org/jira/browse/HIVE-2272


Diffs (updated)
-

  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/PrimitiveObjectInspector.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/JavaTimestampObjectInspector.java
 PRE-CREATION 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/PrimitiveObjectInspectorConverter.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/PrimitiveObjectInspectorFactory.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/PrimitiveObjectInspectorUtils.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/SettableTimestampObjectInspector.java
 PRE-CREATION 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/TimestampObjectInspector.java
 PRE-CREATION 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/primitive/WritableTimestampObjectInspector.java
 PRE-CREATION 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazybinary/LazyBinarySerDe.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazybinary/LazyBinaryTimestamp.java
 PRE-CREATION 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazybinary/LazyBinaryUtils.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ObjectInspectorConverters.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ObjectInspectorUtils.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazybinary/LazyBinaryFactory.java
 1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazy/objectinspector/primitive/LazyTimestampObjectInspector.java
 PRE-CREATION 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazy/objectinspector/primitive/LazyPrimitiveObjectInspectorFactory.java
 1146895 
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazy/LazyFactory.java 
1146895 
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazy/LazyTimestamp.java 
PRE-CREATION 
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/lazy/LazyUtils.java 
1146895 
  
trunk/serde/src/java/org/apache/hadoop/hive/serde2/binarysortable/BinarySortableSerDe.java
 1146895 
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/io/TimestampWritable.java 
PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/timestamp_comparison.q.out 
PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/timestamp_udf.q.out PRE-CREATION 
  trunk/serde/src/java/org/apache/hadoop/hive/serde2/SerDeUtils.java 1146895 
  trunk/ql/src/test/results/clientpositive/timestamp_3.q.out PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/timestamp_2.q.out PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/wrong_column_type.q.out 1146895 
  trunk/ql/src/test/results/clientpositive/show_functions.q.out 1146895 
  trunk/ql/src/test/results/clientpositive/timestamp_1.q.out PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/invalid_t_create2.q.out 1146895 
  trunk/ql/src/test/results/clientnegative/invalid_t_transform.q.out 1146895 
  trunk/ql/src/test/results/clientnegative/invalid_t_alter2.q.out 1146895 
  trunk/ql/src/test/results/clientnegative/invalid_t_create1.q.out 1146895 
  trunk/ql/src/test/queries/clientpositive/timestamp_2.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/timestamp_3.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/timestamp_comparison.q PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/timestamp_udf.q PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/invalid_create_tbl1.q.out 1146895 
  trunk/ql/src/test/results/clientnegative/invalid_t_alter1.q.out 1146895 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFStdSample.java
 1146895 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFSum.java 
1146895 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFVariance.java
 1146895 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFVarianceSample.java
 1146895 
  

[jira] [Commented] (HIVE-2184) Few improvements in org.apache.hadoop.hive.ql.metadata.Hive.close()

2011-07-18 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067285#comment-13067285
 ] 

jirapos...@reviews.apache.org commented on HIVE-2184:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1088/#review1099
---



trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStoreClient.java
https://reviews.apache.org/r/1088/#comment2230

We should get rid of the member variable standAloneClient too since it is 
part of the dead code.

Now that we'll start calling shutdown (which we were skipping before), what 
will be the effect?



- John


On 2011-07-12 14:06:28, chinna wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1088/
bq.  ---
bq.  
bq.  (Updated 2011-07-12 14:06:28)
bq.  
bq.  
bq.  Review request for hive and John Sichi.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Few improvements in org.apache.hadoop.hive.ql.metadata.Hive.close()
bq.  
bq.  
bq.  This addresses bug HIVE-2184.
bq.  https://issues.apache.org/jira/browse/HIVE-2184
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStoreClient.java
 1144190 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java 1144190 
bq.  
bq.  Diff: https://reviews.apache.org/r/1088/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Ran all the testcases
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  chinna
bq.  
bq.



 Few improvements in org.apache.hadoop.hive.ql.metadata.Hive.close()
 ---

 Key: HIVE-2184
 URL: https://issues.apache.org/jira/browse/HIVE-2184
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Affects Versions: 0.5.0, 0.8.0
 Environment: Hadoop 0.20.1, Hive0.8.0 and SUSE Linux Enterprise 
 Server 10 SP2 (i586) - Kernel 2.6.16.60-0.21-smp (5)
Reporter: Chinna Rao Lalam
Assignee: Chinna Rao Lalam
 Attachments: HIVE-2184.1.patch, HIVE-2184.1.patch, HIVE-2184.patch


 1)Hive.close() will call HiveMetaStoreClient.close() in this method the 
 variable standAloneClient is never become true then client.shutdown() never 
 call.
 2)Hive.close() After calling metaStoreClient.close() need to make 
 metaStoreClient=null

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1884) Potential risk of resource leaks in Hive

2011-07-18 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067286#comment-13067286
 ] 

jirapos...@reviews.apache.org commented on HIVE-1884:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1098/#review1101
---



trunk/cli/src/java/org/apache/hadoop/hive/cli/CliDriver.java
https://reviews.apache.org/r/1098/#comment2233

Rename this to rc to avoid confusion with the processReader method name.



trunk/contrib/src/java/org/apache/hadoop/hive/contrib/util/typedbytes/TypedBytesWritableInput.java
https://reviews.apache.org/r/1098/#comment2234

Fix indentation for the try block.


- John


On 2011-07-13 14:28:47, chinna wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1098/
bq.  ---
bq.  
bq.  (Updated 2011-07-13 14:28:47)
bq.  
bq.  
bq.  Review request for hive and John Sichi.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  There are couple of resource leaks.
bq.  For example,
bq.  In CliDriver.java, Method :- processReader() the buffered reader is not 
closed.
bq.  
bq.  Also there are risk(s) of resource(s) getting leaked , in such cases 
refactored the code to move closing of resources in finally block.
bq.  
bq.  
bq.  This addresses bug HIVE-1884.
bq.  https://issues.apache.org/jira/browse/HIVE-1884
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.trunk/cli/src/java/org/apache/hadoop/hive/cli/CliDriver.java 1145950 
bq.
trunk/contrib/src/java/org/apache/hadoop/hive/contrib/util/typedbytes/TypedBytesWritableInput.java
 1145950 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java 1145950 
bq.trunk/ql/src/java/org/apache/hadoop/hive/ql/io/RCFileInputFormat.java 
1145950 
bq.  
bq.  Diff: https://reviews.apache.org/r/1098/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  All unit tests passed
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  chinna
bq.  
bq.



 Potential risk of resource leaks in Hive
 

 Key: HIVE-1884
 URL: https://issues.apache.org/jira/browse/HIVE-1884
 Project: Hive
  Issue Type: Bug
  Components: CLI, Metastore, Query Processor, Server Infrastructure
Affects Versions: 0.3.0, 0.4.0, 0.4.1, 0.5.0, 0.6.0
 Environment: Hive 0.6.0, Hadoop 0.20.1
 SUSE Linux Enterprise Server 11 (i586)
Reporter: Mohit Sikri
Assignee: Chinna Rao Lalam
 Attachments: HIVE-1884.1.PATCH, HIVE-1884.2.patch, HIVE-1884.3.patch, 
 HIVE-1884.4.patch


 h3.There are couple of resource leaks.
 h4.For example,
 In CliDriver.java, Method :- processReader() the buffered reader is not 
 closed.
 h3.Also there are risk(s) of  resource(s) getting leaked , in such cases we 
 need to re factor the code to move closing of resources in finally block.
 h4. For Example :- 
 In Throttle.java   Method:- checkJobTracker() , the following code snippet 
 might cause resource leak.
 {code}
 InputStream in = url.openStream();
 in.read(buffer);
 in.close();
 {code}
 Ideally and as per the best coding practices it should be like below
 {code}
 InputStream in=null;
 try   {
 in = url.openStream();
 int numRead = in.read(buffer);
 }
 finally {
IOUtils.closeStream(in);
 }
 {code}
 Similar cases, were found in ExplainTask.java, DDLTask.java etc.Need to re 
 factor all such occurrences.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1078) CREATE VIEW followup: CREATE OR REPLACE

2011-07-18 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067334#comment-13067334
 ] 

jirapos...@reviews.apache.org commented on HIVE-1078:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1058/#review1102
---



http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
https://reviews.apache.org/r/1058/#comment2236

We'll allow this if no partitions currently exist.



http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
https://reviews.apache.org/r/1058/#comment2237

Shouldn't we allow for changes to the view comment?  Also, any 
TBLPROPERTIES specified here should be applied to the view (but we shouldn't 
delete old ones, only overwrite them if they already exist).




http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_or_replace_view.q
https://reviews.apache.org/r/1058/#comment2235

These need explicit PARTITIONED ON clauses to indicate that the existing 
partitioning scheme is being preserved.

Also, need test case for modifying the definition of an unpartitioned view.

Plus new cases we discussed for allowing change in partitioning when no 
partitions currently exist.

Also, negative case for combining IF NOT EXISTS with OR REPLACE.

Also, negative case for updating the view to have an invalid definition.

Also, positive case for using this to fix an invalid view.

Have you tried creating a view cycle?



- John


On 2011-07-18 07:01:02, Charles Chen wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1058/
bq.  ---
bq.  
bq.  (Updated 2011-07-18 07:01:02)
bq.  
bq.  
bq.  Review request for hive.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  https://issues.apache.org/jira/browse/HIVE-1078
bq.  
bq.  
bq.  This addresses bug HIVE-1078.
bq.  https://issues.apache.org/jira/browse/HIVE-1078
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_view.q.out
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_or_replace_view.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view4.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CreateViewDesc.java
 1146902 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view1.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view2.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view3.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view4.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_or_replace_view.q
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view1.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view2.q.out
 PRE-CREATION 
bq.
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view3.q.out
 PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1058/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  Passes unit tests
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Charles
bq.  
bq.



 CREATE VIEW followup:  CREATE OR REPLACE
 

 Key: HIVE-1078
 URL: https://issues.apache.org/jira/browse/HIVE-1078
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: John Sichi

[jira] [Commented] (HIVE-2139) Enables HiveServer to accept -hiveconf option

2011-07-18 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067411#comment-13067411
 ] 

jirapos...@reviews.apache.org commented on HIVE-2139:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/958/#review1106
---



common/build.xml
https://reviews.apache.org/r/958/#comment2245

This target overrides the definition of 'compile' in build-common.xml, but 
neglects to include this action which copies over hive-log4j.properties:

copy todir=${build.classes} failonerror=false
  fileset dir=${src.dir}/conf/
/copy

If this is added in I think the missing hive-log4j.properties issue will be 
resolved.



metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java
https://reviews.apache.org/r/958/#comment2244

This should be $METASTORE_PORT instead of HIVE_PORT (see metastore.sh).


- Carl


On 2011-07-15 23:51:55, Patrick Hunt wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/958/
bq.  ---
bq.  
bq.  (Updated 2011-07-15 23:51:55)
bq.  
bq.  
bq.  Review request for hive and Carl Steinbach.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  This patch updates HiveServer and HiveMetastore to add proper cli handling 
- similar to that used in CliDriver (ie GnuParser).
bq.  
bq.  There's a common HiveCli class that's used by both main classes.
bq.  
bq.  I've attempted to make the cli's backward compatible with the prior 
command line processing. Notice I've deprecated (via warnings, but the code 
still runs) if the old style CLI usage is used.
bq.  
bq.  commands such as the following now work as expected:
bq.  
bq.  bin/hive --service hiveserver -t 200 -p 12000 --hiveconf 
hive.root.logger=DEBUG,console
bq.  
bq.  as does the following which generates usage information:
bq.  
bq.  bin/hive --service hiveserver -h
bq.  
bq.  Note: HiveMetastore as not initializing log4j, I updated the code to do 
similar to HiveServer (otw the hiveconf hive.root.logger option above didn't 
work).
bq.  
bq.  
bq.  This addresses bug HIVE-2139.
bq.  https://issues.apache.org/jira/browse/HIVE-2139
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.ql/src/test/org/apache/hadoop/hive/ql/history/TestHiveHistory.java 
728958d 
bq.service/src/java/org/apache/hadoop/hive/service/HiveServer.java ea04be9 
bq.ql/src/java/org/apache/hadoop/hive/ql/exec/ExecDriver.java 090ecfc 
bq.ql/src/java/org/apache/hadoop/hive/ql/session/SessionState.java 61393c8 
bq.bin/ext/hiveserver.sh b5edce4 
bq.bin/ext/metastore.sh db15f6e 
bq.cli/src/java/org/apache/hadoop/hive/cli/CliDriver.java bd9f6b1 
bq.common/build.xml 0969e16 
bq.common/ivy.xml bee3bf0 
bq.common/src/java/org/apache/hadoop/hive/common/LogUtils.java PRE-CREATION 
bq.common/src/java/org/apache/hadoop/hive/common/cli/CommonCliOptions.java 
PRE-CREATION 
bq.hwi/src/java/org/apache/hadoop/hive/hwi/HWISessionItem.java 1bf8c97 
bq.metastore/ivy.xml 83218fc 
bq.metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 
e540c63 
bq.  
bq.  Diff: https://reviews.apache.org/r/958/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  I couldn't find any tests for these changes, so I verfied the changes 
manually.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Patrick
bq.  
bq.



 Enables HiveServer to accept -hiveconf option
 -

 Key: HIVE-2139
 URL: https://issues.apache.org/jira/browse/HIVE-2139
 Project: Hive
  Issue Type: Improvement
  Components: CLI
 Environment: Linux + CDH3u0 (Hive 0.7.0+27.1-2~lucid-cdh3)
Reporter: Kazuki Ohta
Assignee: Patrick Hunt
 Attachments: HIVE-2139.patch, HIVE-2139.patch


 Currently, I'm trying to test HiveHBaseIntegration on HiveServer. But it 
 doesn't seem to accept -hiveconf command.
 {code}
 hive --service hiveserver -hiveconf hbase.zookeeper.quorum=hdp0,hdp1,hdp2
 Starting Hive Thrift Server
 java.lang.NumberFormatException: For input string: -hiveconf
 at 
 java.lang.NumberFormatException.forInputString(NumberFormatException.java:48)
 at java.lang.Integer.parseInt(Integer.java:449)
 at java.lang.Integer.parseInt(Integer.java:499)
 at org.apache.hadoop.hive.service.HiveServer.main(HiveServer.java:382)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
 at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
 at 
 

[jira] [Commented] (HIVE-2247) ALTER TABLE RENAME PARTITION

2011-07-17 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066756#comment-13066756
 ] 

jirapos...@reviews.apache.org commented on HIVE-2247:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1105/
---

(Updated 2011-07-18 00:10:53.957462)


Review request for Siying Dong.


Changes
---

Change thrift interface to 
void rename_partition(1:string dbname, 2:string name, 3:liststring part_vals, 
4:liststring newpart_vals)
Rename the data directory of old partition to that of new partition and change 
location/name parameter of partition metadata
use Alter_partition to update the partition metadata


Summary
---

Implement ALTER TABLE PARTITION RENAME function to rename a partition. 
Add HiveQL syntax ALTER TABLE bar PARTITION (k1='v1', k2='v2') RENAME TO 
PARTITION (k1='v3', k2='v4');
This is my first Hive diff, I just learn everything from existing codebase and 
may not have a good understanding on it. 
Feel free to inform me if I make something wrong. Thanks


This addresses bug HIVE-2247.
https://issues.apache.org/jira/browse/HIVE-2247


Diffs (updated)
-

  
trunk/metastore/src/test/org/apache/hadoop/hive/metastore/TestHiveMetaStore.java
 1145366 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java 
1145366 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/RawStore.java 
1145366 
  
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/IMetaStoreClient.java 
1145366 
  
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStoreClient.java
 1145366 
  trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 
1145366 
  
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveAlterHandler.java 
1145366 
  trunk/metastore/src/gen/thrift/gen-rb/thrift_hive_metastore.rb 1145366 
  
trunk/metastore/src/gen/thrift/gen-py/hive_metastore/ThriftHiveMetastore-remote 
1145366 
  trunk/metastore/src/gen/thrift/gen-py/hive_metastore/ThriftHiveMetastore.py 
1145366 
  trunk/ql/src/test/queries/clientnegative/alter_rename_partition_failure.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/alter_rename_partition_failure2.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientnegative/alter_rename_partition_failure3.q 
PRE-CREATION 
  trunk/ql/src/test/queries/clientpositive/alter_rename_partition.q 
PRE-CREATION 
  trunk/ql/src/test/results/clientnegative/alter_rename_partition_failure.q.out 
PRE-CREATION 
  
trunk/ql/src/test/results/clientnegative/alter_rename_partition_failure2.q.out 
PRE-CREATION 
  
trunk/ql/src/test/results/clientnegative/alter_rename_partition_failure3.q.out 
PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/alter_rename_partition.q.out 
PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/RenamePartitionDesc.java 
PRE-CREATION 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g 1145366 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzerFactory.java 
1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/AlterTableDesc.java 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/DDLWork.java 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/HiveOperation.java 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java 1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java 
1145366 
  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java 1145366 
  trunk/metastore/if/hive_metastore.thrift 1145366 
  trunk/metastore/src/gen/thrift/gen-cpp/ThriftHiveMetastore.h 1145366 
  trunk/metastore/src/gen/thrift/gen-cpp/ThriftHiveMetastore.cpp 1145366 
  
trunk/metastore/src/gen/thrift/gen-cpp/ThriftHiveMetastore_server.skeleton.cpp 
1145366 
  
trunk/metastore/src/gen/thrift/gen-javabean/org/apache/hadoop/hive/metastore/api/ThriftHiveMetastore.java
 1145366 
  trunk/metastore/src/gen/thrift/gen-php/hive_metastore/ThriftHiveMetastore.php 
1145366 

Diff: https://reviews.apache.org/r/1105/diff


Testing
---

Add a partition A in the table
Rename partition A to partition B
Show the partitions in the table, it returns partition B.
SELECT the data from partition A, it returns no results
SELECT the data from partition B, it returns the data originally stored in 
partition A


Thanks,

Weiyan



 ALTER TABLE RENAME PARTITION
 

 Key: HIVE-2247
 URL: https://issues.apache.org/jira/browse/HIVE-2247
 Project: Hive
  Issue Type: New Feature
Reporter: Siying Dong
Assignee: Weiyan Wang
 Attachments: HIVE-2247.3.patch.txt, HIVE-2247.4.patch.txt


 We need a ALTER TABLE TABLE RENAME PARTITIONfunction that 

[jira] [Commented] (HIVE-2247) ALTER TABLE RENAME PARTITION

2011-07-17 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066757#comment-13066757
 ] 

jirapos...@reviews.apache.org commented on HIVE-2247:
-



bq.  On 2011-07-13 23:58:21, Siying Dong wrote:
bq.   trunk/metastore/if/hive_metastore.thrift, line 272
bq.   https://reviews.apache.org/r/1105/diff/1/?file=25058#file25058line272
bq.  
bq.   Do we really need a new metastore function? Can we just use 
alter_partition()?

alter_partition() just updates the partition metadata and neither check whether 
the new partition exists nor do physical rename of data directory. It's 
basically used by alter_table when there is no data in the old partition. 


bq.  On 2011-07-13 23:58:21, Siying Dong wrote:
bq.   
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/DDLSemanticAnalyzer.java, 
line 1704
bq.   https://reviews.apache.org/r/1105/diff/1/?file=25075#file25075line1704
bq.  
bq.   why we still continue when tab is null?

Fixed, throw an exception when tab is null.


bq.  On 2011-07-13 23:58:21, Siying Dong wrote:
bq.   trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/HiveOperation.java, 
line 39
bq.   https://reviews.apache.org/r/1105/diff/1/?file=25080#file25080line39
bq.  
bq.   Privilege.ALTER_METADATA doesn't sound right to me here. Can you use 
Privilede.DROP and Priviledge.CREATE?

Change it to ALTERTABLE_RENAMEPART(ALTERTABLE_RENAMEPART, new 
Privilege[]{Privilege.DROP}, new Privilege[]{Privilege.CREATE})
Not sure if it sounds right to you.


bq.  On 2011-07-13 23:58:21, Siying Dong wrote:
bq.   
trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/RenamePartitionDesc.java, line 
1
bq.   https://reviews.apache.org/r/1105/diff/1/?file=25081#file25081line1
bq.  
bq.   Looks like you copied from the file AddPartitionDesc, but do you 
need all those class members for renaming partition?

Only keep necessary functions


- Weiyan


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1105/#review1052
---


On 2011-07-18 00:10:53, Weiyan Wang wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1105/
bq.  ---
bq.  
bq.  (Updated 2011-07-18 00:10:53)
bq.  
bq.  
bq.  Review request for Siying Dong.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  Implement ALTER TABLE PARTITION RENAME function to rename a partition. 
bq.  Add HiveQL syntax ALTER TABLE bar PARTITION (k1='v1', k2='v2') RENAME TO 
PARTITION (k1='v3', k2='v4');
bq.  This is my first Hive diff, I just learn everything from existing codebase 
and may not have a good understanding on it. 
bq.  Feel free to inform me if I make something wrong. Thanks
bq.  
bq.  
bq.  This addresses bug HIVE-2247.
bq.  https://issues.apache.org/jira/browse/HIVE-2247
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.
trunk/metastore/src/test/org/apache/hadoop/hive/metastore/TestHiveMetaStore.java
 1145366 
bq.
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java 
1145366 
bq.trunk/metastore/src/java/org/apache/hadoop/hive/metastore/RawStore.java 
1145366 
bq.
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/IMetaStoreClient.java 
1145366 
bq.
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStoreClient.java
 1145366 
bq.
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveMetaStore.java 
1145366 
bq.
trunk/metastore/src/java/org/apache/hadoop/hive/metastore/HiveAlterHandler.java 
1145366 
bq.trunk/metastore/src/gen/thrift/gen-rb/thrift_hive_metastore.rb 1145366 
bq.
trunk/metastore/src/gen/thrift/gen-py/hive_metastore/ThriftHiveMetastore-remote 
1145366 
bq.
trunk/metastore/src/gen/thrift/gen-py/hive_metastore/ThriftHiveMetastore.py 
1145366 
bq.
trunk/ql/src/test/queries/clientnegative/alter_rename_partition_failure.q 
PRE-CREATION 
bq.
trunk/ql/src/test/queries/clientnegative/alter_rename_partition_failure2.q 
PRE-CREATION 
bq.
trunk/ql/src/test/queries/clientnegative/alter_rename_partition_failure3.q 
PRE-CREATION 
bq.trunk/ql/src/test/queries/clientpositive/alter_rename_partition.q 
PRE-CREATION 
bq.
trunk/ql/src/test/results/clientnegative/alter_rename_partition_failure.q.out 
PRE-CREATION 
bq.
trunk/ql/src/test/results/clientnegative/alter_rename_partition_failure2.q.out 
PRE-CREATION 
bq.
trunk/ql/src/test/results/clientnegative/alter_rename_partition_failure3.q.out 
PRE-CREATION 
bq.trunk/ql/src/test/results/clientpositive/alter_rename_partition.q.out 
PRE-CREATION 
bq.
trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/RenamePartitionDesc.java 
PRE-CREATION 
bq.

[jira] [Commented] (HIVE-1218) CREATE TABLE t LIKE some_view should create a new empty base table, but instead creates a copy of view

2011-07-16 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066550#comment-13066550
 ] 

jirapos...@reviews.apache.org commented on HIVE-1218:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1057/
---

(Updated 2011-07-16 22:51:22.078393)


Review request for hive.


Changes
---

Regenerated unit tests that had EXTERNAL : FALSE


Summary
---

https://issues.apache.org/jira/browse/HIVE-1218


This addresses bug HIVE-1218.
https://issues.apache.org/jira/browse/HIVE-1218


Diffs (updated)
-

  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
 1146883 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
 1146883 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CreateTableLikeDesc.java
 1146883 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_like.q
 1146883 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_like_view.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_like.q.out
 1146883 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_like_view.q.out
 PRE-CREATION 

Diff: https://reviews.apache.org/r/1057/diff


Testing
---

Unit tests passed


Thanks,

Charles



 CREATE TABLE t LIKE some_view should create a new empty base table, but 
 instead creates a copy of view
 --

 Key: HIVE-1218
 URL: https://issues.apache.org/jira/browse/HIVE-1218
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Affects Versions: 0.6.0
Reporter: John Sichi
Assignee: Charles Chen
 Fix For: 0.8.0

 Attachments: HIVE-1218v0.patch, HIVE-1218v2.patch, HIVE-1218v3.patch, 
 HIVE-1218v4.patch, HIVE-1218v5.patch, HIVE-1218v6.patch, HIVE-1218v7.patch, 
 HIVE-1218v8.patch


 I think it should copy only the column definitions from the view metadata.  
 Currently it is copying the entire descriptor, resulting in a new view 
 instead of a new base table.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2284) bucketized map join should allow join key as a superset of bucketized columns

2011-07-15 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066080#comment-13066080
 ] 

jirapos...@reviews.apache.org commented on HIVE-2284:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1136/
---

Review request for hive and namit jain.


Summary
---

Allow bucketed mapjoin if join key is a superset of bucket columns. 


This addresses bug HIVE-2284.
https://issues.apache.org/jira/browse/HIVE-2284


Diffs
-

  trunk/contrib/build.xml 1146922 
  trunk/eclipse-templates/.classpath 1146922 
  
trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/BucketMapJoinOptimizer.java
 1146922 
  trunk/ql/src/test/queries/clientpositive/smb_mapjoin_10.q PRE-CREATION 
  trunk/ql/src/test/results/clientpositive/smb_mapjoin_10.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/1136/diff


Testing
---

passed all unit tests. 


Thanks,

Ning



 bucketized map join should allow join key as a superset of bucketized columns
 -

 Key: HIVE-2284
 URL: https://issues.apache.org/jira/browse/HIVE-2284
 Project: Hive
  Issue Type: Improvement
Reporter: Ning Zhang
Assignee: Ning Zhang
 Fix For: 0.8.0

 Attachments: HIVE-2284.patch


 Currently bucketized mapjoin only allow the join keys being exactly the same 
 as bucketized columns. This is too restrictive and is missing some 
 optimization opportunities. 
 If tables S and T are both bucketized on column A with the same # of buckets, 
 and the query is something like:
 code 
 SELECT /*+ MAPJOIN (S) */ ...
 FROM S join T 
   ON (S.A = T.A AND S.B = T.B)
 code
 We should allow bucketized mapjoin since it's straightforward that bucket 1 
 from S join with bucket 2 from T on such join condition must be empty. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2282) Local mode needs to work well with block sampling

2011-07-15 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066083#comment-13066083
 ] 

jirapos...@reviews.apache.org commented on HIVE-2282:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1132/#review1080
---



ql/src/java/org/apache/hadoop/hive/ql/exec/MapRedTask.java
https://reviews.apache.org/r/1132/#comment2209

This function name seems to be confusing. Looks like the input size is set 
even if there is no sampling, right? Also, can you add comments to this 
function?

Other than that, the patch looks OK.


- Siying


On 2011-07-15 02:16:34, Kevin Wilfong wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1132/
bq.  ---
bq.  
bq.  (Updated 2011-07-15 02:16:34)
bq.  
bq.  
bq.  Review request for hive and Siying Dong.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  A query should run in local mode when block sampling is used and the 
sample is small enough.  The size of the sample is currently being estimated, 
as it is done to estimate the number of reducers.
bq.  
bq.  
bq.  This addresses bug HIVE-2282.
bq.  https://issues.apache.org/jira/browse/HIVE-2282
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.ql/src/test/queries/clientpositive/sample_islocalmode_hook.q 
PRE-CREATION 
bq.ql/src/java/org/apache/hadoop/hive/ql/exec/MapRedTask.java 53769a0 
bq.ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 
cd3de76 
bq.ql/src/test/org/apache/hadoop/hive/ql/hooks/VerifyIsLocalModeHook.java 
PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1132/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  TestCliDriver TestNegativeCliDriver, manually tested
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Kevin
bq.  
bq.



 Local mode needs to work well with block sampling
 -

 Key: HIVE-2282
 URL: https://issues.apache.org/jira/browse/HIVE-2282
 Project: Hive
  Issue Type: Improvement
Reporter: Siying Dong
Assignee: Kevin Wilfong
 Attachments: HIVE-2282.1.patch.txt


 Currently, if block sampling is enabled and large set of data are sampled to 
 a small set, local mode needs to be kicked in. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2282) Local mode needs to work well with block sampling

2011-07-15 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066086#comment-13066086
 ] 

jirapos...@reviews.apache.org commented on HIVE-2282:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1132/#review1081
---



ql/src/test/org/apache/hadoop/hive/ql/hooks/VerifyIsLocalModeHook.java
https://reviews.apache.org/r/1132/#comment2210

We need a header for licensing.


- Siying


On 2011-07-15 02:16:34, Kevin Wilfong wrote:
bq.  
bq.  ---
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/1132/
bq.  ---
bq.  
bq.  (Updated 2011-07-15 02:16:34)
bq.  
bq.  
bq.  Review request for hive and Siying Dong.
bq.  
bq.  
bq.  Summary
bq.  ---
bq.  
bq.  A query should run in local mode when block sampling is used and the 
sample is small enough.  The size of the sample is currently being estimated, 
as it is done to estimate the number of reducers.
bq.  
bq.  
bq.  This addresses bug HIVE-2282.
bq.  https://issues.apache.org/jira/browse/HIVE-2282
bq.  
bq.  
bq.  Diffs
bq.  -
bq.  
bq.ql/src/test/queries/clientpositive/sample_islocalmode_hook.q 
PRE-CREATION 
bq.ql/src/java/org/apache/hadoop/hive/ql/exec/MapRedTask.java 53769a0 
bq.ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 
cd3de76 
bq.ql/src/test/org/apache/hadoop/hive/ql/hooks/VerifyIsLocalModeHook.java 
PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/1132/diff
bq.  
bq.  
bq.  Testing
bq.  ---
bq.  
bq.  TestCliDriver TestNegativeCliDriver, manually tested
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Kevin
bq.  
bq.



 Local mode needs to work well with block sampling
 -

 Key: HIVE-2282
 URL: https://issues.apache.org/jira/browse/HIVE-2282
 Project: Hive
  Issue Type: Improvement
Reporter: Siying Dong
Assignee: Kevin Wilfong
 Attachments: HIVE-2282.1.patch.txt


 Currently, if block sampling is enabled and large set of data are sampled to 
 a small set, local mode needs to be kicked in. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2252) Display a sample of partitions created when Fatal Error occurred due to too many partitioned created

2011-07-15 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066108#comment-13066108
 ] 

jirapos...@reviews.apache.org commented on HIVE-2252:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1013/
---

(Updated 2011-07-15 18:20:41.419263)


Review request for hive.


Changes
---

rebased on the current trunk


Summary
---

In dynamic partition inserts, if a mapper created too many partitions, a fatal 
error is raised and the job got killed. Sometimes the error is caused by data 
error and it will be helpful for users to debug if we display a sample of 
dynamic partitions generated.


This addresses bug HIVE-2252.
https://issues.apache.org/jira/browse/HIVE-2252


Diffs (updated)
-

  trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/FileSinkOperator.java 
1147261 

Diff: https://reviews.apache.org/r/1013/diff


Testing
---

passed all unit tests. 


Thanks,

Ning



 Display a sample of partitions created when Fatal Error occurred due to too 
 many partitioned created
 

 Key: HIVE-2252
 URL: https://issues.apache.org/jira/browse/HIVE-2252
 Project: Hive
  Issue Type: Improvement
Reporter: Ning Zhang
Assignee: Ning Zhang
 Attachments: HIVE-2252.patch


 In dynamic partition inserts, if a mapper created too many partitions, a 
 fatal error is raised and the job got killed. Sometimes the error is caused 
 by data error and it will be helpful for users to debug if we display a 
 sample of dynamic partitions generated. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1078) CREATE VIEW followup: CREATE OR REPLACE

2011-07-15 Thread jirapos...@reviews.apache.org (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13066161#comment-13066161
 ] 

jirapos...@reviews.apache.org commented on HIVE-1078:
-


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1058/
---

(Updated 2011-07-15 19:58:33.134187)


Review request for hive.


Changes
---

Alter view in place


Summary
---

https://issues.apache.org/jira/browse/HIVE-1078


This addresses bug HIVE-1078.
https://issues.apache.org/jira/browse/HIVE-1078


Diffs (updated)
-

  
http://svn.apache.org/repos/asf/hive/trunk/metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/DDLTask.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/CreateViewDesc.java
 1146902 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view1.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view2.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view3.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientnegative/create_or_replace_view4.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/queries/clientpositive/create_or_replace_view.q
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view1.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view2.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view3.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientnegative/create_or_replace_view4.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_or_replace_view.q.out
 PRE-CREATION 
  
http://svn.apache.org/repos/asf/hive/trunk/ql/src/test/results/clientpositive/create_view.q.out
 1146902 

Diff: https://reviews.apache.org/r/1058/diff


Testing
---

Passes unit tests


Thanks,

Charles



 CREATE VIEW followup:  CREATE OR REPLACE
 

 Key: HIVE-1078
 URL: https://issues.apache.org/jira/browse/HIVE-1078
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: John Sichi
Assignee: Charles Chen
 Attachments: HIVE-1078v3.patch, HIVE-1078v4.patch, HIVE-1078v5.patch


 Currently, replacing a view requires
 DROP VIEW v;
 CREATE VIEW v AS new-definition;
 CREATE OR REPLACE would allow these to be combined into a single operation.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




<    1   2   3   4   5   6   >