[jira] [Commented] (HIVE-4042) ignore mapjoin hint

2013-02-25 Thread Namit Jain (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13585706#comment-13585706
 ] 

Namit Jain commented on HIVE-4042:
--

Updated the default to true, changed some tests/outputs and added this as a 
query property for easy logging.
All the tests passed

 ignore mapjoin hint
 ---

 Key: HIVE-4042
 URL: https://issues.apache.org/jira/browse/HIVE-4042
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Reporter: Namit Jain
Assignee: Namit Jain
 Attachments: hive.4042.1.patch, hive.4042.2.patch, hive.4042.3.patch, 
 hive.4042.4.patch, hive.4042.5.patch, hive.4042.6.patch


 After HIVE-3784, in a production environment, it can become difficult to
 deploy since a lot of production queries can break.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4068) Size of aggregation buffer which uses non-primitive type is not estimated correctly

2013-02-25 Thread Phabricator (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13585722#comment-13585722
 ] 

Phabricator commented on HIVE-4068:
---

njain has commented on the revision HIVE-4068 [jira] Size of aggregation 
buffer which uses non-primitive type is not estimated correctly.

  I was thinking of a abstract class ---
  AbstractAggregationBuffer with the annotation.

  The default implementation of estimate() is empty

  All known aggregationBuffers extend AbstractAB.

 if (agg instanceof GenericUDAFEvaluator.EstimableBuffer) {
return ((GenericUDAFEvaluator.EstimableBuffer)agg).estimate();
  }

  change instanceof to annotation

  Mark AB as deprecated

REVISION DETAIL
  https://reviews.facebook.net/D8859

To: JIRA, navis
Cc: njain


 Size of aggregation buffer which uses non-primitive type is not estimated 
 correctly
 ---

 Key: HIVE-4068
 URL: https://issues.apache.org/jira/browse/HIVE-4068
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Reporter: Navis
Assignee: Navis
Priority: Minor
 Attachments: HIVE-4068.D8859.1.patch, HIVE-4068.D8859.2.patch


 Currently, hive assumes an aggregation buffer which holds a map is occupying 
 just 256 byte (fixed). If it's bigger than that in real, OutOfMemoryError can 
 be thrown (especially for 1k buffer). 
 workaround : set hive.map.aggr.hash.percentmemory=smaller value than 
 default(0.5)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-2780) Implement more restrictive table sampler

2013-02-25 Thread Navis (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Navis updated HIVE-2780:


Status: Patch Available  (was: Open)

 Implement more restrictive table sampler
 

 Key: HIVE-2780
 URL: https://issues.apache.org/jira/browse/HIVE-2780
 Project: Hive
  Issue Type: Improvement
Reporter: Navis
Assignee: Navis
Priority: Trivial
 Attachments: ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.1.patch, 
 ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.2.patch, HIVE-2780.D1623.3.patch, 
 HIVE-2780.D1623.4.patch


 Current table sampling scans whole block, making more rows included than 
 expected especially for small tables.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-2780) Implement more restrictive table sampler

2013-02-25 Thread Phabricator (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Phabricator updated HIVE-2780:
--

Attachment: HIVE-2780.D1623.4.patch

navis updated the revision HIVE-2780 [jira] Implement more restrictive table 
sampler.

  Addressed comments  rebased to trunk

Reviewers: ashutoshc, JIRA

REVISION DETAIL
  https://reviews.facebook.net/D1623

CHANGE SINCE LAST DIFF
  https://reviews.facebook.net/D1623?vs=25479id=28587#toc

AFFECTED FILES
  common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
  conf/hive-default.xml.template
  ql/src/java/org/apache/hadoop/hive/ql/io/CombineHiveInputFormat.java
  ql/src/java/org/apache/hadoop/hive/ql/io/DefaultSplitSampler.java
  ql/src/java/org/apache/hadoop/hive/ql/io/ShirinkSplitSampler.java
  ql/src/java/org/apache/hadoop/hive/ql/io/SplitSampler.java
  ql/src/test/queries/clientpositive/split_sample_custom.q
  ql/src/test/results/clientpositive/split_sample_custom.q.out

To: JIRA, ashutoshc, navis


 Implement more restrictive table sampler
 

 Key: HIVE-2780
 URL: https://issues.apache.org/jira/browse/HIVE-2780
 Project: Hive
  Issue Type: Improvement
Reporter: Navis
Assignee: Navis
Priority: Trivial
 Attachments: ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.1.patch, 
 ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.2.patch, HIVE-2780.D1623.3.patch, 
 HIVE-2780.D1623.4.patch


 Current table sampling scans whole block, making more rows included than 
 expected especially for small tables.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4068) Size of aggregation buffer which uses non-primitive type is not estimated correctly

2013-02-25 Thread Phabricator (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Phabricator updated HIVE-4068:
--

Attachment: HIVE-4068.D8859.3.patch

navis updated the revision HIVE-4068 [jira] Size of aggregation buffer which 
uses non-primitive type is not estimated correctly.

  Like this?

Reviewers: JIRA

REVISION DETAIL
  https://reviews.facebook.net/D8859

CHANGE SINCE LAST DIFF
  https://reviews.facebook.net/D8859?vs=28539id=28593#toc

AFFECTED FILES
  ql/src/java/org/apache/hadoop/hive/ql/exec/GroupByOperator.java
  ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFEWAHBitmap.java
  ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFEvaluator.java
  
ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFHistogramNumeric.java
  ql/src/java/org/apache/hadoop/hive/ql/udf/generic/NumericHistogram.java

To: JIRA, navis
Cc: njain


 Size of aggregation buffer which uses non-primitive type is not estimated 
 correctly
 ---

 Key: HIVE-4068
 URL: https://issues.apache.org/jira/browse/HIVE-4068
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Reporter: Navis
Assignee: Navis
Priority: Minor
 Attachments: HIVE-4068.D8859.1.patch, HIVE-4068.D8859.2.patch, 
 HIVE-4068.D8859.3.patch


 Currently, hive assumes an aggregation buffer which holds a map is occupying 
 just 256 byte (fixed). If it's bigger than that in real, OutOfMemoryError can 
 be thrown (especially for 1k buffer). 
 workaround : set hive.map.aggr.hash.percentmemory=smaller value than 
 default(0.5)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4018) MapJoin failing with Distributed Cache error

2013-02-25 Thread Amareshwari Sriramadasu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated HIVE-4018:
--

Status: Patch Available  (was: Open)

 MapJoin failing with Distributed Cache error
 

 Key: HIVE-4018
 URL: https://issues.apache.org/jira/browse/HIVE-4018
 Project: Hive
  Issue Type: Bug
  Components: SQL
Affects Versions: 0.11.0
Reporter: Amareshwari Sriramadasu
Assignee: Amareshwari Sriramadasu
 Fix For: 0.11.0

 Attachments: HIVE-4018.patch, hive.4018.test.2.patch, 
 HIVE-4018-test.patch


 When I'm a running a star join query after HIVE-3784, it is failing with 
 following error:
 2013-02-13 08:36:04,584 ERROR org.apache.hadoop.hive.ql.exec.MapJoinOperator: 
 Load Distributed Cache Error
 2013-02-13 08:36:04,585 FATAL ExecMapper: 
 org.apache.hadoop.hive.ql.metadata.HiveException: java.io.EOFException
   at 
 org.apache.hadoop.hive.ql.exec.MapJoinOperator.loadHashTable(MapJoinOperator.java:189)
   at 
 org.apache.hadoop.hive.ql.exec.MapJoinOperator.cleanUpInputFileChangedOp(MapJoinOperator.java:203)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1421)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425)
   at 
 org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:614)
   at org.apache.hadoop.hive.ql.exec.ExecMapper.map(ExecMapper.java:144)
   at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)
   at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:391)
   at org.apache.hadoop.mapred.MapTask.run(MapTask.java:325)
   at org.apache.hadoop.mapred.Child$4.run(Child.java:266)
   at java.security.AccessController.doPrivileged(Native Method)
   at javax.security.auth.Subject.doAs(Subject.java:416)
   at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1278)
   at org.apache.hadoop.mapred.Child.main(Child.java:260)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4018) MapJoin failing with Distributed Cache error

2013-02-25 Thread Amareshwari Sriramadasu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated HIVE-4018:
--

Attachment: HIVE-4018.patch

Here is a patch which fixes the issue, with testcase added.

 MapJoin failing with Distributed Cache error
 

 Key: HIVE-4018
 URL: https://issues.apache.org/jira/browse/HIVE-4018
 Project: Hive
  Issue Type: Bug
  Components: SQL
Affects Versions: 0.11.0
Reporter: Amareshwari Sriramadasu
Assignee: Amareshwari Sriramadasu
 Fix For: 0.11.0

 Attachments: HIVE-4018.patch, hive.4018.test.2.patch, 
 HIVE-4018-test.patch


 When I'm a running a star join query after HIVE-3784, it is failing with 
 following error:
 2013-02-13 08:36:04,584 ERROR org.apache.hadoop.hive.ql.exec.MapJoinOperator: 
 Load Distributed Cache Error
 2013-02-13 08:36:04,585 FATAL ExecMapper: 
 org.apache.hadoop.hive.ql.metadata.HiveException: java.io.EOFException
   at 
 org.apache.hadoop.hive.ql.exec.MapJoinOperator.loadHashTable(MapJoinOperator.java:189)
   at 
 org.apache.hadoop.hive.ql.exec.MapJoinOperator.cleanUpInputFileChangedOp(MapJoinOperator.java:203)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1421)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425)
   at 
 org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:614)
   at org.apache.hadoop.hive.ql.exec.ExecMapper.map(ExecMapper.java:144)
   at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)
   at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:391)
   at org.apache.hadoop.mapred.MapTask.run(MapTask.java:325)
   at org.apache.hadoop.mapred.Child$4.run(Child.java:266)
   at java.security.AccessController.doPrivileged(Native Method)
   at javax.security.auth.Subject.doAs(Subject.java:416)
   at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1278)
   at org.apache.hadoop.mapred.Child.main(Child.java:260)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3629) Try bucket split more localized by referencing block locations of small aliases

2013-02-25 Thread Navis (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Navis updated HIVE-3629:


Status: Patch Available  (was: Open)

 Try bucket split more localized by referencing block locations of small 
 aliases
 ---

 Key: HIVE-3629
 URL: https://issues.apache.org/jira/browse/HIVE-3629
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Reporter: Navis
Assignee: Navis
Priority: Minor
 Attachments: HIVE-3629.D6273.1.patch, HIVE-3629.D6273.2.patch


 Currently splits for SMBMapJoin does not concern about location of small 
 aliases. If all inputs could be on single server, joining would be more 
 faster.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3629) Try bucket split more localized by referencing block locations of small aliases

2013-02-25 Thread Phabricator (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Phabricator updated HIVE-3629:
--

Attachment: HIVE-3629.D6273.2.patch

navis updated the revision HIVE-3629 [jira] Try bucket split more localized by 
referencing block locations of small aliases.

  Rebased to trunk

Reviewers: JIRA

REVISION DETAIL
  https://reviews.facebook.net/D6273

CHANGE SINCE LAST DIFF
  https://reviews.facebook.net/D6273?vs=20583id=28599#toc

AFFECTED FILES
  ql/src/java/org/apache/hadoop/hive/ql/exec/SMBMapJoinOperator.java
  ql/src/java/org/apache/hadoop/hive/ql/io/BucketizedHiveInputFormat.java
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/MapJoinFactory.java
  ql/src/java/org/apache/hadoop/hive/ql/plan/BucketMapJoinContext.java
  ql/src/java/org/apache/hadoop/hive/ql/plan/MapredWork.java

To: JIRA, navis


 Try bucket split more localized by referencing block locations of small 
 aliases
 ---

 Key: HIVE-3629
 URL: https://issues.apache.org/jira/browse/HIVE-3629
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Reporter: Navis
Assignee: Navis
Priority: Minor
 Attachments: HIVE-3629.D6273.1.patch, HIVE-3629.D6273.2.patch


 Currently splits for SMBMapJoin does not concern about location of small 
 aliases. If all inputs could be on single server, joining would be more 
 faster.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3235) Support jobId notification for MaredTasks

2013-02-25 Thread Navis (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Navis updated HIVE-3235:


Affects Version/s: (was: 0.10.0)
   Status: Patch Available  (was: Open)

 Support jobId notification for MaredTasks
 -

 Key: HIVE-3235
 URL: https://issues.apache.org/jira/browse/HIVE-3235
 Project: Hive
  Issue Type: Improvement
  Components: Diagnosability
Reporter: Navis
Assignee: Navis
Priority: Trivial

 JobId for each MapredTask is requested for monitoring purpose.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3235) Support jobId notification for MaredTasks

2013-02-25 Thread Phabricator (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Phabricator updated HIVE-3235:
--

Attachment: HIVE-3235.D4017.2.patch

navis updated the revision HIVE-3235 [jira] Support jobId notification for 
MaredTasks.

  Addressed comments  rebased to trunk

Reviewers: JIRA

REVISION DETAIL
  https://reviews.facebook.net/D4017

CHANGE SINCE LAST DIFF
  https://reviews.facebook.net/D4017?vs=12885id=28605#toc

AFFECTED FILES
  common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
  ql/src/java/org/apache/hadoop/hive/ql/exec/ExecDriver.java
  ql/src/java/org/apache/hadoop/hive/ql/exec/Task.java

To: JIRA, navis
Cc: zhenxiao


 Support jobId notification for MaredTasks
 -

 Key: HIVE-3235
 URL: https://issues.apache.org/jira/browse/HIVE-3235
 Project: Hive
  Issue Type: Improvement
  Components: Diagnosability
Reporter: Navis
Assignee: Navis
Priority: Trivial
 Attachments: HIVE-3235.D4017.2.patch


 JobId for each MapredTask is requested for monitoring purpose.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-948) more query plan optimization rules

2013-02-25 Thread Ashutosh Chauhan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13585939#comment-13585939
 ] 

Ashutosh Chauhan commented on HIVE-948:
---

Thanks Navis for updating patch. Running tests again.

 more query plan optimization rules 
 ---

 Key: HIVE-948
 URL: https://issues.apache.org/jira/browse/HIVE-948
 Project: Hive
  Issue Type: Improvement
Reporter: Ning Zhang
Assignee: Navis
 Attachments: HIVE-948.D8463.1.patch, HIVE-948.D8463.2.patch, 
 HIVE-948.D8463.3.patch, HIVE-948.D8463.3.patch, HIVE-948.D8463.4.patch, 
 HIVE-948.D8463.5.patch, HIVE-948.testresult_only.1.txt


 Many query plans are not optimal in that they contain redundant operators. 
 Some examples are unnecessary select operators (select followed by select, 
 select output being the same as input etc.). Even though these operators are 
 not very expensive, they could account for around 10% of CPU time in some 
 simple queries. It seems they are low-hanging fruits that we should pick 
 first. 
 BTW, it seems these optimization rules should be added at the last stage of 
 the physical optimization phase since some redundant operators are added to 
 facilitate physical plan generation. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4070) Like operator in Hive is case sensitive while in MySQL (and most likely other DBs) it's case insensitive

2013-02-25 Thread Sean Mackrory (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Mackrory updated HIVE-4070:


Summary: Like operator in Hive is case sensitive while in MySQL (and most 
likely other DBs) it's case insensitive  (was: Like operator in Hive is case 
sensitive while in MySQL (and most likely other DBs) it's case sensitive)

 Like operator in Hive is case sensitive while in MySQL (and most likely other 
 DBs) it's case insensitive
 

 Key: HIVE-4070
 URL: https://issues.apache.org/jira/browse/HIVE-4070
 Project: Hive
  Issue Type: Bug
  Components: UDF
Affects Versions: 0.10.0
Reporter: Mark Grover
Assignee: Mark Grover
 Fix For: 0.11.0


 Hive's like operator seems to be case sensitive.
 See 
 https://github.com/apache/hive/blob/trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFLike.java#L164
 However, MySQL's like operator is case insensitive. I don't have other DB's 
 (like PostgreSQL) installed and handy but I am guessing their LIKE is case 
 insensitive as well.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4070) Like operator in Hive is case sensitive while in MySQL (and most likely other DBs) it's case sensitive

2013-02-25 Thread Sean Mackrory (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13585944#comment-13585944
 ] 

Sean Mackrory commented on HIVE-4070:
-

Though admittedly it was a question of mine trying to clarify HiveQL that 
spawned this thought, I don't necessarily think HiveQL's LIKE operator _should_ 
be case insensitive. It would certainly make HiveQL more consistent with SQL if 
it was, but when running extremely long queries over certain types of data, I 
could see it being a nice feature that LIKE gives you the flexibility of 
wild-cards without the overhead of case insensitivity. There's always the 
option of using RLIKE when one is looking for a very flexible match, but I 
think this is a toss-up between compliance with SQL's behavior and being able 
to limit the overhead of the query when possible.

 Like operator in Hive is case sensitive while in MySQL (and most likely other 
 DBs) it's case sensitive
 --

 Key: HIVE-4070
 URL: https://issues.apache.org/jira/browse/HIVE-4070
 Project: Hive
  Issue Type: Bug
  Components: UDF
Affects Versions: 0.10.0
Reporter: Mark Grover
Assignee: Mark Grover
 Fix For: 0.11.0


 Hive's like operator seems to be case sensitive.
 See 
 https://github.com/apache/hive/blob/trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFLike.java#L164
 However, MySQL's like operator is case insensitive. I don't have other DB's 
 (like PostgreSQL) installed and handy but I am guessing their LIKE is case 
 insensitive as well.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-3945) union all datatype do not match may result wrong result

2013-02-25 Thread Brock Noland (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13585947#comment-13585947
 ] 

Brock Noland commented on HIVE-3945:


[~caofangkun] can you clarify when you see the ArrayIndexOutOfBoundsException 
above?

 union all datatype do not match may result wrong result 
 

 Key: HIVE-3945
 URL: https://issues.apache.org/jira/browse/HIVE-3945
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.9.0
Reporter: caofangkun
Priority: Minor

 hive (default) desc src;
 key   string  
 value string  
 select key, value FROM 
 ( select 'key' as key, 'value' as value -- datatype: string, string
   from src s1 limit 1
   UNION  ALL  
   select s2.key as key, sum(s2.value) as value  -- datatype: strung, double 
   from src s2 group by s2.key
  ) unionsrc;
 this query exec normally but has wrong result:
 key   2.4081029415476845E-282-- expected is 'value'
   35.0
 100   100.0
 480.0
 and sometimes when the string title is too long it may case 
 ArrayIndexOutOfBoundsException:
 Caused by: java.lang.ArrayIndexOutOfBoundsException
 at java.lang.System.arraycopy(Native Method)
 at org.apache.hadoop.io.Text.set(Text.java:205)
 at 
 org.apache.hadoop.hive.serde2.lazybinary.LazyBinaryString.init(LazyBinaryString.java:48)
 at 
 org.apache.hadoop.hive.serde2.lazybinary.LazyBinaryStruct.uncheckedGetField(LazyBinaryStruct.java:216)
 at 
 org.apache.hadoop.hive.serde2.lazybinary.LazyBinaryStruct.getField(LazyBinaryStruct.java:197)
 at 
 org.apache.hadoop.hive.serde2.lazybinary.objectinspector.LazyBinaryStructObjectInspector.getStructFieldData(LazyBinaryStructObjectInspector.java:61)
 at 
 org.apache.hadoop.hive.ql.exec.UnionOperator.processOp(UnionOperator.java:125)
 at org.apache.hadoop.hive.ql.exec.Operator.process(Operator.java:471)
 at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:762)
 at org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:531)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3874) Create a new Optimized Row Columnar file format for Hive

2013-02-25 Thread Phabricator (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Phabricator updated HIVE-3874:
--

Attachment: HIVE-3874.D8871.1.patch

omalley requested code review of HIVE-3874 [jira] Create a new Optimized Row 
Columnar file format for Hive.

Reviewers: JIRA

improve some of the comments on WriterImpl

There are several limitations of the current RC File format that I'd like to 
address by creating a new format:

each column value is stored as a binary blob, which means:

the entire column value must be read, decompressed, and 
deserialized
the file format can't use smarter type-specific compression
push down filters can't be evaluated

the start of each row group needs to be found by scanning
user metadata can only be added to the file when the file is created
the file doesn't store the number of rows per a file or row group
there is no mechanism for seeking to a particular row number, which is 
required for external indexes.
there is no mechanism for storing light weight indexes within the file 
to enable push-down filters to skip entire row groups.
the type of the rows aren't stored in the file

TEST PLAN
  EMPTY

REVISION DETAIL
  https://reviews.facebook.net/D8871

AFFECTED FILES
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/WriterImpl.java

MANAGE HERALD RULES
  https://reviews.facebook.net/herald/view/differential/

WHY DID I GET THIS EMAIL?
  https://reviews.facebook.net/herald/transcript/21597/

To: JIRA, omalley


 Create a new Optimized Row Columnar file format for Hive
 

 Key: HIVE-3874
 URL: https://issues.apache.org/jira/browse/HIVE-3874
 Project: Hive
  Issue Type: Improvement
  Components: Serializers/Deserializers
Reporter: Owen O'Malley
Assignee: Owen O'Malley
 Attachments: hive.3874.2.patch, HIVE-3874.D8529.1.patch, 
 HIVE-3874.D8529.2.patch, HIVE-3874.D8529.3.patch, HIVE-3874.D8871.1.patch, 
 OrcFileIntro.pptx, orc.tgz


 There are several limitations of the current RC File format that I'd like to 
 address by creating a new format:
 * each column value is stored as a binary blob, which means:
 ** the entire column value must be read, decompressed, and deserialized
 ** the file format can't use smarter type-specific compression
 ** push down filters can't be evaluated
 * the start of each row group needs to be found by scanning
 * user metadata can only be added to the file when the file is created
 * the file doesn't store the number of rows per a file or row group
 * there is no mechanism for seeking to a particular row number, which is 
 required for external indexes.
 * there is no mechanism for storing light weight indexes within the file to 
 enable push-down filters to skip entire row groups.
 * the type of the rows aren't stored in the file

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-3874) Create a new Optimized Row Columnar file format for Hive

2013-02-25 Thread Phabricator (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586008#comment-13586008
 ] 

Phabricator commented on HIVE-3874:
---

omalley has abandoned the revision HIVE-3874 [jira] Create a new Optimized Row 
Columnar file format for Hive.

REVISION DETAIL
  https://reviews.facebook.net/D8871

To: JIRA, omalley


 Create a new Optimized Row Columnar file format for Hive
 

 Key: HIVE-3874
 URL: https://issues.apache.org/jira/browse/HIVE-3874
 Project: Hive
  Issue Type: Improvement
  Components: Serializers/Deserializers
Reporter: Owen O'Malley
Assignee: Owen O'Malley
 Attachments: hive.3874.2.patch, HIVE-3874.D8529.1.patch, 
 HIVE-3874.D8529.2.patch, HIVE-3874.D8529.3.patch, HIVE-3874.D8871.1.patch, 
 OrcFileIntro.pptx, orc.tgz


 There are several limitations of the current RC File format that I'd like to 
 address by creating a new format:
 * each column value is stored as a binary blob, which means:
 ** the entire column value must be read, decompressed, and deserialized
 ** the file format can't use smarter type-specific compression
 ** push down filters can't be evaluated
 * the start of each row group needs to be found by scanning
 * user metadata can only be added to the file when the file is created
 * the file doesn't store the number of rows per a file or row group
 * there is no mechanism for seeking to a particular row number, which is 
 required for external indexes.
 * there is no mechanism for storing light weight indexes within the file to 
 enable push-down filters to skip entire row groups.
 * the type of the rows aren't stored in the file

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3874) Create a new Optimized Row Columnar file format for Hive

2013-02-25 Thread Phabricator (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Phabricator updated HIVE-3874:
--

Attachment: HIVE-3874.D8529.4.patch

omalley updated the revision HIVE-3874 [jira] Create a new Optimized Row 
Columnar file format for Hive.

- started updating comments
- more style changes
- fix compilation
- fix unit tests
- fix more unit tests
- added more comments

Reviewers: JIRA

REVISION DETAIL
  https://reviews.facebook.net/D8529

CHANGE SINCE LAST DIFF
  https://reviews.facebook.net/D8529?vs=28305id=28617#toc

AFFECTED FILES
  build.properties
  build.xml
  ivy/libraries.properties
  ql/build.xml
  ql/ivy.xml
  ql/src/gen/protobuf/gen-java/org/apache/hadoop/hive/ql/io/orc/OrcProto.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/BitFieldReader.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/BitFieldWriter.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/BooleanColumnStatistics.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/ColumnStatistics.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/ColumnStatisticsImpl.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/CompressionCodec.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/CompressionKind.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/DoubleColumnStatistics.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/DynamicByteArray.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/DynamicIntArray.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/FileDump.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/InStream.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/IntegerColumnStatistics.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcFile.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcInputFormat.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcOutputFormat.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcSerde.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcStruct.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcUnion.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/OutStream.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/PositionProvider.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/PositionRecorder.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/PositionedOutputStream.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/Reader.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/ReaderImpl.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/RecordReader.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/RecordReaderImpl.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/RedBlackTree.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/RunLengthByteReader.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/RunLengthByteWriter.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/RunLengthIntegerReader.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/RunLengthIntegerWriter.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/SerializationUtils.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/SnappyCodec.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/StreamName.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/StringColumnStatistics.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/StringRedBlackTree.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/StripeInformation.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/Writer.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/WriterImpl.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/ZlibCodec.java
  ql/src/java/org/apache/hadoop/hive/ql/io/orc/package-info.java
  ql/src/protobuf/org/apache/hadoop/hive/ql/io/orc/orc_proto.proto
  ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestBitFieldReader.java
  ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestDynamicArray.java
  ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestFileDump.java
  ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestInStream.java
  ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestInputOutputFormat.java
  ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestOrcFile.java
  ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestOrcStruct.java
  ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestRunLengthByteReader.java
  ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestRunLengthIntegerReader.java
  ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestSerializationUtils.java
  ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestStreamName.java
  ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestStringRedBlackTree.java
  ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestZlib.java
  ql/src/test/resources/orc-file-dump.out

To: JIRA, omalley
Cc: kevinwilfong, njain


 Create a new Optimized Row Columnar file format for Hive
 

 Key: HIVE-3874
 URL: https://issues.apache.org/jira/browse/HIVE-3874
 Project: Hive
  Issue Type: 

[jira] [Commented] (HIVE-3874) Create a new Optimized Row Columnar file format for Hive

2013-02-25 Thread Owen O'Malley (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586018#comment-13586018
 ] 

Owen O'Malley commented on HIVE-3874:
-

Ok, I added some additional comments in the Writer as Namit asked and all of 
the unit tests cases pass.

 Create a new Optimized Row Columnar file format for Hive
 

 Key: HIVE-3874
 URL: https://issues.apache.org/jira/browse/HIVE-3874
 Project: Hive
  Issue Type: Improvement
  Components: Serializers/Deserializers
Reporter: Owen O'Malley
Assignee: Owen O'Malley
 Attachments: hive.3874.2.patch, HIVE-3874.D8529.1.patch, 
 HIVE-3874.D8529.2.patch, HIVE-3874.D8529.3.patch, HIVE-3874.D8529.4.patch, 
 HIVE-3874.D8871.1.patch, OrcFileIntro.pptx, orc.tgz


 There are several limitations of the current RC File format that I'd like to 
 address by creating a new format:
 * each column value is stored as a binary blob, which means:
 ** the entire column value must be read, decompressed, and deserialized
 ** the file format can't use smarter type-specific compression
 ** push down filters can't be evaluated
 * the start of each row group needs to be found by scanning
 * user metadata can only be added to the file when the file is created
 * the file doesn't store the number of rows per a file or row group
 * there is no mechanism for seeking to a particular row number, which is 
 required for external indexes.
 * there is no mechanism for storing light weight indexes within the file to 
 enable push-down filters to skip entire row groups.
 * the type of the rows aren't stored in the file

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3874) Create a new Optimized Row Columnar file format for Hive

2013-02-25 Thread Owen O'Malley (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Owen O'Malley updated HIVE-3874:


Status: Patch Available  (was: Open)

 Create a new Optimized Row Columnar file format for Hive
 

 Key: HIVE-3874
 URL: https://issues.apache.org/jira/browse/HIVE-3874
 Project: Hive
  Issue Type: Improvement
  Components: Serializers/Deserializers
Reporter: Owen O'Malley
Assignee: Owen O'Malley
 Attachments: hive.3874.2.patch, HIVE-3874.D8529.1.patch, 
 HIVE-3874.D8529.2.patch, HIVE-3874.D8529.3.patch, HIVE-3874.D8529.4.patch, 
 HIVE-3874.D8871.1.patch, OrcFileIntro.pptx, orc.tgz


 There are several limitations of the current RC File format that I'd like to 
 address by creating a new format:
 * each column value is stored as a binary blob, which means:
 ** the entire column value must be read, decompressed, and deserialized
 ** the file format can't use smarter type-specific compression
 ** push down filters can't be evaluated
 * the start of each row group needs to be found by scanning
 * user metadata can only be added to the file when the file is created
 * the file doesn't store the number of rows per a file or row group
 * there is no mechanism for seeking to a particular row number, which is 
 required for external indexes.
 * there is no mechanism for storing light weight indexes within the file to 
 enable push-down filters to skip entire row groups.
 * the type of the rows aren't stored in the file

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4058) make ORC versioned

2013-02-25 Thread Owen O'Malley (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586021#comment-13586021
 ] 

Owen O'Malley commented on HIVE-4058:
-

The metadata is versioned, it just doesn't have a global version. The intent is 
that new fields can be added to the protobuf and the reader will check if those 
new fields are defined.

 make ORC versioned
 --

 Key: HIVE-4058
 URL: https://issues.apache.org/jira/browse/HIVE-4058
 Project: Hive
  Issue Type: Improvement
  Components: Serializers/Deserializers
Reporter: Namit Jain



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Assigned] (HIVE-4059) Make Column statistics for ORC optional

2013-02-25 Thread Owen O'Malley (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Owen O'Malley reassigned HIVE-4059:
---

Assignee: Owen O'Malley

 Make Column statistics for ORC optional
 ---

 Key: HIVE-4059
 URL: https://issues.apache.org/jira/browse/HIVE-4059
 Project: Hive
  Issue Type: Improvement
  Components: Serializers/Deserializers
Reporter: Namit Jain
Assignee: Owen O'Malley



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Assigned] (HIVE-4015) Add ORC file to the grammar as a file format

2013-02-25 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner reassigned HIVE-4015:


Assignee: Gunther Hagleitner

 Add ORC file to the grammar as a file format
 

 Key: HIVE-4015
 URL: https://issues.apache.org/jira/browse/HIVE-4015
 Project: Hive
  Issue Type: Improvement
Reporter: Owen O'Malley
Assignee: Gunther Hagleitner

 It would be much more convenient for users if we enable them to use ORC as a 
 file format in the HQL grammar. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4015) Add ORC file to the grammar as a file format

2013-02-25 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-4015:
-

Attachment: HIVE-4015.1.patch

Allows for create table ... stored as orc;

 Add ORC file to the grammar as a file format
 

 Key: HIVE-4015
 URL: https://issues.apache.org/jira/browse/HIVE-4015
 Project: Hive
  Issue Type: Improvement
Reporter: Owen O'Malley
Assignee: Gunther Hagleitner
 Attachments: HIVE-4015.1.patch


 It would be much more convenient for users if we enable them to use ORC as a 
 file format in the HQL grammar. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-2264) Hive server is SHUTTING DOWN when invalid queries beeing executed.

2013-02-25 Thread Brock Noland (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586320#comment-13586320
 ] 

Brock Noland commented on HIVE-2264:


Ran into this myself. +1.

 Hive server is SHUTTING DOWN when invalid queries beeing executed.
 --

 Key: HIVE-2264
 URL: https://issues.apache.org/jira/browse/HIVE-2264
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.9.0
 Environment: SuSE-Linux-11
Reporter: rohithsharma
Assignee: Navis
Priority: Critical
 Attachments: HIVE-2264.1.patch.txt


 When invalid query is beeing executed, Hive server is shutting down.
 {noformat}
 CREATE TABLE SAMPLETABLE(IP STRING , showtime BIGINT ) partitioned by (ds 
 string,ipz int) ROW FORMAT DELIMITED FIELDS TERMINATED BY '\040'
 ALTER TABLE SAMPLETABLE add Partition(ds='sf') location 
 '/user/hive/warehouse' Partition(ipz=100) location '/user/hive/warehouse'
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4071) Map-join outer join produces incorrect results.

2013-02-25 Thread Vikram Dixit K (JIRA)
Vikram Dixit K created HIVE-4071:


 Summary: Map-join outer join produces incorrect results.
 Key: HIVE-4071
 URL: https://issues.apache.org/jira/browse/HIVE-4071
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.11.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K


For example, if one sets the size of noConditionalTask.size to 10 with 
corresponding auto join configurations set to true in auto_join28.q instead of 
the current smalltable.filesize configuration, we will observe different 
results if a select query is run. (The test only has explain statements at 
present).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4052) Disallow partition/sort and distribute/order combinations in windowing and partitioning spec

2013-02-25 Thread Harish Butani (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harish Butani updated HIVE-4052:


Attachment: HIVE-4052.1.patch.txt

 Disallow partition/sort and distribute/order combinations in windowing and 
 partitioning spec
 

 Key: HIVE-4052
 URL: https://issues.apache.org/jira/browse/HIVE-4052
 Project: Hive
  Issue Type: Bug
  Components: PTF-Windowing
Reporter: Ashutosh Chauhan
 Attachments: HIVE-4052.1.patch.txt




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4052) Disallow partition/sort and distribute/order combinations in windowing and partitioning spec

2013-02-25 Thread Harish Butani (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586323#comment-13586323
 ] 

Harish Butani commented on HIVE-4052:
-

patch attached.

 Disallow partition/sort and distribute/order combinations in windowing and 
 partitioning spec
 

 Key: HIVE-4052
 URL: https://issues.apache.org/jira/browse/HIVE-4052
 Project: Hive
  Issue Type: Bug
  Components: PTF-Windowing
Reporter: Ashutosh Chauhan
 Attachments: HIVE-4052.1.patch.txt




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4015) Add ORC file to the grammar as a file format

2013-02-25 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-4015:
-

Attachment: HIVE-4015.2.patch

Rebased

 Add ORC file to the grammar as a file format
 

 Key: HIVE-4015
 URL: https://issues.apache.org/jira/browse/HIVE-4015
 Project: Hive
  Issue Type: Improvement
Reporter: Owen O'Malley
Assignee: Gunther Hagleitner
 Attachments: HIVE-4015.1.patch, HIVE-4015.2.patch


 It would be much more convenient for users if we enable them to use ORC as a 
 file format in the HQL grammar. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4071) Map-join outer join produces incorrect results.

2013-02-25 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-4071:
-

Attachment: HIVE-4071.patch

Added modified versions of the auto_join28.q test case and corresponding .out 
files. Also updated test results for existing tests.

 Map-join outer join produces incorrect results.
 ---

 Key: HIVE-4071
 URL: https://issues.apache.org/jira/browse/HIVE-4071
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.11.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-4071.patch


 For example, if one sets the size of noConditionalTask.size to 10 with 
 corresponding auto join configurations set to true in auto_join28.q instead 
 of the current smalltable.filesize configuration, we will observe different 
 results if a select query is run. (The test only has explain statements at 
 present).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4071) Map-join outer join produces incorrect results.

2013-02-25 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-4071:
-

Status: Patch Available  (was: Open)

 Map-join outer join produces incorrect results.
 ---

 Key: HIVE-4071
 URL: https://issues.apache.org/jira/browse/HIVE-4071
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.11.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-4071.patch


 For example, if one sets the size of noConditionalTask.size to 10 with 
 corresponding auto join configurations set to true in auto_join28.q instead 
 of the current smalltable.filesize configuration, we will observe different 
 results if a select query is run. (The test only has explain statements at 
 present).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4015) Add ORC file to the grammar as a file format

2013-02-25 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-4015:
-

Attachment: HIVE-4015.3.patch

.3 is rebased against the right branch...

 Add ORC file to the grammar as a file format
 

 Key: HIVE-4015
 URL: https://issues.apache.org/jira/browse/HIVE-4015
 Project: Hive
  Issue Type: Improvement
Reporter: Owen O'Malley
Assignee: Gunther Hagleitner
 Attachments: HIVE-4015.1.patch, HIVE-4015.2.patch, HIVE-4015.3.patch


 It would be much more convenient for users if we enable them to use ORC as a 
 file format in the HQL grammar. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4072) Hive eclipse build path update for string template jar

2013-02-25 Thread Vikram Dixit K (JIRA)
Vikram Dixit K created HIVE-4072:


 Summary: Hive eclipse build path update for string template jar
 Key: HIVE-4072
 URL: https://issues.apache.org/jira/browse/HIVE-4072
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Affects Versions: 0.11.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Minor


StringTemplate jar version needs to be updated for hive to work with eclipse 
without user intervention.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4072) Hive eclipse build path update for string template jar

2013-02-25 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-4072:
-

Attachment: HIVE-4072.patch

 Hive eclipse build path update for string template jar
 --

 Key: HIVE-4072
 URL: https://issues.apache.org/jira/browse/HIVE-4072
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Affects Versions: 0.11.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Minor
 Attachments: HIVE-4072.patch


 StringTemplate jar version needs to be updated for hive to work with eclipse 
 without user intervention.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4072) Hive eclipse build path update for string template jar

2013-02-25 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-4072:
-

Status: Patch Available  (was: Open)

 Hive eclipse build path update for string template jar
 --

 Key: HIVE-4072
 URL: https://issues.apache.org/jira/browse/HIVE-4072
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Affects Versions: 0.11.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Minor
 Attachments: HIVE-4072.patch


 StringTemplate jar version needs to be updated for hive to work with eclipse 
 without user intervention.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


Jenkins build is back to normal : Hive-0.10.0-SNAPSHOT-h0.20.1 #76

2013-02-25 Thread Apache Jenkins Server
See https://builds.apache.org/job/Hive-0.10.0-SNAPSHOT-h0.20.1/76/



[jira] [Updated] (HIVE-4071) Map-join outer join produces incorrect results.

2013-02-25 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-4071:
-

Attachment: (was: HIVE-4071.patch)

 Map-join outer join produces incorrect results.
 ---

 Key: HIVE-4071
 URL: https://issues.apache.org/jira/browse/HIVE-4071
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.11.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-4071.patch


 For example, if one sets the size of noConditionalTask.size to 10 with 
 corresponding auto join configurations set to true in auto_join28.q instead 
 of the current smalltable.filesize configuration, we will observe different 
 results if a select query is run. (The test only has explain statements at 
 present).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4071) Map-join outer join produces incorrect results.

2013-02-25 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-4071:
-

Attachment: HIVE-4071.patch

 Map-join outer join produces incorrect results.
 ---

 Key: HIVE-4071
 URL: https://issues.apache.org/jira/browse/HIVE-4071
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.11.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-4071.patch


 For example, if one sets the size of noConditionalTask.size to 10 with 
 corresponding auto join configurations set to true in auto_join28.q instead 
 of the current smalltable.filesize configuration, we will observe different 
 results if a select query is run. (The test only has explain statements at 
 present).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4058) make ORC versioned

2013-02-25 Thread Owen O'Malley (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586398#comment-13586398
 ] 

Owen O'Malley commented on HIVE-4058:
-

I should also note that if it is required at some point, we can always create 
such a field in the footer and treat that missing field as a version 0.

 make ORC versioned
 --

 Key: HIVE-4058
 URL: https://issues.apache.org/jira/browse/HIVE-4058
 Project: Hive
  Issue Type: Improvement
  Components: Serializers/Deserializers
Reporter: Namit Jain



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (HIVE-4061) skip columns which are not accessed in the query for ORC

2013-02-25 Thread Owen O'Malley (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Owen O'Malley resolved HIVE-4061.
-

Resolution: Cannot Reproduce

This is already done.

 skip columns which are not accessed in the query for ORC
 

 Key: HIVE-4061
 URL: https://issues.apache.org/jira/browse/HIVE-4061
 Project: Hive
  Issue Type: Improvement
  Components: Serializers/Deserializers
Reporter: Namit Jain



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (HIVE-4058) make ORC versioned

2013-02-25 Thread Owen O'Malley (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Owen O'Malley resolved HIVE-4058.
-

Resolution: Won't Fix

 make ORC versioned
 --

 Key: HIVE-4058
 URL: https://issues.apache.org/jira/browse/HIVE-4058
 Project: Hive
  Issue Type: Improvement
  Components: Serializers/Deserializers
Reporter: Namit Jain



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4053) Add support for phonetic algorithms in Hive

2013-02-25 Thread Krishna (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Krishna updated HIVE-4053:
--

Attachment: HIVE-4053.1.patch.txt

 Add support for phonetic algorithms in Hive
 ---

 Key: HIVE-4053
 URL: https://issues.apache.org/jira/browse/HIVE-4053
 Project: Hive
  Issue Type: New Feature
  Components: UDF
Reporter: Krishna
 Attachments: FunctionRegistry.java, GenericUDFRefinedSoundex.java, 
 HIVE-4053.1.patch.txt


 Following phonetic algorithms should be considered, which are very useful in 
 search:
 Soundex
 Refined Soundex
 Daitch–Mokotoff Soundex
 Metaphone and Double Metaphone
 New York State Identification and Intelligence System (NYSIIS)
 Caverphone

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4053) Add support for phonetic algorithms in Hive

2013-02-25 Thread Krishna (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586422#comment-13586422
 ] 

Krishna commented on HIVE-4053:
---

I've attached the patch to JIRA. How do I post it for review on reviewboard?

 Add support for phonetic algorithms in Hive
 ---

 Key: HIVE-4053
 URL: https://issues.apache.org/jira/browse/HIVE-4053
 Project: Hive
  Issue Type: New Feature
  Components: UDF
Reporter: Krishna
 Attachments: FunctionRegistry.java, GenericUDFRefinedSoundex.java, 
 HIVE-4053.1.patch.txt


 Following phonetic algorithms should be considered, which are very useful in 
 search:
 Soundex
 Refined Soundex
 Daitch–Mokotoff Soundex
 Metaphone and Double Metaphone
 New York State Identification and Intelligence System (NYSIIS)
 Caverphone

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4014) Hive+RCFile is not doing column pruning and reading much more data than necessary

2013-02-25 Thread Tamas Tarjanyi (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586425#comment-13586425
 ] 

Tamas Tarjanyi commented on HIVE-4014:
--

I can not see affected versions here but I found the same issue on 

CDH4.1.3 - which is using hadoop-2.0.0+556 / hive-0.9.0+158

Than I have downloaded and tested 
hadoop 1.0.3 / hive 0.10.0 and
hadoop 1.0.4 / hive 0.10.0 
Both working fine and pruning is effective in these case.


 Hive+RCFile is not doing column pruning and reading much more data than 
 necessary
 -

 Key: HIVE-4014
 URL: https://issues.apache.org/jira/browse/HIVE-4014
 Project: Hive
  Issue Type: Bug
Reporter: Vinod Kumar Vavilapalli
Assignee: Vinod Kumar Vavilapalli

 With even simple projection queries, I see that HDFS bytes read counter 
 doesn't show any reduction in the amount of data read.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


Re: Merging HCatalog into Hive

2013-02-25 Thread Alan Gates

On Feb 24, 2013, at 12:22 PM, Brock Noland wrote:

 Looks good from my perspective and I glad to see this moving forward.
 
 Regarding #4 (JIRA)
 
 I don't know if there's a way to upload existing JIRAs into Hive's JIRA,
 but I think it would be better to leave them where they are.
 
 JIRA has a bulk move feature, but I am curious as why we would leave them
 under the old project? There might be good reason to orphan them, but my
 first thought is that it would be nice to have them under the HIVE project
 simply for search purposes.

I was thinking it would be hard for people who had bookmarks or pointers to the 
existing JIRAs.  Also, since it would change all the JIRA numbers on closed 
JIRAs it would make records from previous releases a mess.  But I see what 
you're saying about making search hard.  Maybe there's a way to leave the 
historical info where it is while importing any active JIRAs into Hive so 
people can search them.

Alan.

 
 Brock
 
 
 
 



Re: Merging HCatalog into Hive

2013-02-25 Thread Brock Noland
Hi,
On Mon, Feb 25, 2013 at 5:20 PM, Alan Gates ga...@hortonworks.com wrote:


 On Feb 24, 2013, at 12:22 PM, Brock Noland wrote:

  Looks good from my perspective and I glad to see this moving forward.
 
  Regarding #4 (JIRA)
 
  I don't know if there's a way to upload existing JIRAs into Hive's JIRA,
  but I think it would be better to leave them where they are.
 
  JIRA has a bulk move feature, but I am curious as why we would leave them
  under the old project? There might be good reason to orphan them, but my
  first thought is that it would be nice to have them under the HIVE
 project
  simply for search purposes.

 I was thinking it would be hard for people who had bookmarks or pointers
 to the existing JIRAs.  Also, since it would change all the JIRA numbers on
 closed JIRAs it would make records from previous releases a mess.  But I
 see what you're saying about making search hard.  Maybe there's a way to
 leave the historical info where it is while importing any active JIRAs into
 Hive so people can search them.


Yeah new jira numbers for old issues, might be a mess. The bulk move
operator allows you to select any combination of issues such as open
issues. Also, once moved the old JIRA will redirect to the new jira. If we
are interested in pursuing this, I'd be happy to help.

Cheers,
Brock


[jira] [Updated] (HIVE-4053) Add support for phonetic algorithms in Hive

2013-02-25 Thread Krishna (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Krishna updated HIVE-4053:
--

Fix Version/s: 0.10.0
   Labels: patch  (was: )
Affects Version/s: 0.10.0
 Release Note: Implementation of the phonetic algorithm - Refined 
Soundex
   Status: Patch Available  (was: Open)

 Add support for phonetic algorithms in Hive
 ---

 Key: HIVE-4053
 URL: https://issues.apache.org/jira/browse/HIVE-4053
 Project: Hive
  Issue Type: New Feature
  Components: UDF
Affects Versions: 0.10.0
Reporter: Krishna
  Labels: patch
 Fix For: 0.10.0

 Attachments: FunctionRegistry.java, GenericUDFRefinedSoundex.java, 
 HIVE-4053.1.patch.txt


 Following phonetic algorithms should be considered, which are very useful in 
 search:
 Soundex
 Refined Soundex
 Daitch–Mokotoff Soundex
 Metaphone and Double Metaphone
 New York State Identification and Intelligence System (NYSIIS)
 Caverphone

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4053) Add support for phonetic algorithms in Hive

2013-02-25 Thread Krishna (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586499#comment-13586499
 ] 

Krishna commented on HIVE-4053:
---

I have submitted the patch; please review the code.

 Add support for phonetic algorithms in Hive
 ---

 Key: HIVE-4053
 URL: https://issues.apache.org/jira/browse/HIVE-4053
 Project: Hive
  Issue Type: New Feature
  Components: UDF
Affects Versions: 0.10.0
Reporter: Krishna
  Labels: patch
 Fix For: 0.10.0

 Attachments: FunctionRegistry.java, GenericUDFRefinedSoundex.java, 
 HIVE-4053.1.patch.txt


 Following phonetic algorithms should be considered, which are very useful in 
 search:
 Soundex
 Refined Soundex
 Daitch–Mokotoff Soundex
 Metaphone and Double Metaphone
 New York State Identification and Intelligence System (NYSIIS)
 Caverphone

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3490) Implement * or a.* for arguments to UDFs

2013-02-25 Thread Navis (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Navis updated HIVE-3490:


Assignee: Navis
  Status: Patch Available  (was: Open)

 Implement * or a.* for arguments to UDFs
 

 Key: HIVE-3490
 URL: https://issues.apache.org/jira/browse/HIVE-3490
 Project: Hive
  Issue Type: Bug
  Components: Query Processor, UDF
Reporter: Adam Kramer
Assignee: Navis

 For a random UDF, we should be able to use * or a.* to refer to all of the 
 columns in their natural order. This is not currently implemented.
 I'm reporting this as a bug because it is a manner in which Hive is 
 inconsistent with the SQL spec, and because Hive claims to implement *.
 hive select all_non_null(a.*) from table a where a.ds='2012-09-01';
 FAILED: ParseException line 1:25 mismatched input '*' expecting Identifier 
 near '.' in expression specification

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3490) Implement * or a.* for arguments to UDFs

2013-02-25 Thread Phabricator (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Phabricator updated HIVE-3490:
--

Attachment: HIVE-3490.D8889.1.patch

navis requested code review of HIVE-3490 [jira] Implement * or a.* for 
arguments to UDFs.

Reviewers: JIRA

HIVE-3490 Implement * or a.* for arguments to UDFs

For a random UDF, we should be able to use * or a.* to refer to all of the 
columns in their natural order. This is not currently implemented.

I'm reporting this as a bug because it is a manner in which Hive is 
inconsistent with the SQL spec, and because Hive claims to implement *.

hive select all_non_null(a.*) from table a where a.ds='2012-09-01';
FAILED: ParseException line 1:25 mismatched input '*' expecting Identifier near 
'.' in expression specification

TEST PLAN
  EMPTY

REVISION DETAIL
  https://reviews.facebook.net/D8889

AFFECTED FILES
  ql/src/java/org/apache/hadoop/hive/ql/parse/IdentifiersParser.g
  ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
  ql/src/java/org/apache/hadoop/hive/ql/parse/TypeCheckProcFactory.java
  ql/src/java/org/apache/hadoop/hive/ql/plan/ExprNodeColumnListDesc.java
  ql/src/test/queries/clientpositive/allcolref_in_udf.q
  ql/src/test/results/clientpositive/allcolref_in_udf.q.out

MANAGE HERALD RULES
  https://reviews.facebook.net/herald/view/differential/

WHY DID I GET THIS EMAIL?
  https://reviews.facebook.net/herald/transcript/21621/

To: JIRA, navis


 Implement * or a.* for arguments to UDFs
 

 Key: HIVE-3490
 URL: https://issues.apache.org/jira/browse/HIVE-3490
 Project: Hive
  Issue Type: Bug
  Components: Query Processor, UDF
Reporter: Adam Kramer
Assignee: Navis
 Attachments: HIVE-3490.D8889.1.patch


 For a random UDF, we should be able to use * or a.* to refer to all of the 
 columns in their natural order. This is not currently implemented.
 I'm reporting this as a bug because it is a manner in which Hive is 
 inconsistent with the SQL spec, and because Hive claims to implement *.
 hive select all_non_null(a.*) from table a where a.ds='2012-09-01';
 FAILED: ParseException line 1:25 mismatched input '*' expecting Identifier 
 near '.' in expression specification

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-948) more query plan optimization rules

2013-02-25 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-948:
--

   Resolution: Fixed
Fix Version/s: 0.11.0
   Status: Resolved  (was: Patch Available)

Committed to trunk. Thanks, Navis!

 more query plan optimization rules 
 ---

 Key: HIVE-948
 URL: https://issues.apache.org/jira/browse/HIVE-948
 Project: Hive
  Issue Type: Improvement
Reporter: Ning Zhang
Assignee: Navis
 Fix For: 0.11.0

 Attachments: HIVE-948.D8463.1.patch, HIVE-948.D8463.2.patch, 
 HIVE-948.D8463.3.patch, HIVE-948.D8463.3.patch, HIVE-948.D8463.4.patch, 
 HIVE-948.D8463.5.patch, HIVE-948.testresult_only.1.txt


 Many query plans are not optimal in that they contain redundant operators. 
 Some examples are unnecessary select operators (select followed by select, 
 select output being the same as input etc.). Even though these operators are 
 not very expensive, they could account for around 10% of CPU time in some 
 simple queries. It seems they are low-hanging fruits that we should pick 
 first. 
 BTW, it seems these optimization rules should be added at the last stage of 
 the physical optimization phase since some redundant operators are added to 
 facilitate physical plan generation. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-1402) Add parallel ORDER BY to Hive

2013-02-25 Thread Phabricator (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Phabricator updated HIVE-1402:
--

Attachment: HIVE-1402.D8895.1.patch

navis requested code review of HIVE-1402 [jira] Add parallel ORDER BY to Hive.

Reviewers: JIRA

HIVE-1402 Add parallel ORDER BY to Hive

TEST PLAN
  EMPTY

REVISION DETAIL
  https://reviews.facebook.net/D8895

AFFECTED FILES
  common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
  ql/src/java/org/apache/hadoop/hive/ql/exec/ExecDriver.java
  ql/src/java/org/apache/hadoop/hive/ql/exec/FetchOperator.java
  ql/src/java/org/apache/hadoop/hive/ql/exec/HiveTotalOrderPartitioner.java
  ql/src/java/org/apache/hadoop/hive/ql/exec/OperatorUtils.java
  ql/src/java/org/apache/hadoop/hive/ql/exec/PartitionKeySampler.java
  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/BucketingSortingCtx.java
  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/BucketingSortingInferenceOptimizer.java
  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/CommonJoinResolver.java
  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/IndexWhereResolver.java
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/MapJoinResolver.java
  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/MetadataOnlyOptimizer.java
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/PhysicalContext.java
  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/PhysicalOptimizer.java
  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/SamplingOptimizer.java
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/SkewJoinResolver.java
  ql/src/java/org/apache/hadoop/hive/ql/plan/MapredWork.java
  ql/src/java/org/apache/hadoop/hive/ql/plan/PartitionDesc.java
  ql/src/java/org/apache/hadoop/hive/ql/udf/UDAFPercentile.java
  ql/src/test/queries/clientpositive/parallel_orderby.q
  ql/src/test/results/clientpositive/parallel_orderby.q.out

MANAGE HERALD RULES
  https://reviews.facebook.net/herald/view/differential/

WHY DID I GET THIS EMAIL?
  https://reviews.facebook.net/herald/transcript/21627/

To: JIRA, navis


 Add parallel ORDER BY to Hive
 -

 Key: HIVE-1402
 URL: https://issues.apache.org/jira/browse/HIVE-1402
 Project: Hive
  Issue Type: New Feature
  Components: Query Processor
Reporter: Jeff Hammerbacher
Assignee: Venkatesh Seetharam
  Labels: optimization
 Attachments: HIVE-1402.D8895.1.patch




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4073) Make partition by optional in over clause

2013-02-25 Thread Ashutosh Chauhan (JIRA)
Ashutosh Chauhan created HIVE-4073:
--

 Summary: Make partition by optional in over clause
 Key: HIVE-4073
 URL: https://issues.apache.org/jira/browse/HIVE-4073
 Project: Hive
  Issue Type: Bug
  Components: PTF-Windowing
Reporter: Ashutosh Chauhan


select s, sum(i) over() from tt; should work. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4073) Make partition by optional in over clause

2013-02-25 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-4073:
---

Description: select s, sum( i ) over() from tt; should work.   (was: select 
s, sum(i) over() from tt; should work. )

 Make partition by optional in over clause
 -

 Key: HIVE-4073
 URL: https://issues.apache.org/jira/browse/HIVE-4073
 Project: Hive
  Issue Type: Bug
  Components: PTF-Windowing
Reporter: Ashutosh Chauhan

 select s, sum( i ) over() from tt; should work. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4073) Make partition by optional in over clause

2013-02-25 Thread Ashutosh Chauhan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586599#comment-13586599
 ] 

Ashutosh Chauhan commented on HIVE-4073:


sql spec allows empty specification in over clause. Currently, our parser fails 
to parse this query. We have order-by optional but we don't need user to 
specify a partitioning spec. Infact we should relax that requirement and if 
user chose not to specify the partitioning column than partition by some 
constant value which achieves desired effect.

 Make partition by optional in over clause
 -

 Key: HIVE-4073
 URL: https://issues.apache.org/jira/browse/HIVE-4073
 Project: Hive
  Issue Type: Bug
  Components: PTF-Windowing
Reporter: Ashutosh Chauhan

 select s, sum( i ) over() from tt; should work. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Comment Edited] (HIVE-4073) Make partition by optional in over clause

2013-02-25 Thread Ashutosh Chauhan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586599#comment-13586599
 ] 

Ashutosh Chauhan edited comment on HIVE-4073 at 2/26/13 1:39 AM:
-

sql spec allows empty specification in over clause. Currently, our parser fails 
to parse this query. We have order-by optional but we do need user to specify a 
partitioning spec. We should relax that requirement and if user chose not to 
specify the partitioning column than partition by some constant value which 
achieves desired effect.

  was (Author: ashutoshc):
sql spec allows empty specification in over clause. Currently, our parser 
fails to parse this query. We have order-by optional but we don't need user to 
specify a partitioning spec. Infact we should relax that requirement and if 
user chose not to specify the partitioning column than partition by some 
constant value which achieves desired effect.
  
 Make partition by optional in over clause
 -

 Key: HIVE-4073
 URL: https://issues.apache.org/jira/browse/HIVE-4073
 Project: Hive
  Issue Type: Bug
  Components: PTF-Windowing
Reporter: Ashutosh Chauhan

 select s, sum( i ) over() from tt; should work. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


Hive-trunk-h0.21 - Build # 1986 - Fixed

2013-02-25 Thread Apache Jenkins Server
Changes for Build #1985

Changes for Build #1986



All tests passed

The Apache Jenkins build system has built Hive-trunk-h0.21 (build #1986)

Status: Fixed

Check console output at https://builds.apache.org/job/Hive-trunk-h0.21/1986/ to 
view the results.

[jira] [Updated] (HIVE-2935) Implement HiveServer2

2013-02-25 Thread Prasad Mujumdar (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasad Mujumdar updated HIVE-2935:
--

Attachment: HIVE-2935-4.patch
HIVE-2935-4.nothrift.patch
HIVE-2935-4.changed-files-only.patch

Rebased patch

 Implement HiveServer2
 -

 Key: HIVE-2935
 URL: https://issues.apache.org/jira/browse/HIVE-2935
 Project: Hive
  Issue Type: New Feature
  Components: Server Infrastructure
Reporter: Carl Steinbach
Assignee: Carl Steinbach
  Labels: HiveServer2
 Attachments: beelinepositive.tar.gz, HIVE-2935.1.notest.patch.txt, 
 HIVE-2935.2.notest.patch.txt, HIVE-2935.2.nothrift.patch.txt, 
 HIVE-2935.3.patch.gz, HIVE-2935-4.changed-files-only.patch, 
 HIVE-2935-4.nothrift.patch, HIVE-2935-4.patch, 
 HIVE-2935.fix.unsecuredoAs.patch, HS2-changed-files-only.patch, 
 HS2-with-thrift-patch-rebased.patch




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-2935) Implement HiveServer2

2013-02-25 Thread Prasad Mujumdar (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586621#comment-13586621
 ] 

Prasad Mujumdar commented on HIVE-2935:
---

Attached updated patch -
  Rebased with latest trunk
  Cherry-picked changes from Thejas's patch
  Additional fixes for user impersonation including support for secure remote 
metastore
  plug-able authentication support

 Implement HiveServer2
 -

 Key: HIVE-2935
 URL: https://issues.apache.org/jira/browse/HIVE-2935
 Project: Hive
  Issue Type: New Feature
  Components: Server Infrastructure
Reporter: Carl Steinbach
Assignee: Carl Steinbach
  Labels: HiveServer2
 Attachments: beelinepositive.tar.gz, HIVE-2935.1.notest.patch.txt, 
 HIVE-2935.2.notest.patch.txt, HIVE-2935.2.nothrift.patch.txt, 
 HIVE-2935.3.patch.gz, HIVE-2935-4.changed-files-only.patch, 
 HIVE-2935-4.nothrift.patch, HIVE-2935-4.patch, 
 HIVE-2935.fix.unsecuredoAs.patch, HS2-changed-files-only.patch, 
 HS2-with-thrift-patch-rebased.patch




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4034) Should be able to specify windowing spec without needing Between

2013-02-25 Thread Brock Noland (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brock Noland updated HIVE-4034:
---

Summary: Should be able to specify windowing spec without needing Between  
(was: Should be able to specify windwoing spec without needing Between)

 Should be able to specify windowing spec without needing Between
 

 Key: HIVE-4034
 URL: https://issues.apache.org/jira/browse/HIVE-4034
 Project: Hive
  Issue Type: Bug
  Components: PTF-Windowing
Reporter: Ashutosh Chauhan

 Currently user need to do following:
 {noformat}
 select s, sum(b) over (distribute by i sort by si rows between unbounded 
 preceding and current row) from over100k;
 {noformat}
 but sql spec allows following as well:
 {noformat}
 select s, sum(b) over (distribute by i sort by si rows unbounded preceding) 
 from over100k;
 {noformat}
 In such cases {{current row}} should be assumed implicitly.  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-2935) Implement HiveServer2

2013-02-25 Thread Prasad Mujumdar (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586630#comment-13586630
 ] 

Prasad Mujumdar commented on HIVE-2935:
---

Review request for the changed files at https://reviews.facebook.net/D7281

 Implement HiveServer2
 -

 Key: HIVE-2935
 URL: https://issues.apache.org/jira/browse/HIVE-2935
 Project: Hive
  Issue Type: New Feature
  Components: Server Infrastructure
Reporter: Carl Steinbach
Assignee: Carl Steinbach
  Labels: HiveServer2
 Attachments: beelinepositive.tar.gz, HIVE-2935.1.notest.patch.txt, 
 HIVE-2935.2.notest.patch.txt, HIVE-2935.2.nothrift.patch.txt, 
 HIVE-2935.3.patch.gz, HIVE-2935-4.changed-files-only.patch, 
 HIVE-2935-4.nothrift.patch, HIVE-2935-4.patch, 
 HIVE-2935.fix.unsecuredoAs.patch, HS2-changed-files-only.patch, 
 HS2-with-thrift-patch-rebased.patch




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4073) Make partition by optional in over clause

2013-02-25 Thread Brock Noland (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brock Noland updated HIVE-4073:
---

Assignee: Brock Noland

 Make partition by optional in over clause
 -

 Key: HIVE-4073
 URL: https://issues.apache.org/jira/browse/HIVE-4073
 Project: Hive
  Issue Type: Bug
  Components: PTF-Windowing
Reporter: Ashutosh Chauhan
Assignee: Brock Noland

 select s, sum( i ) over() from tt; should work. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4014) Hive+RCFile is not doing column pruning and reading much more data than necessary

2013-02-25 Thread Lianhui Wang (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586701#comment-13586701
 ] 

Lianhui Wang commented on HIVE-4014:


i donot think that.
i see the code.
in HiveInputFormat and CombineHiveInputFormat's getRecordReader(), it calls 
pushProjectionsAndFilters().
also in pushProjectionsAndFilters(), from TableScanOperator it get needed 
columns and  set these ids to hive.io.file.readcolumn.ids.
and then in RCFile.Reader will read hive.io.file.readcolumn.ids to skip column.
maybe the counter has some mistakes.
if i have mistake,please tell me.thx.

 Hive+RCFile is not doing column pruning and reading much more data than 
 necessary
 -

 Key: HIVE-4014
 URL: https://issues.apache.org/jira/browse/HIVE-4014
 Project: Hive
  Issue Type: Bug
Reporter: Vinod Kumar Vavilapalli
Assignee: Vinod Kumar Vavilapalli

 With even simple projection queries, I see that HDFS bytes read counter 
 doesn't show any reduction in the amount of data read.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4052) Disallow partition/sort and distribute/order combinations in windowing and partitioning spec

2013-02-25 Thread Phabricator (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Phabricator updated HIVE-4052:
--

Attachment: HIVE-4052.D8901.1.patch

hbutani requested code review of HIVE-4052 [jira] Disallow partition/sort and 
distribute/order combinations in windowing and partitioning spec.

Reviewers: JIRA

change grammar so that only Part-Order or Dist-Sort are valid combinations

TEST PLAN
  EMPTY

REVISION DETAIL
  https://reviews.facebook.net/D8901

AFFECTED FILES
  data/files/flights_tiny.txt
  data/files/part.rc
  data/files/part.seq
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/ColumnPrunerProcFactory.java
  ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g
  ql/src/test/queries/clientnegative/ptf_negative_DistributeByOrderBy.q
  ql/src/test/queries/clientnegative/ptf_negative_PartitionBySortBy.q
  ql/src/test/queries/clientpositive/ptf_general_queries.q
  ql/src/test/results/clientnegative/ptf_negative_DistributeByOrderBy.q.out
  ql/src/test/results/clientnegative/ptf_negative_PartitionBySortBy.q.out
  ql/src/test/results/clientpositive/ptf_general_queries.q.out

MANAGE HERALD RULES
  https://reviews.facebook.net/herald/view/differential/

WHY DID I GET THIS EMAIL?
  https://reviews.facebook.net/herald/transcript/21639/

To: JIRA, hbutani


 Disallow partition/sort and distribute/order combinations in windowing and 
 partitioning spec
 

 Key: HIVE-4052
 URL: https://issues.apache.org/jira/browse/HIVE-4052
 Project: Hive
  Issue Type: Bug
  Components: PTF-Windowing
Reporter: Ashutosh Chauhan
 Attachments: HIVE-4052.1.patch.txt, HIVE-4052.D8901.1.patch




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4052) Disallow partition/sort and distribute/order combinations in windowing and partitioning spec

2013-02-25 Thread Phabricator (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586758#comment-13586758
 ] 

Phabricator commented on HIVE-4052:
---

ashutoshc has accepted the revision HIVE-4052 [jira] Disallow partition/sort 
and distribute/order combinations in windowing and partitioning spec.

  +1

REVISION DETAIL
  https://reviews.facebook.net/D8901

BRANCH
  HIVE-4052

ARCANIST PROJECT
  hive

To: JIRA, ashutoshc, hbutani


 Disallow partition/sort and distribute/order combinations in windowing and 
 partitioning spec
 

 Key: HIVE-4052
 URL: https://issues.apache.org/jira/browse/HIVE-4052
 Project: Hive
  Issue Type: Bug
  Components: PTF-Windowing
Reporter: Ashutosh Chauhan
 Attachments: HIVE-4052.1.patch.txt, HIVE-4052.D8901.1.patch




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-948) more query plan optimization rules

2013-02-25 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586789#comment-13586789
 ] 

Hudson commented on HIVE-948:
-

Integrated in Hive-trunk-h0.21 #1987 (See 
[https://builds.apache.org/job/Hive-trunk-h0.21/1987/])
HIVE-948: more query plan optimization rules (Navis via Ashutosh Chauhan) 
(Revision 1449981)

 Result = FAILURE
hashutosh : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1449981
Files : 
* /hive/trunk/contrib/src/test/results/clientpositive/serde_typedbytes.q.out
* /hive/trunk/contrib/src/test/results/clientpositive/serde_typedbytes5.q.out
* /hive/trunk/hbase-handler/src/test/results/positive/hbase_queries.q.out
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Operator.java
* 
/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/NonBlockingOpDeDupProc.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/Optimizer.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/ExprNodeDescUtils.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java
* 
/hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/PredicateTransitivePropagate.java
* /hive/trunk/ql/src/test/queries/clientpositive/nonblock_op_deduplicate.q
* /hive/trunk/ql/src/test/results/clientnegative/bucket_mapjoin_mismatch1.q.out
* 
/hive/trunk/ql/src/test/results/clientnegative/sortmerge_mapjoin_mismatch_1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/alias_casted_column.q.out
* /hive/trunk/ql/src/test/results/clientpositive/ambiguous_col.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join12.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join14_hadoop20.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join17.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join19.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join2.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join20.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join22.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join26.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join28.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join29.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join3.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join4.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join5.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join6.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join7.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join8.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_join9.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_smb_mapjoin_14.q.out
* /hive/trunk/ql/src/test/results/clientpositive/auto_sortmerge_join_9.q.out
* /hive/trunk/ql/src/test/results/clientpositive/binarysortable_1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucket_groupby.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucket_map_join_1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucket_map_join_2.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketcontext_1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketcontext_2.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketcontext_3.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketcontext_4.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketcontext_5.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketcontext_6.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketcontext_7.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketcontext_8.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketizedhiveinputformat.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin10.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin11.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin12.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin13.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin2.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin3.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin4.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin5.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin7.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin8.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin9.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin_negative.q.out
* /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin_negative2.q.out
* 

Hive-trunk-h0.21 - Build # 1987 - Failure

2013-02-25 Thread Apache Jenkins Server
Changes for Build #1987
[hashutosh] HIVE-948: more query plan optimization rules (Navis via Ashutosh 
Chauhan)




No tests ran.

The Apache Jenkins build system has built Hive-trunk-h0.21 (build #1987)

Status: Failure

Check console output at https://builds.apache.org/job/Hive-trunk-h0.21/1987/ to 
view the results.

[jira] [Updated] (HIVE-684) add UDF make_set

2013-02-25 Thread PRETTY SITHARA (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

PRETTY SITHARA updated HIVE-684:


Attachment: HIVE-684.2.patch.txt

 add UDF make_set
 

 Key: HIVE-684
 URL: https://issues.apache.org/jira/browse/HIVE-684
 Project: Hive
  Issue Type: New Feature
  Components: Query Processor
Reporter: Namit Jain
Assignee: PRETTY SITHARA
 Attachments: HIVE-684.1.patch.txt, HIVE-684.2.patch.txt, 
 input.txt.txt, make_set.q, make_set.q.out


 add UDFmake_set
 look at
 http://dev.mysql.com/doc/refman/5.0/en/func-op-summary-ref.html
 for details

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-684) add UDF make_set

2013-02-25 Thread PRETTY SITHARA (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

PRETTY SITHARA updated HIVE-684:


Status: Patch Available  (was: Open)

Patch for HIVE-684 version 2

 add UDF make_set
 

 Key: HIVE-684
 URL: https://issues.apache.org/jira/browse/HIVE-684
 Project: Hive
  Issue Type: New Feature
  Components: Query Processor
Reporter: Namit Jain
Assignee: PRETTY SITHARA
 Attachments: HIVE-684.1.patch.txt, HIVE-684.2.patch.txt, 
 input.txt.txt, make_set.q, make_set.q.out


 add UDFmake_set
 look at
 http://dev.mysql.com/doc/refman/5.0/en/func-op-summary-ref.html
 for details

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-684) add UDF make_set

2013-02-25 Thread PRETTY SITHARA (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

PRETTY SITHARA updated HIVE-684:


Attachment: (was: HIVE-684.2.patch.txt)

 add UDF make_set
 

 Key: HIVE-684
 URL: https://issues.apache.org/jira/browse/HIVE-684
 Project: Hive
  Issue Type: New Feature
  Components: Query Processor
Reporter: Namit Jain
Assignee: PRETTY SITHARA
 Attachments: HIVE-684.1.patch.txt, input.txt.txt, make_set.q, 
 make_set.q.out


 add UDFmake_set
 look at
 http://dev.mysql.com/doc/refman/5.0/en/func-op-summary-ref.html
 for details

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-684) add UDF make_set

2013-02-25 Thread PRETTY SITHARA (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

PRETTY SITHARA updated HIVE-684:


Status: Open  (was: Patch Available)

 add UDF make_set
 

 Key: HIVE-684
 URL: https://issues.apache.org/jira/browse/HIVE-684
 Project: Hive
  Issue Type: New Feature
  Components: Query Processor
Reporter: Namit Jain
Assignee: PRETTY SITHARA
 Attachments: HIVE-684.1.patch.txt, input.txt.txt, make_set.q, 
 make_set.q.out


 add UDFmake_set
 look at
 http://dev.mysql.com/doc/refman/5.0/en/func-op-summary-ref.html
 for details

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (HIVE-4052) Disallow partition/sort and distribute/order combinations in windowing and partitioning spec

2013-02-25 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan resolved HIVE-4052.


Resolution: Fixed
  Assignee: Harish Butani

Committed to branch. Thanks, Harish!

 Disallow partition/sort and distribute/order combinations in windowing and 
 partitioning spec
 

 Key: HIVE-4052
 URL: https://issues.apache.org/jira/browse/HIVE-4052
 Project: Hive
  Issue Type: Bug
  Components: PTF-Windowing
Reporter: Ashutosh Chauhan
Assignee: Harish Butani
 Attachments: HIVE-4052.1.patch.txt, HIVE-4052.D8901.1.patch




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


Re: Phabricator issues, libphutil v1 libraries are no longer supported

2013-02-25 Thread amareshwari sriramdasu
Hit the same problem as ' libphutil v1 libraries are no longer supported'.

Doing the git checkout as suggested above resolved it.

But doing arc diff --jira HIVE-4018 is just hung with below output :

Linting...
LINT OKAY  No lint problems.
Running unit tests...
No unit test engine is configured for this project.

Is there is any other way to do this?

Thanks
Amareshwari


On Fri, Jan 25, 2013 at 8:44 PM, Bennie Schut bsc...@ebuddy.com wrote:

 I've been having issues using Phabricator to create a patch for hive. It
 looks like some versioning issues with arc_jira_lib and libphutil.
 I kept getting this error: libphutil v1 libraries are no longer
 supported.
 To work around it go to the libphutil directory you checked out of git and
 revert to an older commit:
 git checkout 870bcc76434410344d27a3fa4604ac96200bf7f6

 The next problem is the arc client:
 ERR-BAD-VERSION: Your 'arc' client version is '6', which is newer than the
 server version, '5'. Upgrade your Phabricator install.
 So also get an older version:
 git checkout 6cb8d483b23c132da4c82934438cd1af0e5ba32b

 I'll leave the actual fixing to someone with a bit more php experience.

 After that I tried to arc diff HIVE-3308 added the message then:
 Linting...
 LINT OKAY  No lint problems.
 Running unit tests...
 No unit test engine is configured for this project.
 Usage Exception: There are no changes to generate a diff from!

 In the end I manually uploaded It as D8187. And added the patch to Jira
 even tough I guess that's what arc is suppose to do.

 Bennie.




Re: Phabricator issues, libphutil v1 libraries are no longer supported

2013-02-25 Thread Ashutosh Chauhan
Try following git hash-tags:

arcanist ccdf9ae957f995154374a0dc344fb260b112ce2b

libphutil 10965d5b649e84a60fc2d692f3b1f6a63288cd9d
Ashutosh
On Mon, Feb 25, 2013 at 10:54 PM, amareshwari sriramdasu 
amareshw...@gmail.com wrote:

 Hit the same problem as ' libphutil v1 libraries are no longer supported'.

 Doing the git checkout as suggested above resolved it.

 But doing arc diff --jira HIVE-4018 is just hung with below output :

 Linting...
 LINT OKAY  No lint problems.
 Running unit tests...
 No unit test engine is configured for this project.

 Is there is any other way to do this?

 Thanks
 Amareshwari


 On Fri, Jan 25, 2013 at 8:44 PM, Bennie Schut bsc...@ebuddy.com wrote:

  I've been having issues using Phabricator to create a patch for hive. It
  looks like some versioning issues with arc_jira_lib and libphutil.
  I kept getting this error: libphutil v1 libraries are no longer
  supported.
  To work around it go to the libphutil directory you checked out of git
 and
  revert to an older commit:
  git checkout 870bcc76434410344d27a3fa4604ac96200bf7f6
 
  The next problem is the arc client:
  ERR-BAD-VERSION: Your 'arc' client version is '6', which is newer than
 the
  server version, '5'. Upgrade your Phabricator install.
  So also get an older version:
  git checkout 6cb8d483b23c132da4c82934438cd1af0e5ba32b
 
  I'll leave the actual fixing to someone with a bit more php experience.
 
  After that I tried to arc diff HIVE-3308 added the message then:
  Linting...
  LINT OKAY  No lint problems.
  Running unit tests...
  No unit test engine is configured for this project.
  Usage Exception: There are no changes to generate a diff from!
 
  In the end I manually uploaded It as D8187. And added the patch to Jira
  even tough I guess that's what arc is suppose to do.
 
  Bennie.
 
 



[jira] [Created] (HIVE-4074) Doc update for .8, .9 and .10

2013-02-25 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-4074:


 Summary: Doc update for .8, .9 and .10
 Key: HIVE-4074
 URL: https://issues.apache.org/jira/browse/HIVE-4074
 Project: Hive
  Issue Type: Bug
  Components: Documentation
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


Need to update the javadocs for releases 8, 9 and 10.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira