[jira] [Commented] (HIVE-4042) ignore mapjoin hint
[ https://issues.apache.org/jira/browse/HIVE-4042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13585706#comment-13585706 ] Namit Jain commented on HIVE-4042: -- Updated the default to true, changed some tests/outputs and added this as a query property for easy logging. All the tests passed ignore mapjoin hint --- Key: HIVE-4042 URL: https://issues.apache.org/jira/browse/HIVE-4042 Project: Hive Issue Type: Improvement Components: Query Processor Reporter: Namit Jain Assignee: Namit Jain Attachments: hive.4042.1.patch, hive.4042.2.patch, hive.4042.3.patch, hive.4042.4.patch, hive.4042.5.patch, hive.4042.6.patch After HIVE-3784, in a production environment, it can become difficult to deploy since a lot of production queries can break. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4068) Size of aggregation buffer which uses non-primitive type is not estimated correctly
[ https://issues.apache.org/jira/browse/HIVE-4068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13585722#comment-13585722 ] Phabricator commented on HIVE-4068: --- njain has commented on the revision HIVE-4068 [jira] Size of aggregation buffer which uses non-primitive type is not estimated correctly. I was thinking of a abstract class --- AbstractAggregationBuffer with the annotation. The default implementation of estimate() is empty All known aggregationBuffers extend AbstractAB. if (agg instanceof GenericUDAFEvaluator.EstimableBuffer) { return ((GenericUDAFEvaluator.EstimableBuffer)agg).estimate(); } change instanceof to annotation Mark AB as deprecated REVISION DETAIL https://reviews.facebook.net/D8859 To: JIRA, navis Cc: njain Size of aggregation buffer which uses non-primitive type is not estimated correctly --- Key: HIVE-4068 URL: https://issues.apache.org/jira/browse/HIVE-4068 Project: Hive Issue Type: Improvement Components: Query Processor Reporter: Navis Assignee: Navis Priority: Minor Attachments: HIVE-4068.D8859.1.patch, HIVE-4068.D8859.2.patch Currently, hive assumes an aggregation buffer which holds a map is occupying just 256 byte (fixed). If it's bigger than that in real, OutOfMemoryError can be thrown (especially for 1k buffer). workaround : set hive.map.aggr.hash.percentmemory=smaller value than default(0.5) -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-2780) Implement more restrictive table sampler
[ https://issues.apache.org/jira/browse/HIVE-2780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Navis updated HIVE-2780: Status: Patch Available (was: Open) Implement more restrictive table sampler Key: HIVE-2780 URL: https://issues.apache.org/jira/browse/HIVE-2780 Project: Hive Issue Type: Improvement Reporter: Navis Assignee: Navis Priority: Trivial Attachments: ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.1.patch, ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.2.patch, HIVE-2780.D1623.3.patch, HIVE-2780.D1623.4.patch Current table sampling scans whole block, making more rows included than expected especially for small tables. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-2780) Implement more restrictive table sampler
[ https://issues.apache.org/jira/browse/HIVE-2780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phabricator updated HIVE-2780: -- Attachment: HIVE-2780.D1623.4.patch navis updated the revision HIVE-2780 [jira] Implement more restrictive table sampler. Addressed comments rebased to trunk Reviewers: ashutoshc, JIRA REVISION DETAIL https://reviews.facebook.net/D1623 CHANGE SINCE LAST DIFF https://reviews.facebook.net/D1623?vs=25479id=28587#toc AFFECTED FILES common/src/java/org/apache/hadoop/hive/conf/HiveConf.java conf/hive-default.xml.template ql/src/java/org/apache/hadoop/hive/ql/io/CombineHiveInputFormat.java ql/src/java/org/apache/hadoop/hive/ql/io/DefaultSplitSampler.java ql/src/java/org/apache/hadoop/hive/ql/io/ShirinkSplitSampler.java ql/src/java/org/apache/hadoop/hive/ql/io/SplitSampler.java ql/src/test/queries/clientpositive/split_sample_custom.q ql/src/test/results/clientpositive/split_sample_custom.q.out To: JIRA, ashutoshc, navis Implement more restrictive table sampler Key: HIVE-2780 URL: https://issues.apache.org/jira/browse/HIVE-2780 Project: Hive Issue Type: Improvement Reporter: Navis Assignee: Navis Priority: Trivial Attachments: ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.1.patch, ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.2.patch, HIVE-2780.D1623.3.patch, HIVE-2780.D1623.4.patch Current table sampling scans whole block, making more rows included than expected especially for small tables. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4068) Size of aggregation buffer which uses non-primitive type is not estimated correctly
[ https://issues.apache.org/jira/browse/HIVE-4068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phabricator updated HIVE-4068: -- Attachment: HIVE-4068.D8859.3.patch navis updated the revision HIVE-4068 [jira] Size of aggregation buffer which uses non-primitive type is not estimated correctly. Like this? Reviewers: JIRA REVISION DETAIL https://reviews.facebook.net/D8859 CHANGE SINCE LAST DIFF https://reviews.facebook.net/D8859?vs=28539id=28593#toc AFFECTED FILES ql/src/java/org/apache/hadoop/hive/ql/exec/GroupByOperator.java ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFEWAHBitmap.java ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFEvaluator.java ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDAFHistogramNumeric.java ql/src/java/org/apache/hadoop/hive/ql/udf/generic/NumericHistogram.java To: JIRA, navis Cc: njain Size of aggregation buffer which uses non-primitive type is not estimated correctly --- Key: HIVE-4068 URL: https://issues.apache.org/jira/browse/HIVE-4068 Project: Hive Issue Type: Improvement Components: Query Processor Reporter: Navis Assignee: Navis Priority: Minor Attachments: HIVE-4068.D8859.1.patch, HIVE-4068.D8859.2.patch, HIVE-4068.D8859.3.patch Currently, hive assumes an aggregation buffer which holds a map is occupying just 256 byte (fixed). If it's bigger than that in real, OutOfMemoryError can be thrown (especially for 1k buffer). workaround : set hive.map.aggr.hash.percentmemory=smaller value than default(0.5) -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4018) MapJoin failing with Distributed Cache error
[ https://issues.apache.org/jira/browse/HIVE-4018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Amareshwari Sriramadasu updated HIVE-4018: -- Status: Patch Available (was: Open) MapJoin failing with Distributed Cache error Key: HIVE-4018 URL: https://issues.apache.org/jira/browse/HIVE-4018 Project: Hive Issue Type: Bug Components: SQL Affects Versions: 0.11.0 Reporter: Amareshwari Sriramadasu Assignee: Amareshwari Sriramadasu Fix For: 0.11.0 Attachments: HIVE-4018.patch, hive.4018.test.2.patch, HIVE-4018-test.patch When I'm a running a star join query after HIVE-3784, it is failing with following error: 2013-02-13 08:36:04,584 ERROR org.apache.hadoop.hive.ql.exec.MapJoinOperator: Load Distributed Cache Error 2013-02-13 08:36:04,585 FATAL ExecMapper: org.apache.hadoop.hive.ql.metadata.HiveException: java.io.EOFException at org.apache.hadoop.hive.ql.exec.MapJoinOperator.loadHashTable(MapJoinOperator.java:189) at org.apache.hadoop.hive.ql.exec.MapJoinOperator.cleanUpInputFileChangedOp(MapJoinOperator.java:203) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1421) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425) at org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:614) at org.apache.hadoop.hive.ql.exec.ExecMapper.map(ExecMapper.java:144) at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50) at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:391) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:325) at org.apache.hadoop.mapred.Child$4.run(Child.java:266) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:416) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1278) at org.apache.hadoop.mapred.Child.main(Child.java:260) -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4018) MapJoin failing with Distributed Cache error
[ https://issues.apache.org/jira/browse/HIVE-4018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Amareshwari Sriramadasu updated HIVE-4018: -- Attachment: HIVE-4018.patch Here is a patch which fixes the issue, with testcase added. MapJoin failing with Distributed Cache error Key: HIVE-4018 URL: https://issues.apache.org/jira/browse/HIVE-4018 Project: Hive Issue Type: Bug Components: SQL Affects Versions: 0.11.0 Reporter: Amareshwari Sriramadasu Assignee: Amareshwari Sriramadasu Fix For: 0.11.0 Attachments: HIVE-4018.patch, hive.4018.test.2.patch, HIVE-4018-test.patch When I'm a running a star join query after HIVE-3784, it is failing with following error: 2013-02-13 08:36:04,584 ERROR org.apache.hadoop.hive.ql.exec.MapJoinOperator: Load Distributed Cache Error 2013-02-13 08:36:04,585 FATAL ExecMapper: org.apache.hadoop.hive.ql.metadata.HiveException: java.io.EOFException at org.apache.hadoop.hive.ql.exec.MapJoinOperator.loadHashTable(MapJoinOperator.java:189) at org.apache.hadoop.hive.ql.exec.MapJoinOperator.cleanUpInputFileChangedOp(MapJoinOperator.java:203) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1421) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425) at org.apache.hadoop.hive.ql.exec.Operator.cleanUpInputFileChanged(Operator.java:1425) at org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:614) at org.apache.hadoop.hive.ql.exec.ExecMapper.map(ExecMapper.java:144) at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50) at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:391) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:325) at org.apache.hadoop.mapred.Child$4.run(Child.java:266) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:416) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1278) at org.apache.hadoop.mapred.Child.main(Child.java:260) -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3629) Try bucket split more localized by referencing block locations of small aliases
[ https://issues.apache.org/jira/browse/HIVE-3629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Navis updated HIVE-3629: Status: Patch Available (was: Open) Try bucket split more localized by referencing block locations of small aliases --- Key: HIVE-3629 URL: https://issues.apache.org/jira/browse/HIVE-3629 Project: Hive Issue Type: Improvement Components: Query Processor Reporter: Navis Assignee: Navis Priority: Minor Attachments: HIVE-3629.D6273.1.patch, HIVE-3629.D6273.2.patch Currently splits for SMBMapJoin does not concern about location of small aliases. If all inputs could be on single server, joining would be more faster. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3629) Try bucket split more localized by referencing block locations of small aliases
[ https://issues.apache.org/jira/browse/HIVE-3629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phabricator updated HIVE-3629: -- Attachment: HIVE-3629.D6273.2.patch navis updated the revision HIVE-3629 [jira] Try bucket split more localized by referencing block locations of small aliases. Rebased to trunk Reviewers: JIRA REVISION DETAIL https://reviews.facebook.net/D6273 CHANGE SINCE LAST DIFF https://reviews.facebook.net/D6273?vs=20583id=28599#toc AFFECTED FILES ql/src/java/org/apache/hadoop/hive/ql/exec/SMBMapJoinOperator.java ql/src/java/org/apache/hadoop/hive/ql/io/BucketizedHiveInputFormat.java ql/src/java/org/apache/hadoop/hive/ql/optimizer/MapJoinFactory.java ql/src/java/org/apache/hadoop/hive/ql/plan/BucketMapJoinContext.java ql/src/java/org/apache/hadoop/hive/ql/plan/MapredWork.java To: JIRA, navis Try bucket split more localized by referencing block locations of small aliases --- Key: HIVE-3629 URL: https://issues.apache.org/jira/browse/HIVE-3629 Project: Hive Issue Type: Improvement Components: Query Processor Reporter: Navis Assignee: Navis Priority: Minor Attachments: HIVE-3629.D6273.1.patch, HIVE-3629.D6273.2.patch Currently splits for SMBMapJoin does not concern about location of small aliases. If all inputs could be on single server, joining would be more faster. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3235) Support jobId notification for MaredTasks
[ https://issues.apache.org/jira/browse/HIVE-3235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Navis updated HIVE-3235: Affects Version/s: (was: 0.10.0) Status: Patch Available (was: Open) Support jobId notification for MaredTasks - Key: HIVE-3235 URL: https://issues.apache.org/jira/browse/HIVE-3235 Project: Hive Issue Type: Improvement Components: Diagnosability Reporter: Navis Assignee: Navis Priority: Trivial JobId for each MapredTask is requested for monitoring purpose. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3235) Support jobId notification for MaredTasks
[ https://issues.apache.org/jira/browse/HIVE-3235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phabricator updated HIVE-3235: -- Attachment: HIVE-3235.D4017.2.patch navis updated the revision HIVE-3235 [jira] Support jobId notification for MaredTasks. Addressed comments rebased to trunk Reviewers: JIRA REVISION DETAIL https://reviews.facebook.net/D4017 CHANGE SINCE LAST DIFF https://reviews.facebook.net/D4017?vs=12885id=28605#toc AFFECTED FILES common/src/java/org/apache/hadoop/hive/conf/HiveConf.java ql/src/java/org/apache/hadoop/hive/ql/exec/ExecDriver.java ql/src/java/org/apache/hadoop/hive/ql/exec/Task.java To: JIRA, navis Cc: zhenxiao Support jobId notification for MaredTasks - Key: HIVE-3235 URL: https://issues.apache.org/jira/browse/HIVE-3235 Project: Hive Issue Type: Improvement Components: Diagnosability Reporter: Navis Assignee: Navis Priority: Trivial Attachments: HIVE-3235.D4017.2.patch JobId for each MapredTask is requested for monitoring purpose. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-948) more query plan optimization rules
[ https://issues.apache.org/jira/browse/HIVE-948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13585939#comment-13585939 ] Ashutosh Chauhan commented on HIVE-948: --- Thanks Navis for updating patch. Running tests again. more query plan optimization rules --- Key: HIVE-948 URL: https://issues.apache.org/jira/browse/HIVE-948 Project: Hive Issue Type: Improvement Reporter: Ning Zhang Assignee: Navis Attachments: HIVE-948.D8463.1.patch, HIVE-948.D8463.2.patch, HIVE-948.D8463.3.patch, HIVE-948.D8463.3.patch, HIVE-948.D8463.4.patch, HIVE-948.D8463.5.patch, HIVE-948.testresult_only.1.txt Many query plans are not optimal in that they contain redundant operators. Some examples are unnecessary select operators (select followed by select, select output being the same as input etc.). Even though these operators are not very expensive, they could account for around 10% of CPU time in some simple queries. It seems they are low-hanging fruits that we should pick first. BTW, it seems these optimization rules should be added at the last stage of the physical optimization phase since some redundant operators are added to facilitate physical plan generation. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4070) Like operator in Hive is case sensitive while in MySQL (and most likely other DBs) it's case insensitive
[ https://issues.apache.org/jira/browse/HIVE-4070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Mackrory updated HIVE-4070: Summary: Like operator in Hive is case sensitive while in MySQL (and most likely other DBs) it's case insensitive (was: Like operator in Hive is case sensitive while in MySQL (and most likely other DBs) it's case sensitive) Like operator in Hive is case sensitive while in MySQL (and most likely other DBs) it's case insensitive Key: HIVE-4070 URL: https://issues.apache.org/jira/browse/HIVE-4070 Project: Hive Issue Type: Bug Components: UDF Affects Versions: 0.10.0 Reporter: Mark Grover Assignee: Mark Grover Fix For: 0.11.0 Hive's like operator seems to be case sensitive. See https://github.com/apache/hive/blob/trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFLike.java#L164 However, MySQL's like operator is case insensitive. I don't have other DB's (like PostgreSQL) installed and handy but I am guessing their LIKE is case insensitive as well. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4070) Like operator in Hive is case sensitive while in MySQL (and most likely other DBs) it's case sensitive
[ https://issues.apache.org/jira/browse/HIVE-4070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13585944#comment-13585944 ] Sean Mackrory commented on HIVE-4070: - Though admittedly it was a question of mine trying to clarify HiveQL that spawned this thought, I don't necessarily think HiveQL's LIKE operator _should_ be case insensitive. It would certainly make HiveQL more consistent with SQL if it was, but when running extremely long queries over certain types of data, I could see it being a nice feature that LIKE gives you the flexibility of wild-cards without the overhead of case insensitivity. There's always the option of using RLIKE when one is looking for a very flexible match, but I think this is a toss-up between compliance with SQL's behavior and being able to limit the overhead of the query when possible. Like operator in Hive is case sensitive while in MySQL (and most likely other DBs) it's case sensitive -- Key: HIVE-4070 URL: https://issues.apache.org/jira/browse/HIVE-4070 Project: Hive Issue Type: Bug Components: UDF Affects Versions: 0.10.0 Reporter: Mark Grover Assignee: Mark Grover Fix For: 0.11.0 Hive's like operator seems to be case sensitive. See https://github.com/apache/hive/blob/trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFLike.java#L164 However, MySQL's like operator is case insensitive. I don't have other DB's (like PostgreSQL) installed and handy but I am guessing their LIKE is case insensitive as well. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3945) union all datatype do not match may result wrong result
[ https://issues.apache.org/jira/browse/HIVE-3945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13585947#comment-13585947 ] Brock Noland commented on HIVE-3945: [~caofangkun] can you clarify when you see the ArrayIndexOutOfBoundsException above? union all datatype do not match may result wrong result Key: HIVE-3945 URL: https://issues.apache.org/jira/browse/HIVE-3945 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.9.0 Reporter: caofangkun Priority: Minor hive (default) desc src; key string value string select key, value FROM ( select 'key' as key, 'value' as value -- datatype: string, string from src s1 limit 1 UNION ALL select s2.key as key, sum(s2.value) as value -- datatype: strung, double from src s2 group by s2.key ) unionsrc; this query exec normally but has wrong result: key 2.4081029415476845E-282-- expected is 'value' 35.0 100 100.0 480.0 and sometimes when the string title is too long it may case ArrayIndexOutOfBoundsException: Caused by: java.lang.ArrayIndexOutOfBoundsException at java.lang.System.arraycopy(Native Method) at org.apache.hadoop.io.Text.set(Text.java:205) at org.apache.hadoop.hive.serde2.lazybinary.LazyBinaryString.init(LazyBinaryString.java:48) at org.apache.hadoop.hive.serde2.lazybinary.LazyBinaryStruct.uncheckedGetField(LazyBinaryStruct.java:216) at org.apache.hadoop.hive.serde2.lazybinary.LazyBinaryStruct.getField(LazyBinaryStruct.java:197) at org.apache.hadoop.hive.serde2.lazybinary.objectinspector.LazyBinaryStructObjectInspector.getStructFieldData(LazyBinaryStructObjectInspector.java:61) at org.apache.hadoop.hive.ql.exec.UnionOperator.processOp(UnionOperator.java:125) at org.apache.hadoop.hive.ql.exec.Operator.process(Operator.java:471) at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:762) at org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:531) -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3874) Create a new Optimized Row Columnar file format for Hive
[ https://issues.apache.org/jira/browse/HIVE-3874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phabricator updated HIVE-3874: -- Attachment: HIVE-3874.D8871.1.patch omalley requested code review of HIVE-3874 [jira] Create a new Optimized Row Columnar file format for Hive. Reviewers: JIRA improve some of the comments on WriterImpl There are several limitations of the current RC File format that I'd like to address by creating a new format: each column value is stored as a binary blob, which means: the entire column value must be read, decompressed, and deserialized the file format can't use smarter type-specific compression push down filters can't be evaluated the start of each row group needs to be found by scanning user metadata can only be added to the file when the file is created the file doesn't store the number of rows per a file or row group there is no mechanism for seeking to a particular row number, which is required for external indexes. there is no mechanism for storing light weight indexes within the file to enable push-down filters to skip entire row groups. the type of the rows aren't stored in the file TEST PLAN EMPTY REVISION DETAIL https://reviews.facebook.net/D8871 AFFECTED FILES ql/src/java/org/apache/hadoop/hive/ql/io/orc/WriterImpl.java MANAGE HERALD RULES https://reviews.facebook.net/herald/view/differential/ WHY DID I GET THIS EMAIL? https://reviews.facebook.net/herald/transcript/21597/ To: JIRA, omalley Create a new Optimized Row Columnar file format for Hive Key: HIVE-3874 URL: https://issues.apache.org/jira/browse/HIVE-3874 Project: Hive Issue Type: Improvement Components: Serializers/Deserializers Reporter: Owen O'Malley Assignee: Owen O'Malley Attachments: hive.3874.2.patch, HIVE-3874.D8529.1.patch, HIVE-3874.D8529.2.patch, HIVE-3874.D8529.3.patch, HIVE-3874.D8871.1.patch, OrcFileIntro.pptx, orc.tgz There are several limitations of the current RC File format that I'd like to address by creating a new format: * each column value is stored as a binary blob, which means: ** the entire column value must be read, decompressed, and deserialized ** the file format can't use smarter type-specific compression ** push down filters can't be evaluated * the start of each row group needs to be found by scanning * user metadata can only be added to the file when the file is created * the file doesn't store the number of rows per a file or row group * there is no mechanism for seeking to a particular row number, which is required for external indexes. * there is no mechanism for storing light weight indexes within the file to enable push-down filters to skip entire row groups. * the type of the rows aren't stored in the file -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3874) Create a new Optimized Row Columnar file format for Hive
[ https://issues.apache.org/jira/browse/HIVE-3874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586008#comment-13586008 ] Phabricator commented on HIVE-3874: --- omalley has abandoned the revision HIVE-3874 [jira] Create a new Optimized Row Columnar file format for Hive. REVISION DETAIL https://reviews.facebook.net/D8871 To: JIRA, omalley Create a new Optimized Row Columnar file format for Hive Key: HIVE-3874 URL: https://issues.apache.org/jira/browse/HIVE-3874 Project: Hive Issue Type: Improvement Components: Serializers/Deserializers Reporter: Owen O'Malley Assignee: Owen O'Malley Attachments: hive.3874.2.patch, HIVE-3874.D8529.1.patch, HIVE-3874.D8529.2.patch, HIVE-3874.D8529.3.patch, HIVE-3874.D8871.1.patch, OrcFileIntro.pptx, orc.tgz There are several limitations of the current RC File format that I'd like to address by creating a new format: * each column value is stored as a binary blob, which means: ** the entire column value must be read, decompressed, and deserialized ** the file format can't use smarter type-specific compression ** push down filters can't be evaluated * the start of each row group needs to be found by scanning * user metadata can only be added to the file when the file is created * the file doesn't store the number of rows per a file or row group * there is no mechanism for seeking to a particular row number, which is required for external indexes. * there is no mechanism for storing light weight indexes within the file to enable push-down filters to skip entire row groups. * the type of the rows aren't stored in the file -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3874) Create a new Optimized Row Columnar file format for Hive
[ https://issues.apache.org/jira/browse/HIVE-3874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phabricator updated HIVE-3874: -- Attachment: HIVE-3874.D8529.4.patch omalley updated the revision HIVE-3874 [jira] Create a new Optimized Row Columnar file format for Hive. - started updating comments - more style changes - fix compilation - fix unit tests - fix more unit tests - added more comments Reviewers: JIRA REVISION DETAIL https://reviews.facebook.net/D8529 CHANGE SINCE LAST DIFF https://reviews.facebook.net/D8529?vs=28305id=28617#toc AFFECTED FILES build.properties build.xml ivy/libraries.properties ql/build.xml ql/ivy.xml ql/src/gen/protobuf/gen-java/org/apache/hadoop/hive/ql/io/orc/OrcProto.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/BitFieldReader.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/BitFieldWriter.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/BooleanColumnStatistics.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/ColumnStatistics.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/ColumnStatisticsImpl.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/CompressionCodec.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/CompressionKind.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/DoubleColumnStatistics.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/DynamicByteArray.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/DynamicIntArray.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/FileDump.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/InStream.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/IntegerColumnStatistics.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcFile.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcInputFormat.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcOutputFormat.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcSerde.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcStruct.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcUnion.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/OutStream.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/PositionProvider.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/PositionRecorder.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/PositionedOutputStream.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/Reader.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/ReaderImpl.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/RecordReader.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/RecordReaderImpl.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/RedBlackTree.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/RunLengthByteReader.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/RunLengthByteWriter.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/RunLengthIntegerReader.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/RunLengthIntegerWriter.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/SerializationUtils.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/SnappyCodec.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/StreamName.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/StringColumnStatistics.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/StringRedBlackTree.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/StripeInformation.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/Writer.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/WriterImpl.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/ZlibCodec.java ql/src/java/org/apache/hadoop/hive/ql/io/orc/package-info.java ql/src/protobuf/org/apache/hadoop/hive/ql/io/orc/orc_proto.proto ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestBitFieldReader.java ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestDynamicArray.java ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestFileDump.java ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestInStream.java ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestInputOutputFormat.java ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestOrcFile.java ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestOrcStruct.java ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestRunLengthByteReader.java ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestRunLengthIntegerReader.java ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestSerializationUtils.java ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestStreamName.java ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestStringRedBlackTree.java ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestZlib.java ql/src/test/resources/orc-file-dump.out To: JIRA, omalley Cc: kevinwilfong, njain Create a new Optimized Row Columnar file format for Hive Key: HIVE-3874 URL: https://issues.apache.org/jira/browse/HIVE-3874 Project: Hive Issue Type:
[jira] [Commented] (HIVE-3874) Create a new Optimized Row Columnar file format for Hive
[ https://issues.apache.org/jira/browse/HIVE-3874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586018#comment-13586018 ] Owen O'Malley commented on HIVE-3874: - Ok, I added some additional comments in the Writer as Namit asked and all of the unit tests cases pass. Create a new Optimized Row Columnar file format for Hive Key: HIVE-3874 URL: https://issues.apache.org/jira/browse/HIVE-3874 Project: Hive Issue Type: Improvement Components: Serializers/Deserializers Reporter: Owen O'Malley Assignee: Owen O'Malley Attachments: hive.3874.2.patch, HIVE-3874.D8529.1.patch, HIVE-3874.D8529.2.patch, HIVE-3874.D8529.3.patch, HIVE-3874.D8529.4.patch, HIVE-3874.D8871.1.patch, OrcFileIntro.pptx, orc.tgz There are several limitations of the current RC File format that I'd like to address by creating a new format: * each column value is stored as a binary blob, which means: ** the entire column value must be read, decompressed, and deserialized ** the file format can't use smarter type-specific compression ** push down filters can't be evaluated * the start of each row group needs to be found by scanning * user metadata can only be added to the file when the file is created * the file doesn't store the number of rows per a file or row group * there is no mechanism for seeking to a particular row number, which is required for external indexes. * there is no mechanism for storing light weight indexes within the file to enable push-down filters to skip entire row groups. * the type of the rows aren't stored in the file -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3874) Create a new Optimized Row Columnar file format for Hive
[ https://issues.apache.org/jira/browse/HIVE-3874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley updated HIVE-3874: Status: Patch Available (was: Open) Create a new Optimized Row Columnar file format for Hive Key: HIVE-3874 URL: https://issues.apache.org/jira/browse/HIVE-3874 Project: Hive Issue Type: Improvement Components: Serializers/Deserializers Reporter: Owen O'Malley Assignee: Owen O'Malley Attachments: hive.3874.2.patch, HIVE-3874.D8529.1.patch, HIVE-3874.D8529.2.patch, HIVE-3874.D8529.3.patch, HIVE-3874.D8529.4.patch, HIVE-3874.D8871.1.patch, OrcFileIntro.pptx, orc.tgz There are several limitations of the current RC File format that I'd like to address by creating a new format: * each column value is stored as a binary blob, which means: ** the entire column value must be read, decompressed, and deserialized ** the file format can't use smarter type-specific compression ** push down filters can't be evaluated * the start of each row group needs to be found by scanning * user metadata can only be added to the file when the file is created * the file doesn't store the number of rows per a file or row group * there is no mechanism for seeking to a particular row number, which is required for external indexes. * there is no mechanism for storing light weight indexes within the file to enable push-down filters to skip entire row groups. * the type of the rows aren't stored in the file -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4058) make ORC versioned
[ https://issues.apache.org/jira/browse/HIVE-4058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586021#comment-13586021 ] Owen O'Malley commented on HIVE-4058: - The metadata is versioned, it just doesn't have a global version. The intent is that new fields can be added to the protobuf and the reader will check if those new fields are defined. make ORC versioned -- Key: HIVE-4058 URL: https://issues.apache.org/jira/browse/HIVE-4058 Project: Hive Issue Type: Improvement Components: Serializers/Deserializers Reporter: Namit Jain -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Assigned] (HIVE-4059) Make Column statistics for ORC optional
[ https://issues.apache.org/jira/browse/HIVE-4059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned HIVE-4059: --- Assignee: Owen O'Malley Make Column statistics for ORC optional --- Key: HIVE-4059 URL: https://issues.apache.org/jira/browse/HIVE-4059 Project: Hive Issue Type: Improvement Components: Serializers/Deserializers Reporter: Namit Jain Assignee: Owen O'Malley -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Assigned] (HIVE-4015) Add ORC file to the grammar as a file format
[ https://issues.apache.org/jira/browse/HIVE-4015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gunther Hagleitner reassigned HIVE-4015: Assignee: Gunther Hagleitner Add ORC file to the grammar as a file format Key: HIVE-4015 URL: https://issues.apache.org/jira/browse/HIVE-4015 Project: Hive Issue Type: Improvement Reporter: Owen O'Malley Assignee: Gunther Hagleitner It would be much more convenient for users if we enable them to use ORC as a file format in the HQL grammar. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4015) Add ORC file to the grammar as a file format
[ https://issues.apache.org/jira/browse/HIVE-4015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gunther Hagleitner updated HIVE-4015: - Attachment: HIVE-4015.1.patch Allows for create table ... stored as orc; Add ORC file to the grammar as a file format Key: HIVE-4015 URL: https://issues.apache.org/jira/browse/HIVE-4015 Project: Hive Issue Type: Improvement Reporter: Owen O'Malley Assignee: Gunther Hagleitner Attachments: HIVE-4015.1.patch It would be much more convenient for users if we enable them to use ORC as a file format in the HQL grammar. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-2264) Hive server is SHUTTING DOWN when invalid queries beeing executed.
[ https://issues.apache.org/jira/browse/HIVE-2264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586320#comment-13586320 ] Brock Noland commented on HIVE-2264: Ran into this myself. +1. Hive server is SHUTTING DOWN when invalid queries beeing executed. -- Key: HIVE-2264 URL: https://issues.apache.org/jira/browse/HIVE-2264 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.9.0 Environment: SuSE-Linux-11 Reporter: rohithsharma Assignee: Navis Priority: Critical Attachments: HIVE-2264.1.patch.txt When invalid query is beeing executed, Hive server is shutting down. {noformat} CREATE TABLE SAMPLETABLE(IP STRING , showtime BIGINT ) partitioned by (ds string,ipz int) ROW FORMAT DELIMITED FIELDS TERMINATED BY '\040' ALTER TABLE SAMPLETABLE add Partition(ds='sf') location '/user/hive/warehouse' Partition(ipz=100) location '/user/hive/warehouse' {noformat} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HIVE-4071) Map-join outer join produces incorrect results.
Vikram Dixit K created HIVE-4071: Summary: Map-join outer join produces incorrect results. Key: HIVE-4071 URL: https://issues.apache.org/jira/browse/HIVE-4071 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.11.0 Reporter: Vikram Dixit K Assignee: Vikram Dixit K For example, if one sets the size of noConditionalTask.size to 10 with corresponding auto join configurations set to true in auto_join28.q instead of the current smalltable.filesize configuration, we will observe different results if a select query is run. (The test only has explain statements at present). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4052) Disallow partition/sort and distribute/order combinations in windowing and partitioning spec
[ https://issues.apache.org/jira/browse/HIVE-4052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Harish Butani updated HIVE-4052: Attachment: HIVE-4052.1.patch.txt Disallow partition/sort and distribute/order combinations in windowing and partitioning spec Key: HIVE-4052 URL: https://issues.apache.org/jira/browse/HIVE-4052 Project: Hive Issue Type: Bug Components: PTF-Windowing Reporter: Ashutosh Chauhan Attachments: HIVE-4052.1.patch.txt -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4052) Disallow partition/sort and distribute/order combinations in windowing and partitioning spec
[ https://issues.apache.org/jira/browse/HIVE-4052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586323#comment-13586323 ] Harish Butani commented on HIVE-4052: - patch attached. Disallow partition/sort and distribute/order combinations in windowing and partitioning spec Key: HIVE-4052 URL: https://issues.apache.org/jira/browse/HIVE-4052 Project: Hive Issue Type: Bug Components: PTF-Windowing Reporter: Ashutosh Chauhan Attachments: HIVE-4052.1.patch.txt -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4015) Add ORC file to the grammar as a file format
[ https://issues.apache.org/jira/browse/HIVE-4015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gunther Hagleitner updated HIVE-4015: - Attachment: HIVE-4015.2.patch Rebased Add ORC file to the grammar as a file format Key: HIVE-4015 URL: https://issues.apache.org/jira/browse/HIVE-4015 Project: Hive Issue Type: Improvement Reporter: Owen O'Malley Assignee: Gunther Hagleitner Attachments: HIVE-4015.1.patch, HIVE-4015.2.patch It would be much more convenient for users if we enable them to use ORC as a file format in the HQL grammar. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4071) Map-join outer join produces incorrect results.
[ https://issues.apache.org/jira/browse/HIVE-4071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vikram Dixit K updated HIVE-4071: - Attachment: HIVE-4071.patch Added modified versions of the auto_join28.q test case and corresponding .out files. Also updated test results for existing tests. Map-join outer join produces incorrect results. --- Key: HIVE-4071 URL: https://issues.apache.org/jira/browse/HIVE-4071 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.11.0 Reporter: Vikram Dixit K Assignee: Vikram Dixit K Attachments: HIVE-4071.patch For example, if one sets the size of noConditionalTask.size to 10 with corresponding auto join configurations set to true in auto_join28.q instead of the current smalltable.filesize configuration, we will observe different results if a select query is run. (The test only has explain statements at present). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4071) Map-join outer join produces incorrect results.
[ https://issues.apache.org/jira/browse/HIVE-4071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vikram Dixit K updated HIVE-4071: - Status: Patch Available (was: Open) Map-join outer join produces incorrect results. --- Key: HIVE-4071 URL: https://issues.apache.org/jira/browse/HIVE-4071 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.11.0 Reporter: Vikram Dixit K Assignee: Vikram Dixit K Attachments: HIVE-4071.patch For example, if one sets the size of noConditionalTask.size to 10 with corresponding auto join configurations set to true in auto_join28.q instead of the current smalltable.filesize configuration, we will observe different results if a select query is run. (The test only has explain statements at present). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4015) Add ORC file to the grammar as a file format
[ https://issues.apache.org/jira/browse/HIVE-4015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gunther Hagleitner updated HIVE-4015: - Attachment: HIVE-4015.3.patch .3 is rebased against the right branch... Add ORC file to the grammar as a file format Key: HIVE-4015 URL: https://issues.apache.org/jira/browse/HIVE-4015 Project: Hive Issue Type: Improvement Reporter: Owen O'Malley Assignee: Gunther Hagleitner Attachments: HIVE-4015.1.patch, HIVE-4015.2.patch, HIVE-4015.3.patch It would be much more convenient for users if we enable them to use ORC as a file format in the HQL grammar. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HIVE-4072) Hive eclipse build path update for string template jar
Vikram Dixit K created HIVE-4072: Summary: Hive eclipse build path update for string template jar Key: HIVE-4072 URL: https://issues.apache.org/jira/browse/HIVE-4072 Project: Hive Issue Type: Bug Components: Build Infrastructure Affects Versions: 0.11.0 Reporter: Vikram Dixit K Assignee: Vikram Dixit K Priority: Minor StringTemplate jar version needs to be updated for hive to work with eclipse without user intervention. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4072) Hive eclipse build path update for string template jar
[ https://issues.apache.org/jira/browse/HIVE-4072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vikram Dixit K updated HIVE-4072: - Attachment: HIVE-4072.patch Hive eclipse build path update for string template jar -- Key: HIVE-4072 URL: https://issues.apache.org/jira/browse/HIVE-4072 Project: Hive Issue Type: Bug Components: Build Infrastructure Affects Versions: 0.11.0 Reporter: Vikram Dixit K Assignee: Vikram Dixit K Priority: Minor Attachments: HIVE-4072.patch StringTemplate jar version needs to be updated for hive to work with eclipse without user intervention. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4072) Hive eclipse build path update for string template jar
[ https://issues.apache.org/jira/browse/HIVE-4072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vikram Dixit K updated HIVE-4072: - Status: Patch Available (was: Open) Hive eclipse build path update for string template jar -- Key: HIVE-4072 URL: https://issues.apache.org/jira/browse/HIVE-4072 Project: Hive Issue Type: Bug Components: Build Infrastructure Affects Versions: 0.11.0 Reporter: Vikram Dixit K Assignee: Vikram Dixit K Priority: Minor Attachments: HIVE-4072.patch StringTemplate jar version needs to be updated for hive to work with eclipse without user intervention. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
Jenkins build is back to normal : Hive-0.10.0-SNAPSHOT-h0.20.1 #76
See https://builds.apache.org/job/Hive-0.10.0-SNAPSHOT-h0.20.1/76/
[jira] [Updated] (HIVE-4071) Map-join outer join produces incorrect results.
[ https://issues.apache.org/jira/browse/HIVE-4071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vikram Dixit K updated HIVE-4071: - Attachment: (was: HIVE-4071.patch) Map-join outer join produces incorrect results. --- Key: HIVE-4071 URL: https://issues.apache.org/jira/browse/HIVE-4071 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.11.0 Reporter: Vikram Dixit K Assignee: Vikram Dixit K Attachments: HIVE-4071.patch For example, if one sets the size of noConditionalTask.size to 10 with corresponding auto join configurations set to true in auto_join28.q instead of the current smalltable.filesize configuration, we will observe different results if a select query is run. (The test only has explain statements at present). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4071) Map-join outer join produces incorrect results.
[ https://issues.apache.org/jira/browse/HIVE-4071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vikram Dixit K updated HIVE-4071: - Attachment: HIVE-4071.patch Map-join outer join produces incorrect results. --- Key: HIVE-4071 URL: https://issues.apache.org/jira/browse/HIVE-4071 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.11.0 Reporter: Vikram Dixit K Assignee: Vikram Dixit K Attachments: HIVE-4071.patch For example, if one sets the size of noConditionalTask.size to 10 with corresponding auto join configurations set to true in auto_join28.q instead of the current smalltable.filesize configuration, we will observe different results if a select query is run. (The test only has explain statements at present). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4058) make ORC versioned
[ https://issues.apache.org/jira/browse/HIVE-4058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586398#comment-13586398 ] Owen O'Malley commented on HIVE-4058: - I should also note that if it is required at some point, we can always create such a field in the footer and treat that missing field as a version 0. make ORC versioned -- Key: HIVE-4058 URL: https://issues.apache.org/jira/browse/HIVE-4058 Project: Hive Issue Type: Improvement Components: Serializers/Deserializers Reporter: Namit Jain -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Resolved] (HIVE-4061) skip columns which are not accessed in the query for ORC
[ https://issues.apache.org/jira/browse/HIVE-4061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved HIVE-4061. - Resolution: Cannot Reproduce This is already done. skip columns which are not accessed in the query for ORC Key: HIVE-4061 URL: https://issues.apache.org/jira/browse/HIVE-4061 Project: Hive Issue Type: Improvement Components: Serializers/Deserializers Reporter: Namit Jain -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Resolved] (HIVE-4058) make ORC versioned
[ https://issues.apache.org/jira/browse/HIVE-4058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved HIVE-4058. - Resolution: Won't Fix make ORC versioned -- Key: HIVE-4058 URL: https://issues.apache.org/jira/browse/HIVE-4058 Project: Hive Issue Type: Improvement Components: Serializers/Deserializers Reporter: Namit Jain -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4053) Add support for phonetic algorithms in Hive
[ https://issues.apache.org/jira/browse/HIVE-4053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna updated HIVE-4053: -- Attachment: HIVE-4053.1.patch.txt Add support for phonetic algorithms in Hive --- Key: HIVE-4053 URL: https://issues.apache.org/jira/browse/HIVE-4053 Project: Hive Issue Type: New Feature Components: UDF Reporter: Krishna Attachments: FunctionRegistry.java, GenericUDFRefinedSoundex.java, HIVE-4053.1.patch.txt Following phonetic algorithms should be considered, which are very useful in search: Soundex Refined Soundex Daitch–Mokotoff Soundex Metaphone and Double Metaphone New York State Identification and Intelligence System (NYSIIS) Caverphone -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4053) Add support for phonetic algorithms in Hive
[ https://issues.apache.org/jira/browse/HIVE-4053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586422#comment-13586422 ] Krishna commented on HIVE-4053: --- I've attached the patch to JIRA. How do I post it for review on reviewboard? Add support for phonetic algorithms in Hive --- Key: HIVE-4053 URL: https://issues.apache.org/jira/browse/HIVE-4053 Project: Hive Issue Type: New Feature Components: UDF Reporter: Krishna Attachments: FunctionRegistry.java, GenericUDFRefinedSoundex.java, HIVE-4053.1.patch.txt Following phonetic algorithms should be considered, which are very useful in search: Soundex Refined Soundex Daitch–Mokotoff Soundex Metaphone and Double Metaphone New York State Identification and Intelligence System (NYSIIS) Caverphone -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4014) Hive+RCFile is not doing column pruning and reading much more data than necessary
[ https://issues.apache.org/jira/browse/HIVE-4014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586425#comment-13586425 ] Tamas Tarjanyi commented on HIVE-4014: -- I can not see affected versions here but I found the same issue on CDH4.1.3 - which is using hadoop-2.0.0+556 / hive-0.9.0+158 Than I have downloaded and tested hadoop 1.0.3 / hive 0.10.0 and hadoop 1.0.4 / hive 0.10.0 Both working fine and pruning is effective in these case. Hive+RCFile is not doing column pruning and reading much more data than necessary - Key: HIVE-4014 URL: https://issues.apache.org/jira/browse/HIVE-4014 Project: Hive Issue Type: Bug Reporter: Vinod Kumar Vavilapalli Assignee: Vinod Kumar Vavilapalli With even simple projection queries, I see that HDFS bytes read counter doesn't show any reduction in the amount of data read. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
Re: Merging HCatalog into Hive
On Feb 24, 2013, at 12:22 PM, Brock Noland wrote: Looks good from my perspective and I glad to see this moving forward. Regarding #4 (JIRA) I don't know if there's a way to upload existing JIRAs into Hive's JIRA, but I think it would be better to leave them where they are. JIRA has a bulk move feature, but I am curious as why we would leave them under the old project? There might be good reason to orphan them, but my first thought is that it would be nice to have them under the HIVE project simply for search purposes. I was thinking it would be hard for people who had bookmarks or pointers to the existing JIRAs. Also, since it would change all the JIRA numbers on closed JIRAs it would make records from previous releases a mess. But I see what you're saying about making search hard. Maybe there's a way to leave the historical info where it is while importing any active JIRAs into Hive so people can search them. Alan. Brock
Re: Merging HCatalog into Hive
Hi, On Mon, Feb 25, 2013 at 5:20 PM, Alan Gates ga...@hortonworks.com wrote: On Feb 24, 2013, at 12:22 PM, Brock Noland wrote: Looks good from my perspective and I glad to see this moving forward. Regarding #4 (JIRA) I don't know if there's a way to upload existing JIRAs into Hive's JIRA, but I think it would be better to leave them where they are. JIRA has a bulk move feature, but I am curious as why we would leave them under the old project? There might be good reason to orphan them, but my first thought is that it would be nice to have them under the HIVE project simply for search purposes. I was thinking it would be hard for people who had bookmarks or pointers to the existing JIRAs. Also, since it would change all the JIRA numbers on closed JIRAs it would make records from previous releases a mess. But I see what you're saying about making search hard. Maybe there's a way to leave the historical info where it is while importing any active JIRAs into Hive so people can search them. Yeah new jira numbers for old issues, might be a mess. The bulk move operator allows you to select any combination of issues such as open issues. Also, once moved the old JIRA will redirect to the new jira. If we are interested in pursuing this, I'd be happy to help. Cheers, Brock
[jira] [Updated] (HIVE-4053) Add support for phonetic algorithms in Hive
[ https://issues.apache.org/jira/browse/HIVE-4053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krishna updated HIVE-4053: -- Fix Version/s: 0.10.0 Labels: patch (was: ) Affects Version/s: 0.10.0 Release Note: Implementation of the phonetic algorithm - Refined Soundex Status: Patch Available (was: Open) Add support for phonetic algorithms in Hive --- Key: HIVE-4053 URL: https://issues.apache.org/jira/browse/HIVE-4053 Project: Hive Issue Type: New Feature Components: UDF Affects Versions: 0.10.0 Reporter: Krishna Labels: patch Fix For: 0.10.0 Attachments: FunctionRegistry.java, GenericUDFRefinedSoundex.java, HIVE-4053.1.patch.txt Following phonetic algorithms should be considered, which are very useful in search: Soundex Refined Soundex Daitch–Mokotoff Soundex Metaphone and Double Metaphone New York State Identification and Intelligence System (NYSIIS) Caverphone -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4053) Add support for phonetic algorithms in Hive
[ https://issues.apache.org/jira/browse/HIVE-4053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586499#comment-13586499 ] Krishna commented on HIVE-4053: --- I have submitted the patch; please review the code. Add support for phonetic algorithms in Hive --- Key: HIVE-4053 URL: https://issues.apache.org/jira/browse/HIVE-4053 Project: Hive Issue Type: New Feature Components: UDF Affects Versions: 0.10.0 Reporter: Krishna Labels: patch Fix For: 0.10.0 Attachments: FunctionRegistry.java, GenericUDFRefinedSoundex.java, HIVE-4053.1.patch.txt Following phonetic algorithms should be considered, which are very useful in search: Soundex Refined Soundex Daitch–Mokotoff Soundex Metaphone and Double Metaphone New York State Identification and Intelligence System (NYSIIS) Caverphone -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3490) Implement * or a.* for arguments to UDFs
[ https://issues.apache.org/jira/browse/HIVE-3490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Navis updated HIVE-3490: Assignee: Navis Status: Patch Available (was: Open) Implement * or a.* for arguments to UDFs Key: HIVE-3490 URL: https://issues.apache.org/jira/browse/HIVE-3490 Project: Hive Issue Type: Bug Components: Query Processor, UDF Reporter: Adam Kramer Assignee: Navis For a random UDF, we should be able to use * or a.* to refer to all of the columns in their natural order. This is not currently implemented. I'm reporting this as a bug because it is a manner in which Hive is inconsistent with the SQL spec, and because Hive claims to implement *. hive select all_non_null(a.*) from table a where a.ds='2012-09-01'; FAILED: ParseException line 1:25 mismatched input '*' expecting Identifier near '.' in expression specification -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3490) Implement * or a.* for arguments to UDFs
[ https://issues.apache.org/jira/browse/HIVE-3490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phabricator updated HIVE-3490: -- Attachment: HIVE-3490.D8889.1.patch navis requested code review of HIVE-3490 [jira] Implement * or a.* for arguments to UDFs. Reviewers: JIRA HIVE-3490 Implement * or a.* for arguments to UDFs For a random UDF, we should be able to use * or a.* to refer to all of the columns in their natural order. This is not currently implemented. I'm reporting this as a bug because it is a manner in which Hive is inconsistent with the SQL spec, and because Hive claims to implement *. hive select all_non_null(a.*) from table a where a.ds='2012-09-01'; FAILED: ParseException line 1:25 mismatched input '*' expecting Identifier near '.' in expression specification TEST PLAN EMPTY REVISION DETAIL https://reviews.facebook.net/D8889 AFFECTED FILES ql/src/java/org/apache/hadoop/hive/ql/parse/IdentifiersParser.g ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java ql/src/java/org/apache/hadoop/hive/ql/parse/TypeCheckProcFactory.java ql/src/java/org/apache/hadoop/hive/ql/plan/ExprNodeColumnListDesc.java ql/src/test/queries/clientpositive/allcolref_in_udf.q ql/src/test/results/clientpositive/allcolref_in_udf.q.out MANAGE HERALD RULES https://reviews.facebook.net/herald/view/differential/ WHY DID I GET THIS EMAIL? https://reviews.facebook.net/herald/transcript/21621/ To: JIRA, navis Implement * or a.* for arguments to UDFs Key: HIVE-3490 URL: https://issues.apache.org/jira/browse/HIVE-3490 Project: Hive Issue Type: Bug Components: Query Processor, UDF Reporter: Adam Kramer Assignee: Navis Attachments: HIVE-3490.D8889.1.patch For a random UDF, we should be able to use * or a.* to refer to all of the columns in their natural order. This is not currently implemented. I'm reporting this as a bug because it is a manner in which Hive is inconsistent with the SQL spec, and because Hive claims to implement *. hive select all_non_null(a.*) from table a where a.ds='2012-09-01'; FAILED: ParseException line 1:25 mismatched input '*' expecting Identifier near '.' in expression specification -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-948) more query plan optimization rules
[ https://issues.apache.org/jira/browse/HIVE-948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashutosh Chauhan updated HIVE-948: -- Resolution: Fixed Fix Version/s: 0.11.0 Status: Resolved (was: Patch Available) Committed to trunk. Thanks, Navis! more query plan optimization rules --- Key: HIVE-948 URL: https://issues.apache.org/jira/browse/HIVE-948 Project: Hive Issue Type: Improvement Reporter: Ning Zhang Assignee: Navis Fix For: 0.11.0 Attachments: HIVE-948.D8463.1.patch, HIVE-948.D8463.2.patch, HIVE-948.D8463.3.patch, HIVE-948.D8463.3.patch, HIVE-948.D8463.4.patch, HIVE-948.D8463.5.patch, HIVE-948.testresult_only.1.txt Many query plans are not optimal in that they contain redundant operators. Some examples are unnecessary select operators (select followed by select, select output being the same as input etc.). Even though these operators are not very expensive, they could account for around 10% of CPU time in some simple queries. It seems they are low-hanging fruits that we should pick first. BTW, it seems these optimization rules should be added at the last stage of the physical optimization phase since some redundant operators are added to facilitate physical plan generation. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-1402) Add parallel ORDER BY to Hive
[ https://issues.apache.org/jira/browse/HIVE-1402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phabricator updated HIVE-1402: -- Attachment: HIVE-1402.D8895.1.patch navis requested code review of HIVE-1402 [jira] Add parallel ORDER BY to Hive. Reviewers: JIRA HIVE-1402 Add parallel ORDER BY to Hive TEST PLAN EMPTY REVISION DETAIL https://reviews.facebook.net/D8895 AFFECTED FILES common/src/java/org/apache/hadoop/hive/conf/HiveConf.java ql/src/java/org/apache/hadoop/hive/ql/exec/ExecDriver.java ql/src/java/org/apache/hadoop/hive/ql/exec/FetchOperator.java ql/src/java/org/apache/hadoop/hive/ql/exec/HiveTotalOrderPartitioner.java ql/src/java/org/apache/hadoop/hive/ql/exec/OperatorUtils.java ql/src/java/org/apache/hadoop/hive/ql/exec/PartitionKeySampler.java ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/BucketingSortingCtx.java ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/BucketingSortingInferenceOptimizer.java ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/CommonJoinResolver.java ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/IndexWhereResolver.java ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/MapJoinResolver.java ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/MetadataOnlyOptimizer.java ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/PhysicalContext.java ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/PhysicalOptimizer.java ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/SamplingOptimizer.java ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/SkewJoinResolver.java ql/src/java/org/apache/hadoop/hive/ql/plan/MapredWork.java ql/src/java/org/apache/hadoop/hive/ql/plan/PartitionDesc.java ql/src/java/org/apache/hadoop/hive/ql/udf/UDAFPercentile.java ql/src/test/queries/clientpositive/parallel_orderby.q ql/src/test/results/clientpositive/parallel_orderby.q.out MANAGE HERALD RULES https://reviews.facebook.net/herald/view/differential/ WHY DID I GET THIS EMAIL? https://reviews.facebook.net/herald/transcript/21627/ To: JIRA, navis Add parallel ORDER BY to Hive - Key: HIVE-1402 URL: https://issues.apache.org/jira/browse/HIVE-1402 Project: Hive Issue Type: New Feature Components: Query Processor Reporter: Jeff Hammerbacher Assignee: Venkatesh Seetharam Labels: optimization Attachments: HIVE-1402.D8895.1.patch -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HIVE-4073) Make partition by optional in over clause
Ashutosh Chauhan created HIVE-4073: -- Summary: Make partition by optional in over clause Key: HIVE-4073 URL: https://issues.apache.org/jira/browse/HIVE-4073 Project: Hive Issue Type: Bug Components: PTF-Windowing Reporter: Ashutosh Chauhan select s, sum(i) over() from tt; should work. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4073) Make partition by optional in over clause
[ https://issues.apache.org/jira/browse/HIVE-4073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashutosh Chauhan updated HIVE-4073: --- Description: select s, sum( i ) over() from tt; should work. (was: select s, sum(i) over() from tt; should work. ) Make partition by optional in over clause - Key: HIVE-4073 URL: https://issues.apache.org/jira/browse/HIVE-4073 Project: Hive Issue Type: Bug Components: PTF-Windowing Reporter: Ashutosh Chauhan select s, sum( i ) over() from tt; should work. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4073) Make partition by optional in over clause
[ https://issues.apache.org/jira/browse/HIVE-4073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586599#comment-13586599 ] Ashutosh Chauhan commented on HIVE-4073: sql spec allows empty specification in over clause. Currently, our parser fails to parse this query. We have order-by optional but we don't need user to specify a partitioning spec. Infact we should relax that requirement and if user chose not to specify the partitioning column than partition by some constant value which achieves desired effect. Make partition by optional in over clause - Key: HIVE-4073 URL: https://issues.apache.org/jira/browse/HIVE-4073 Project: Hive Issue Type: Bug Components: PTF-Windowing Reporter: Ashutosh Chauhan select s, sum( i ) over() from tt; should work. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Comment Edited] (HIVE-4073) Make partition by optional in over clause
[ https://issues.apache.org/jira/browse/HIVE-4073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586599#comment-13586599 ] Ashutosh Chauhan edited comment on HIVE-4073 at 2/26/13 1:39 AM: - sql spec allows empty specification in over clause. Currently, our parser fails to parse this query. We have order-by optional but we do need user to specify a partitioning spec. We should relax that requirement and if user chose not to specify the partitioning column than partition by some constant value which achieves desired effect. was (Author: ashutoshc): sql spec allows empty specification in over clause. Currently, our parser fails to parse this query. We have order-by optional but we don't need user to specify a partitioning spec. Infact we should relax that requirement and if user chose not to specify the partitioning column than partition by some constant value which achieves desired effect. Make partition by optional in over clause - Key: HIVE-4073 URL: https://issues.apache.org/jira/browse/HIVE-4073 Project: Hive Issue Type: Bug Components: PTF-Windowing Reporter: Ashutosh Chauhan select s, sum( i ) over() from tt; should work. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
Hive-trunk-h0.21 - Build # 1986 - Fixed
Changes for Build #1985 Changes for Build #1986 All tests passed The Apache Jenkins build system has built Hive-trunk-h0.21 (build #1986) Status: Fixed Check console output at https://builds.apache.org/job/Hive-trunk-h0.21/1986/ to view the results.
[jira] [Updated] (HIVE-2935) Implement HiveServer2
[ https://issues.apache.org/jira/browse/HIVE-2935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasad Mujumdar updated HIVE-2935: -- Attachment: HIVE-2935-4.patch HIVE-2935-4.nothrift.patch HIVE-2935-4.changed-files-only.patch Rebased patch Implement HiveServer2 - Key: HIVE-2935 URL: https://issues.apache.org/jira/browse/HIVE-2935 Project: Hive Issue Type: New Feature Components: Server Infrastructure Reporter: Carl Steinbach Assignee: Carl Steinbach Labels: HiveServer2 Attachments: beelinepositive.tar.gz, HIVE-2935.1.notest.patch.txt, HIVE-2935.2.notest.patch.txt, HIVE-2935.2.nothrift.patch.txt, HIVE-2935.3.patch.gz, HIVE-2935-4.changed-files-only.patch, HIVE-2935-4.nothrift.patch, HIVE-2935-4.patch, HIVE-2935.fix.unsecuredoAs.patch, HS2-changed-files-only.patch, HS2-with-thrift-patch-rebased.patch -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-2935) Implement HiveServer2
[ https://issues.apache.org/jira/browse/HIVE-2935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586621#comment-13586621 ] Prasad Mujumdar commented on HIVE-2935: --- Attached updated patch - Rebased with latest trunk Cherry-picked changes from Thejas's patch Additional fixes for user impersonation including support for secure remote metastore plug-able authentication support Implement HiveServer2 - Key: HIVE-2935 URL: https://issues.apache.org/jira/browse/HIVE-2935 Project: Hive Issue Type: New Feature Components: Server Infrastructure Reporter: Carl Steinbach Assignee: Carl Steinbach Labels: HiveServer2 Attachments: beelinepositive.tar.gz, HIVE-2935.1.notest.patch.txt, HIVE-2935.2.notest.patch.txt, HIVE-2935.2.nothrift.patch.txt, HIVE-2935.3.patch.gz, HIVE-2935-4.changed-files-only.patch, HIVE-2935-4.nothrift.patch, HIVE-2935-4.patch, HIVE-2935.fix.unsecuredoAs.patch, HS2-changed-files-only.patch, HS2-with-thrift-patch-rebased.patch -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4034) Should be able to specify windowing spec without needing Between
[ https://issues.apache.org/jira/browse/HIVE-4034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brock Noland updated HIVE-4034: --- Summary: Should be able to specify windowing spec without needing Between (was: Should be able to specify windwoing spec without needing Between) Should be able to specify windowing spec without needing Between Key: HIVE-4034 URL: https://issues.apache.org/jira/browse/HIVE-4034 Project: Hive Issue Type: Bug Components: PTF-Windowing Reporter: Ashutosh Chauhan Currently user need to do following: {noformat} select s, sum(b) over (distribute by i sort by si rows between unbounded preceding and current row) from over100k; {noformat} but sql spec allows following as well: {noformat} select s, sum(b) over (distribute by i sort by si rows unbounded preceding) from over100k; {noformat} In such cases {{current row}} should be assumed implicitly. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-2935) Implement HiveServer2
[ https://issues.apache.org/jira/browse/HIVE-2935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586630#comment-13586630 ] Prasad Mujumdar commented on HIVE-2935: --- Review request for the changed files at https://reviews.facebook.net/D7281 Implement HiveServer2 - Key: HIVE-2935 URL: https://issues.apache.org/jira/browse/HIVE-2935 Project: Hive Issue Type: New Feature Components: Server Infrastructure Reporter: Carl Steinbach Assignee: Carl Steinbach Labels: HiveServer2 Attachments: beelinepositive.tar.gz, HIVE-2935.1.notest.patch.txt, HIVE-2935.2.notest.patch.txt, HIVE-2935.2.nothrift.patch.txt, HIVE-2935.3.patch.gz, HIVE-2935-4.changed-files-only.patch, HIVE-2935-4.nothrift.patch, HIVE-2935-4.patch, HIVE-2935.fix.unsecuredoAs.patch, HS2-changed-files-only.patch, HS2-with-thrift-patch-rebased.patch -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4073) Make partition by optional in over clause
[ https://issues.apache.org/jira/browse/HIVE-4073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brock Noland updated HIVE-4073: --- Assignee: Brock Noland Make partition by optional in over clause - Key: HIVE-4073 URL: https://issues.apache.org/jira/browse/HIVE-4073 Project: Hive Issue Type: Bug Components: PTF-Windowing Reporter: Ashutosh Chauhan Assignee: Brock Noland select s, sum( i ) over() from tt; should work. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4014) Hive+RCFile is not doing column pruning and reading much more data than necessary
[ https://issues.apache.org/jira/browse/HIVE-4014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586701#comment-13586701 ] Lianhui Wang commented on HIVE-4014: i donot think that. i see the code. in HiveInputFormat and CombineHiveInputFormat's getRecordReader(), it calls pushProjectionsAndFilters(). also in pushProjectionsAndFilters(), from TableScanOperator it get needed columns and set these ids to hive.io.file.readcolumn.ids. and then in RCFile.Reader will read hive.io.file.readcolumn.ids to skip column. maybe the counter has some mistakes. if i have mistake,please tell me.thx. Hive+RCFile is not doing column pruning and reading much more data than necessary - Key: HIVE-4014 URL: https://issues.apache.org/jira/browse/HIVE-4014 Project: Hive Issue Type: Bug Reporter: Vinod Kumar Vavilapalli Assignee: Vinod Kumar Vavilapalli With even simple projection queries, I see that HDFS bytes read counter doesn't show any reduction in the amount of data read. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4052) Disallow partition/sort and distribute/order combinations in windowing and partitioning spec
[ https://issues.apache.org/jira/browse/HIVE-4052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phabricator updated HIVE-4052: -- Attachment: HIVE-4052.D8901.1.patch hbutani requested code review of HIVE-4052 [jira] Disallow partition/sort and distribute/order combinations in windowing and partitioning spec. Reviewers: JIRA change grammar so that only Part-Order or Dist-Sort are valid combinations TEST PLAN EMPTY REVISION DETAIL https://reviews.facebook.net/D8901 AFFECTED FILES data/files/flights_tiny.txt data/files/part.rc data/files/part.seq ql/src/java/org/apache/hadoop/hive/ql/optimizer/ColumnPrunerProcFactory.java ql/src/java/org/apache/hadoop/hive/ql/parse/Hive.g ql/src/test/queries/clientnegative/ptf_negative_DistributeByOrderBy.q ql/src/test/queries/clientnegative/ptf_negative_PartitionBySortBy.q ql/src/test/queries/clientpositive/ptf_general_queries.q ql/src/test/results/clientnegative/ptf_negative_DistributeByOrderBy.q.out ql/src/test/results/clientnegative/ptf_negative_PartitionBySortBy.q.out ql/src/test/results/clientpositive/ptf_general_queries.q.out MANAGE HERALD RULES https://reviews.facebook.net/herald/view/differential/ WHY DID I GET THIS EMAIL? https://reviews.facebook.net/herald/transcript/21639/ To: JIRA, hbutani Disallow partition/sort and distribute/order combinations in windowing and partitioning spec Key: HIVE-4052 URL: https://issues.apache.org/jira/browse/HIVE-4052 Project: Hive Issue Type: Bug Components: PTF-Windowing Reporter: Ashutosh Chauhan Attachments: HIVE-4052.1.patch.txt, HIVE-4052.D8901.1.patch -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4052) Disallow partition/sort and distribute/order combinations in windowing and partitioning spec
[ https://issues.apache.org/jira/browse/HIVE-4052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586758#comment-13586758 ] Phabricator commented on HIVE-4052: --- ashutoshc has accepted the revision HIVE-4052 [jira] Disallow partition/sort and distribute/order combinations in windowing and partitioning spec. +1 REVISION DETAIL https://reviews.facebook.net/D8901 BRANCH HIVE-4052 ARCANIST PROJECT hive To: JIRA, ashutoshc, hbutani Disallow partition/sort and distribute/order combinations in windowing and partitioning spec Key: HIVE-4052 URL: https://issues.apache.org/jira/browse/HIVE-4052 Project: Hive Issue Type: Bug Components: PTF-Windowing Reporter: Ashutosh Chauhan Attachments: HIVE-4052.1.patch.txt, HIVE-4052.D8901.1.patch -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-948) more query plan optimization rules
[ https://issues.apache.org/jira/browse/HIVE-948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13586789#comment-13586789 ] Hudson commented on HIVE-948: - Integrated in Hive-trunk-h0.21 #1987 (See [https://builds.apache.org/job/Hive-trunk-h0.21/1987/]) HIVE-948: more query plan optimization rules (Navis via Ashutosh Chauhan) (Revision 1449981) Result = FAILURE hashutosh : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1449981 Files : * /hive/trunk/contrib/src/test/results/clientpositive/serde_typedbytes.q.out * /hive/trunk/contrib/src/test/results/clientpositive/serde_typedbytes5.q.out * /hive/trunk/hbase-handler/src/test/results/positive/hbase_queries.q.out * /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Operator.java * /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/NonBlockingOpDeDupProc.java * /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/Optimizer.java * /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/plan/ExprNodeDescUtils.java * /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/OpProcFactory.java * /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/ppd/PredicateTransitivePropagate.java * /hive/trunk/ql/src/test/queries/clientpositive/nonblock_op_deduplicate.q * /hive/trunk/ql/src/test/results/clientnegative/bucket_mapjoin_mismatch1.q.out * /hive/trunk/ql/src/test/results/clientnegative/sortmerge_mapjoin_mismatch_1.q.out * /hive/trunk/ql/src/test/results/clientpositive/alias_casted_column.q.out * /hive/trunk/ql/src/test/results/clientpositive/ambiguous_col.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join1.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join12.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join14_hadoop20.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join17.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join19.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join2.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join20.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join22.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join26.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join28.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join29.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join3.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join4.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join5.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join6.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join7.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join8.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_join9.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_smb_mapjoin_14.q.out * /hive/trunk/ql/src/test/results/clientpositive/auto_sortmerge_join_9.q.out * /hive/trunk/ql/src/test/results/clientpositive/binarysortable_1.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucket_groupby.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucket_map_join_1.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucket_map_join_2.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketcontext_1.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketcontext_2.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketcontext_3.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketcontext_4.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketcontext_5.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketcontext_6.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketcontext_7.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketcontext_8.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketizedhiveinputformat.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin1.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin10.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin11.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin12.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin13.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin2.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin3.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin4.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin5.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin7.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin8.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin9.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin_negative.q.out * /hive/trunk/ql/src/test/results/clientpositive/bucketmapjoin_negative2.q.out *
Hive-trunk-h0.21 - Build # 1987 - Failure
Changes for Build #1987 [hashutosh] HIVE-948: more query plan optimization rules (Navis via Ashutosh Chauhan) No tests ran. The Apache Jenkins build system has built Hive-trunk-h0.21 (build #1987) Status: Failure Check console output at https://builds.apache.org/job/Hive-trunk-h0.21/1987/ to view the results.
[jira] [Updated] (HIVE-684) add UDF make_set
[ https://issues.apache.org/jira/browse/HIVE-684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] PRETTY SITHARA updated HIVE-684: Attachment: HIVE-684.2.patch.txt add UDF make_set Key: HIVE-684 URL: https://issues.apache.org/jira/browse/HIVE-684 Project: Hive Issue Type: New Feature Components: Query Processor Reporter: Namit Jain Assignee: PRETTY SITHARA Attachments: HIVE-684.1.patch.txt, HIVE-684.2.patch.txt, input.txt.txt, make_set.q, make_set.q.out add UDFmake_set look at http://dev.mysql.com/doc/refman/5.0/en/func-op-summary-ref.html for details -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-684) add UDF make_set
[ https://issues.apache.org/jira/browse/HIVE-684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] PRETTY SITHARA updated HIVE-684: Status: Patch Available (was: Open) Patch for HIVE-684 version 2 add UDF make_set Key: HIVE-684 URL: https://issues.apache.org/jira/browse/HIVE-684 Project: Hive Issue Type: New Feature Components: Query Processor Reporter: Namit Jain Assignee: PRETTY SITHARA Attachments: HIVE-684.1.patch.txt, HIVE-684.2.patch.txt, input.txt.txt, make_set.q, make_set.q.out add UDFmake_set look at http://dev.mysql.com/doc/refman/5.0/en/func-op-summary-ref.html for details -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-684) add UDF make_set
[ https://issues.apache.org/jira/browse/HIVE-684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] PRETTY SITHARA updated HIVE-684: Attachment: (was: HIVE-684.2.patch.txt) add UDF make_set Key: HIVE-684 URL: https://issues.apache.org/jira/browse/HIVE-684 Project: Hive Issue Type: New Feature Components: Query Processor Reporter: Namit Jain Assignee: PRETTY SITHARA Attachments: HIVE-684.1.patch.txt, input.txt.txt, make_set.q, make_set.q.out add UDFmake_set look at http://dev.mysql.com/doc/refman/5.0/en/func-op-summary-ref.html for details -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-684) add UDF make_set
[ https://issues.apache.org/jira/browse/HIVE-684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] PRETTY SITHARA updated HIVE-684: Status: Open (was: Patch Available) add UDF make_set Key: HIVE-684 URL: https://issues.apache.org/jira/browse/HIVE-684 Project: Hive Issue Type: New Feature Components: Query Processor Reporter: Namit Jain Assignee: PRETTY SITHARA Attachments: HIVE-684.1.patch.txt, input.txt.txt, make_set.q, make_set.q.out add UDFmake_set look at http://dev.mysql.com/doc/refman/5.0/en/func-op-summary-ref.html for details -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Resolved] (HIVE-4052) Disallow partition/sort and distribute/order combinations in windowing and partitioning spec
[ https://issues.apache.org/jira/browse/HIVE-4052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashutosh Chauhan resolved HIVE-4052. Resolution: Fixed Assignee: Harish Butani Committed to branch. Thanks, Harish! Disallow partition/sort and distribute/order combinations in windowing and partitioning spec Key: HIVE-4052 URL: https://issues.apache.org/jira/browse/HIVE-4052 Project: Hive Issue Type: Bug Components: PTF-Windowing Reporter: Ashutosh Chauhan Assignee: Harish Butani Attachments: HIVE-4052.1.patch.txt, HIVE-4052.D8901.1.patch -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
Re: Phabricator issues, libphutil v1 libraries are no longer supported
Hit the same problem as ' libphutil v1 libraries are no longer supported'. Doing the git checkout as suggested above resolved it. But doing arc diff --jira HIVE-4018 is just hung with below output : Linting... LINT OKAY No lint problems. Running unit tests... No unit test engine is configured for this project. Is there is any other way to do this? Thanks Amareshwari On Fri, Jan 25, 2013 at 8:44 PM, Bennie Schut bsc...@ebuddy.com wrote: I've been having issues using Phabricator to create a patch for hive. It looks like some versioning issues with arc_jira_lib and libphutil. I kept getting this error: libphutil v1 libraries are no longer supported. To work around it go to the libphutil directory you checked out of git and revert to an older commit: git checkout 870bcc76434410344d27a3fa4604ac96200bf7f6 The next problem is the arc client: ERR-BAD-VERSION: Your 'arc' client version is '6', which is newer than the server version, '5'. Upgrade your Phabricator install. So also get an older version: git checkout 6cb8d483b23c132da4c82934438cd1af0e5ba32b I'll leave the actual fixing to someone with a bit more php experience. After that I tried to arc diff HIVE-3308 added the message then: Linting... LINT OKAY No lint problems. Running unit tests... No unit test engine is configured for this project. Usage Exception: There are no changes to generate a diff from! In the end I manually uploaded It as D8187. And added the patch to Jira even tough I guess that's what arc is suppose to do. Bennie.
Re: Phabricator issues, libphutil v1 libraries are no longer supported
Try following git hash-tags: arcanist ccdf9ae957f995154374a0dc344fb260b112ce2b libphutil 10965d5b649e84a60fc2d692f3b1f6a63288cd9d Ashutosh On Mon, Feb 25, 2013 at 10:54 PM, amareshwari sriramdasu amareshw...@gmail.com wrote: Hit the same problem as ' libphutil v1 libraries are no longer supported'. Doing the git checkout as suggested above resolved it. But doing arc diff --jira HIVE-4018 is just hung with below output : Linting... LINT OKAY No lint problems. Running unit tests... No unit test engine is configured for this project. Is there is any other way to do this? Thanks Amareshwari On Fri, Jan 25, 2013 at 8:44 PM, Bennie Schut bsc...@ebuddy.com wrote: I've been having issues using Phabricator to create a patch for hive. It looks like some versioning issues with arc_jira_lib and libphutil. I kept getting this error: libphutil v1 libraries are no longer supported. To work around it go to the libphutil directory you checked out of git and revert to an older commit: git checkout 870bcc76434410344d27a3fa4604ac96200bf7f6 The next problem is the arc client: ERR-BAD-VERSION: Your 'arc' client version is '6', which is newer than the server version, '5'. Upgrade your Phabricator install. So also get an older version: git checkout 6cb8d483b23c132da4c82934438cd1af0e5ba32b I'll leave the actual fixing to someone with a bit more php experience. After that I tried to arc diff HIVE-3308 added the message then: Linting... LINT OKAY No lint problems. Running unit tests... No unit test engine is configured for this project. Usage Exception: There are no changes to generate a diff from! In the end I manually uploaded It as D8187. And added the patch to Jira even tough I guess that's what arc is suppose to do. Bennie.
[jira] [Created] (HIVE-4074) Doc update for .8, .9 and .10
Gunther Hagleitner created HIVE-4074: Summary: Doc update for .8, .9 and .10 Key: HIVE-4074 URL: https://issues.apache.org/jira/browse/HIVE-4074 Project: Hive Issue Type: Bug Components: Documentation Reporter: Gunther Hagleitner Assignee: Gunther Hagleitner Need to update the javadocs for releases 8, 9 and 10. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira