[jira] [Commented] (HIVE-5049) Create an ORC test case that has a 0.11 ORC file
[ https://issues.apache.org/jira/browse/HIVE-5049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758029#comment-13758029 ] Hudson commented on HIVE-5049: -- SUCCESS: Integrated in Hive-trunk-h0.21 #2309 (See [https://builds.apache.org/job/Hive-trunk-h0.21/2309/]) HIVE-5049 : Create an ORC test case that has a 0.11 ORC file (Prasanth J via Ashutosh Chauhan) (hashutosh: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1519868) * /hive/trunk/ql/src/test/org/apache/hadoop/hive/ql/io/orc/TestOrcFile.java * /hive/trunk/ql/src/test/resources/orc-file-11-format.orc Create an ORC test case that has a 0.11 ORC file Key: HIVE-5049 URL: https://issues.apache.org/jira/browse/HIVE-5049 Project: Hive Issue Type: Bug Reporter: Owen O'Malley Assignee: Prasanth J Fix For: 0.12.0 Attachments: HIVE-5049.patch.txt, orc-file-11-format.orc We should add a test case that includes a 0.11.0 ORC file to ensure compatibility for reading old ORC files is kept correct. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5129) Multiple table insert fails on count(distinct)
[ https://issues.apache.org/jira/browse/HIVE-5129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758024#comment-13758024 ] Hudson commented on HIVE-5129: -- SUCCESS: Integrated in Hive-trunk-h0.21 #2309 (See [https://builds.apache.org/job/Hive-trunk-h0.21/2309/]) HIVE-5129 Multiple table insert fails on count distinct (Vikram Dixit via Harish Butani) (rhbutani: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1519764) * /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java * /hive/trunk/ql/src/test/queries/clientpositive/multi_insert_gby3.q * /hive/trunk/ql/src/test/results/clientpositive/multi_insert_gby3.q.out Multiple table insert fails on count(distinct) -- Key: HIVE-5129 URL: https://issues.apache.org/jira/browse/HIVE-5129 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.11.0 Reporter: Vikram Dixit K Assignee: Vikram Dixit K Fix For: 0.12.0 Attachments: aggrTestMultiInsertData1.txt, aggrTestMultiInsertData.txt, aggrTestMultiInsert.q, HIVE-5129.1.patch.txt, HIVE-5129.2.WIP.patch.txt, HIVE-5129.3.patch.txt, HIVE-5129.4.patch, HIVE-5129.4.patch.txt Hive fails with a class cast exception on queries of the form: {noformat} from studenttab10k insert overwrite table multi_insert_2_1 select name, avg(age) as avgage group by name insert overwrite table multi_insert_2_2 select name, age, sum(gpa) as sumgpa group by name, age insert overwrite table multi_insert_2_3 select name, count(distinct age) as distage group by name; {noformat} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4750) Fix TestCliDriver.list_bucket_dml_{6,7,8}.q on 0.23
[ https://issues.apache.org/jira/browse/HIVE-4750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758020#comment-13758020 ] Hudson commented on HIVE-4750: -- SUCCESS: Integrated in Hive-trunk-h0.21 #2309 (See [https://builds.apache.org/job/Hive-trunk-h0.21/2309/]) HIVE-4750 : Fix TestCliDriver.list_bucket_dml_{6,7,8}.q on 0.23 (Prasanth J via Ashutosh Chauhan) (hashutosh: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1519857) * /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_dml_6.q * /hive/trunk/ql/src/test/results/clientpositive/list_bucket_dml_6.q.out Fix TestCliDriver.list_bucket_dml_{6,7,8}.q on 0.23 --- Key: HIVE-4750 URL: https://issues.apache.org/jira/browse/HIVE-4750 Project: Hive Issue Type: Sub-task Affects Versions: 0.12.0 Reporter: Brock Noland Assignee: Prasanth J Fix For: 0.12.0 Attachments: HIVE-4750.2.patch, HIVE-4750.patch Removing 6,7,8 from the scope of HIVE-4746. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
Re: HIVE-5104 patch status?
Hi Karl, Looks like we crossed streams. :) I started reviewing your patch last thursday, and committed it yesterday. I had to run some tests before incorporating your patch. I did initially comment up asking for patch regeneration since your patch did not apply cleanly for me on trunk, and also had some whitespacing errors, but I decided to go ahead and tidy up the patch to apply instead of holding you up. Since 0.11 has been released, unless it's an important bugfix, or unless we do a re-release of 0.11, we would not be applying it on that branch. In this case, since it's more akin to a feature introduction, it should be part of our next 0.12 release. Thanks for your contribution! On Wed, Sep 4, 2013 at 9:21 AM, Edward Capriolo edlinuxg...@gmail.com wrote: I'm not sure I understand what you are saying I see plenty of comments on your ticket. http://www.apache.org/dev/committers.html#committer-responsibilities Applying patches In order to grow and maintain healthy communities, committers need to discuss, review and apply patches submitted by volunteers. The Committers are also responsible for the quality and IP clearance of the code that goes into ASF repositories. Helping users Committers should monitor both the dev and user lists for the projects that they work on and (collectively) provide prompt and useful responses to questions from users. Monitoring commits and issues Committers should review commit email messages for their projects and point out anything that looks funny or that may bring in IP issues. Monitoring Bugzilla / Jira for bugs or enhancement requests is also a responsibility of Committers. Generally we (committers pmc) attempt to wade though the open jira issues and review and commit open issues. Not all of us are full time, and not all of us specialize across the entire code base, so the length of time a patch can stay out there is variable. My process is this: Review jira for things marked PATCH AVAILABLE If I understand the scope of the issue I review it and leave comments. If ready I wait for the test and then commit. I personally try to maintain a balance of 75% writing code 25% review but lately I am about 80% review 20% code. I would like us to move to a road map and feature based releases with smaller tuck-in issues. At the moment everything that flows into our jira looks the same to me as we have people going into several different directions, it makes it fairly hard to order the incoming issues in priority. On Wed, Sep 4, 2013 at 8:42 AM, Brock Noland br...@cloudera.com wrote: Hi, Looks like your patch has been committed, but I just wanted to confirm for those lurking. If you have a patch up on a JIRA, emailing the dev list is one of the correct ways of engaging a committer. IRC is also a viable strategy. Thanks for your contribution!! Cheers, Brock On Tue, Sep 3, 2013 at 1:59 PM, Karl Gierach k...@sourcethought.com wrote: Hi, We at Sourcethought have contributed at patch to fix JIRA issue HIVE-5104 about 2 weeks back, although it appears that this patch has not received any comments. The patch is relevant for the both github Hive branch-0.11 and the trunk, as both source files were not changed since the 0.11 branch was created. What steps need to be taken to incorporate this patch into the next release? Do we need to request approval from a committer? Best Regards, Karl Gierach Lead Engineer SourceThought, Inc. -- Apache MRUnit - Unit testing MapReduce - http://mrunit.apache.org
[jira] [Commented] (HIVE-4442) [HCatalog] WebHCat should not override user.name parameter for Queue call
[ https://issues.apache.org/jira/browse/HIVE-4442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758022#comment-13758022 ] Hudson commented on HIVE-4442: -- SUCCESS: Integrated in Hive-trunk-h0.21 #2309 (See [https://builds.apache.org/job/Hive-trunk-h0.21/2309/]) HIVE-4442: WebHCat should not override user.name parameter for Queue call (Daniel Dai) (daijy: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1519875) * /hive/trunk/RELEASE_NOTES.txt * /hive/trunk/hcatalog/webhcat/svr/src/main/java/org/apache/hcatalog/templeton/DeleteDelegator.java * /hive/trunk/hcatalog/webhcat/svr/src/main/java/org/apache/hcatalog/templeton/ListDelegator.java * /hive/trunk/hcatalog/webhcat/svr/src/main/java/org/apache/hcatalog/templeton/Server.java * /hive/trunk/hcatalog/webhcat/svr/src/main/java/org/apache/hcatalog/templeton/StatusDelegator.java * /hive/trunk/shims/src/0.20/java/org/apache/hadoop/hive/shims/Hadoop20Shims.java * /hive/trunk/shims/src/0.20S/java/org/apache/hadoop/hive/shims/Hadoop20SShims.java * /hive/trunk/shims/src/0.20S/java/org/apache/hadoop/mapred/WebHCatJTShim20S.java * /hive/trunk/shims/src/0.23/java/org/apache/hadoop/hive/shims/Hadoop23Shims.java * /hive/trunk/shims/src/0.23/java/org/apache/hadoop/mapred/WebHCatJTShim23.java * /hive/trunk/shims/src/common/java/org/apache/hadoop/hive/shims/HadoopShims.java [HCatalog] WebHCat should not override user.name parameter for Queue call - Key: HIVE-4442 URL: https://issues.apache.org/jira/browse/HIVE-4442 Project: Hive Issue Type: Bug Components: HCatalog Reporter: Daniel Dai Assignee: Daniel Dai Fix For: 0.12.0 Attachments: HIVE-4442-1.patch, HIVE-4442-2.patch, HIVE-4442-3.patch Currently templeton for the Queue call uses the user.name to filter the results of the call in addition to the default security. Ideally the filter is an optional parameter to the call independent of the security check. I would suggest a parameter in addition to GET queue (jobs) give you all the jobs a user have permission: GET queue?showall=true -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4586) [HCatalog] WebHCat should return 404 error for undefined resource
[ https://issues.apache.org/jira/browse/HIVE-4586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758027#comment-13758027 ] Hudson commented on HIVE-4586: -- SUCCESS: Integrated in Hive-trunk-h0.21 #2309 (See [https://builds.apache.org/job/Hive-trunk-h0.21/2309/]) [HIVE-4586] - WebHCat should return 404 error for undefined resource (Daniel Dai) (daijy: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1519818) * /hive/trunk/RELEASE_NOTES.txt * /hive/trunk/hcatalog/webhcat/svr/src/main/java/org/apache/hcatalog/templeton/CatchallExceptionMapper.java * /hive/trunk/hcatalog/webhcat/svr/src/test/java/org/apache/hcatalog/templeton/TestWebHCatE2e.java [HCatalog] WebHCat should return 404 error for undefined resource - Key: HIVE-4586 URL: https://issues.apache.org/jira/browse/HIVE-4586 Project: Hive Issue Type: Bug Components: HCatalog Affects Versions: 0.11.0 Reporter: Daniel Dai Assignee: Daniel Dai Fix For: 0.12.0 Attachments: HIVE-4586-1.patch, HIVE-4586-2.patch -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5145) Fix TestCliDriver.list_bucket_query_multiskew_2.q on hadoop 0.23
[ https://issues.apache.org/jira/browse/HIVE-5145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758026#comment-13758026 ] Hudson commented on HIVE-5145: -- SUCCESS: Integrated in Hive-trunk-h0.21 #2309 (See [https://builds.apache.org/job/Hive-trunk-h0.21/2309/]) HIVE-5145 : Fix TestCliDriver.list_bucket_query_multiskew_2.q on hadoop 0.23 (Prasanth J via Ashutosh Chauhan) (hashutosh: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1519862) * /hive/trunk/ql/src/test/queries/clientpositive/list_bucket_query_multiskew_2.q * /hive/trunk/ql/src/test/results/clientpositive/list_bucket_query_multiskew_2.q.out Fix TestCliDriver.list_bucket_query_multiskew_2.q on hadoop 0.23 Key: HIVE-5145 URL: https://issues.apache.org/jira/browse/HIVE-5145 Project: Hive Issue Type: Bug Components: Tests Reporter: Prasanth J Assignee: Prasanth J Fix For: 0.12.0 Attachments: HIVE-5145.2.patch, HIVE-5145.patch there is some determinism related to the output of list_bucket_query_multiskew_2.q test case. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5096) Add q file tests for ORC predicate pushdown
[ https://issues.apache.org/jira/browse/HIVE-5096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758028#comment-13758028 ] Hudson commented on HIVE-5096: -- SUCCESS: Integrated in Hive-trunk-h0.21 #2309 (See [https://builds.apache.org/job/Hive-trunk-h0.21/2309/]) HIVE-5096 : Add q file tests for ORC predicate pushdown (Prasanth J via Ashutosh Chauhan) (hashutosh: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1519883) * /hive/trunk/data/files/over1k * /hive/trunk/ql/src/test/queries/clientpositive/orc_predicate_pushdown.q * /hive/trunk/ql/src/test/results/clientpositive/orc_predicate_pushdown.q.out Add q file tests for ORC predicate pushdown --- Key: HIVE-5096 URL: https://issues.apache.org/jira/browse/HIVE-5096 Project: Hive Issue Type: Test Components: CLI, File Formats, StorageHandler Affects Versions: 0.12.0 Reporter: Prasanth J Assignee: Prasanth J Labels: orcfile Fix For: 0.12.0 Attachments: HIVE-5096.patch Add q file tests that checks the validity of the results when predicate pushdown is turned on and off. Also test for filter expressions in table scan operator when predicate pushdown is turned on for ORC. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5149) ReduceSinkDeDuplication can pick the wrong partitioning columns
[ https://issues.apache.org/jira/browse/HIVE-5149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758030#comment-13758030 ] Hudson commented on HIVE-5149: -- SUCCESS: Integrated in Hive-trunk-h0.21 #2309 (See [https://builds.apache.org/job/Hive-trunk-h0.21/2309/]) HIVE-5149 : ReduceSinkDeDuplication can pick the wrong partitioning columns (Yin Huai via Ashutosh Chauhan) (hashutosh: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1519805) * /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/correlation/ReduceSinkDeDuplication.java * /hive/trunk/ql/src/test/results/clientpositive/groupby2.q.out * /hive/trunk/ql/src/test/results/clientpositive/groupby2_map_skew.q.out * /hive/trunk/ql/src/test/results/clientpositive/groupby_cube1.q.out * /hive/trunk/ql/src/test/results/clientpositive/groupby_rollup1.q.out * /hive/trunk/ql/src/test/results/clientpositive/reduce_deduplicate_extended.q.out ReduceSinkDeDuplication can pick the wrong partitioning columns --- Key: HIVE-5149 URL: https://issues.apache.org/jira/browse/HIVE-5149 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.11.0, 0.12.0 Reporter: Yin Huai Assignee: Yin Huai Priority: Blocker Fix For: 0.12.0 Attachments: HIVE-5149.1.patch, HIVE-5149.2.patch, HIVE-5149.3.patch https://mail-archives.apache.org/mod_mbox/hive-user/201308.mbox/%3CCAG6Lhyex5XPwszpihKqkPRpzri2k=m4qgc+cpar5yvr8sjt...@mail.gmail.com%3E -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5014) [HCatalog] Fix HCatalog build issue on Windows
[ https://issues.apache.org/jira/browse/HIVE-5014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758025#comment-13758025 ] Hudson commented on HIVE-5014: -- SUCCESS: Integrated in Hive-trunk-h0.21 #2309 (See [https://builds.apache.org/job/Hive-trunk-h0.21/2309/]) HIVE-5014 : Fix HCatalog build issue on Windows (Daniel Dai via Sushanth Sowmyan) (khorgath: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1519853) * /hive/trunk/hcatalog/build.xml [HCatalog] Fix HCatalog build issue on Windows -- Key: HIVE-5014 URL: https://issues.apache.org/jira/browse/HIVE-5014 Project: Hive Issue Type: Sub-task Components: HCatalog Reporter: Daniel Dai Assignee: Daniel Dai Fix For: 0.12.0 Attachments: HIVE-5014-1.patch -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4441) [HCatalog] WebHCat does not honor user home directory
[ https://issues.apache.org/jira/browse/HIVE-4441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758023#comment-13758023 ] Hudson commented on HIVE-4441: -- SUCCESS: Integrated in Hive-trunk-h0.21 #2309 (See [https://builds.apache.org/job/Hive-trunk-h0.21/2309/]) HIVE-4441: WebHCat does not honor user home directory (Daniel Dai) (daijy: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1519897) * /hive/trunk/hcatalog/webhcat/svr/src/main/java/org/apache/hcatalog/templeton/tool/TempletonUtils.java HIVE-4441: WebHCat does not honor user home directory (Daniel Dai) (daijy: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1519884) * /hive/trunk/RELEASE_NOTES.txt * /hive/trunk/hcatalog/webhcat/svr/src/main/java/org/apache/hcatalog/templeton/tool/TempletonUtils.java [HCatalog] WebHCat does not honor user home directory - Key: HIVE-4441 URL: https://issues.apache.org/jira/browse/HIVE-4441 Project: Hive Issue Type: Bug Components: HCatalog Reporter: Daniel Dai Assignee: Daniel Dai Fix For: 0.12.0 Attachments: HIVE-4441-1.patch, HIVE-4441-2.patch, HIVE-4441-3.patch, HIVE-4441-4.patch If I submit a job as user A and I specify statusdir as a relative path, I would expect results to be stored in the folder relative to the user A's home folder. For example, if I run: {code}curl -s -d user.name=hdinsightuser -d execute=show+tables; -d statusdir=pokes.output 'http://localhost:50111/templeton/v1/hive'{code} I get the results under: {code}/user/hdp/pokes.output{code} And I expect them to be under: {code}/user/hdinsightuser/pokes.output{code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5104) HCatStorer fails to store boolean type
[ https://issues.apache.org/jira/browse/HIVE-5104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758021#comment-13758021 ] Hudson commented on HIVE-5104: -- SUCCESS: Integrated in Hive-trunk-h0.21 #2309 (See [https://builds.apache.org/job/Hive-trunk-h0.21/2309/]) HIVE-5104 : HCatStorer fails to store boolean type (Karl D. Gierach via Sushanth Sowmyan) (khorgath: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1519839) * /hive/trunk/hcatalog/hcatalog-pig-adapter/src/main/java/org/apache/hcatalog/pig/HCatBaseStorer.java * /hive/trunk/hcatalog/hcatalog-pig-adapter/src/test/java/org/apache/hcatalog/pig/TestHCatStorer.java HCatStorer fails to store boolean type -- Key: HIVE-5104 URL: https://issues.apache.org/jira/browse/HIVE-5104 Project: Hive Issue Type: Bug Components: HCatalog Affects Versions: 0.11.0 Reporter: Ron Frohock Assignee: Karl D. Gierach Fix For: 0.12.0 Attachments: HIVE-5104.1.patch.txt, HIVE-5104.2.patch Unable to store boolean values to HCat table Assume in Hive you have two tables... CREATE TABLE btest(test as boolean); CREATE TABLE btest2(test as boolean); Then in Pig A = LOAD 'btest' USING org.apache.hcatalog.pig.HCatLoader(); STORE A INTO 'btest2' USING org.apache.hcatalog.pig.HCatStorer(); You will get an ERROR 115: Unsupported type 5: in Pig's Schema Checking HCatBaseStorer.java, the case for data types doesn't check for booleans. Might have been overlooked in adding boolean to Pig in 0.10 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
Re: [ANNOUNCE] New Hive Committer - Yin Huai
Congratulation! On Wed, Sep 4, 2013 at 10:39 AM, yongqiang he heyongqiang...@gmail.comwrote: Congrats! On Wed, Sep 4, 2013 at 10:23 AM, Jason Dere jd...@hortonworks.com wrote: Yin, congrats! Jason On Sep 4, 2013, at 7:54 AM, Eugene Koifman ekoif...@hortonworks.com wrote: Congrats! On Wed, Sep 4, 2013 at 5:23 AM, Brock Noland br...@cloudera.com wrote: Congrats Yin!! On Wed, Sep 4, 2013 at 4:14 AM, Lefty Leverenz leftylever...@gmail.com wrote: Bravo, Yin! -- Lefty On Wed, Sep 4, 2013 at 4:17 AM, Sushanth Sowmyan khorg...@gmail.com wrote: Congrats, Yin! :) On Sep 4, 2013 1:13 AM, Alexander Alten-Lorenz wget.n...@gmail.com wrote: Amazing news, congratz Yin! Well deserved! On Sep 4, 2013, at 6:49 AM, Carl Steinbach c...@apache.org wrote: The Apache Hive PMC has voted to make Yin Huai a committer on the Apache Hive project. Please join me in congratulating Yin! Thanks. Carl -- Alexander Alten-Lorenz http://mapredit.blogspot.com German Hadoop LinkedIn Group: http://goo.gl/N8pCF -- Apache MRUnit - Unit testing MapReduce - http://mrunit.apache.org -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: [ANNOUNCE] New Hive Committer - Yin Huai
Thanks everybody! This is awesome! On Wed, Sep 4, 2013 at 1:58 PM, Daniel Dai da...@hortonworks.com wrote: Congratulation! On Wed, Sep 4, 2013 at 10:39 AM, yongqiang he heyongqiang...@gmail.com wrote: Congrats! On Wed, Sep 4, 2013 at 10:23 AM, Jason Dere jd...@hortonworks.com wrote: Yin, congrats! Jason On Sep 4, 2013, at 7:54 AM, Eugene Koifman ekoif...@hortonworks.com wrote: Congrats! On Wed, Sep 4, 2013 at 5:23 AM, Brock Noland br...@cloudera.com wrote: Congrats Yin!! On Wed, Sep 4, 2013 at 4:14 AM, Lefty Leverenz leftylever...@gmail.com wrote: Bravo, Yin! -- Lefty On Wed, Sep 4, 2013 at 4:17 AM, Sushanth Sowmyan khorg...@gmail.com wrote: Congrats, Yin! :) On Sep 4, 2013 1:13 AM, Alexander Alten-Lorenz wget.n...@gmail.com wrote: Amazing news, congratz Yin! Well deserved! On Sep 4, 2013, at 6:49 AM, Carl Steinbach c...@apache.org wrote: The Apache Hive PMC has voted to make Yin Huai a committer on the Apache Hive project. Please join me in congratulating Yin! Thanks. Carl -- Alexander Alten-Lorenz http://mapredit.blogspot.com German Hadoop LinkedIn Group: http://goo.gl/N8pCF -- Apache MRUnit - Unit testing MapReduce - http://mrunit.apache.org -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
[jira] [Updated] (HIVE-5182) log more stuff via PerfLogger
[ https://issues.apache.org/jira/browse/HIVE-5182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phabricator updated HIVE-5182: -- Attachment: HIVE-5182.D12639.2.patch sershe updated the revision HIVE-5182 [jira] log more stuff via PerfLogger. More perflogs based on CR feedback Reviewers: JIRA REVISION DETAIL https://reviews.facebook.net/D12639 CHANGE SINCE LAST DIFF https://reviews.facebook.net/D12639?vs=39333id=39495#toc AFFECTED FILES ql/src/java/org/apache/hadoop/hive/ql/Driver.java ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java ql/src/java/org/apache/hadoop/hive/ql/io/CombineHiveInputFormat.java ql/src/java/org/apache/hadoop/hive/ql/io/HiveInputFormat.java ql/src/java/org/apache/hadoop/hive/ql/log/PerfLogger.java To: JIRA, sershe log more stuff via PerfLogger - Key: HIVE-5182 URL: https://issues.apache.org/jira/browse/HIVE-5182 Project: Hive Issue Type: Improvement Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Attachments: HIVE-5182.D12639.1.patch, HIVE-5182.D12639.2.patch PerfLogger output is useful in understanding perf. There are large gaps in it, however, and it's not clear what is going on during these. Some sections are large and have no breakdown. It would be nice to add more stuff. At this point I'm not certain where exactly, whoever makes the patch (me?) will just need to look at the above gaps and fill them in. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5206) Support parameterized primitive types
[ https://issues.apache.org/jira/browse/HIVE-5206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758099#comment-13758099 ] Edward Capriolo commented on HIVE-5206: --- Unifying the integral types is surely out of scope for this issue. However, We are making the justification for doing this for string, char, varchar. The same argument would hold for the integral types? I mean if someone was a fresh reader to the code and saw that all the string types work one way, but all the int types work another, what would they conclude is the right way? Are we thinking too patch driven development? Sure your end goal is to have this varchar support (and it is a good goal and this is a good feature) but who cares about all the existing stuff? Is it not sexy not worth touching? I am just thinking out loud here. But I notice we are starting to add many types, some like decimal are special purpose versions of other types. I want to make sure we are thinking about the big picture here. I am not sure we should just be adding a new type every time there is something we do not like about the existing type. What if someone wants nvarchar, or BigInteger, or BigDecimal? Support parameterized primitive types - Key: HIVE-5206 URL: https://issues.apache.org/jira/browse/HIVE-5206 Project: Hive Issue Type: Improvement Components: Types Reporter: Jason Dere Assignee: Jason Dere Attachments: HIVE-5206.1.patch, HIVE-5206.D12693.1.patch Support for parameterized types is needed for char/varchar/decimal support. This adds a type parameters value to the PrimitiveTypeEntry/PrimitiveTypeInfo/PrimitiveObjectInspector objects. NO PRECOMMIT TESTS - dependent on HIVE-5203/HIVE-5204 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5203) FunctionRegistry.getMethodInternal() should prefer method arguments with closer affinity to the original argument types
[ https://issues.apache.org/jira/browse/HIVE-5203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758102#comment-13758102 ] Jason Dere commented on HIVE-5203: -- Hi Edward, did you have a particular comment around the first code section you highlighted in your previous comment? It looks like your comments were directed against the 2nd code section. {code} for (Method m: udfMethods) { +currentScore = 0; +ListTypeInfo argumentsAccepted = +TypeInfoUtils.getParameterTypeInfos(m, argumentsPassed.size()); +IteratorTypeInfo argsPassedIter = argumentsPassed.iterator(); {code} FunctionRegistry.getMethodInternal() should prefer method arguments with closer affinity to the original argument types --- Key: HIVE-5203 URL: https://issues.apache.org/jira/browse/HIVE-5203 Project: Hive Issue Type: Bug Components: Types, UDF Reporter: Jason Dere Assignee: Jason Dere Attachments: HIVE-5203.1.patch When the function registry is trying to determine the best version of UDF evaluate() to use based on a set of arguments passed in, it should prefer methods where the argument types are more related to the original types. For example if varchar is used with UDFFromUnixTime(), varchar is convertible to both the double and string versions of evaluate() for that UDF. In this case we would prefer that the function registry select the string version over the double version, since varchar and string are both string types. This doesn't really affect any of the existing types, but comes into play with the addition of the varchar type (HIVE-4844). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5060) JDBC driver assumes executeStatement is synchronous
[ https://issues.apache.org/jira/browse/HIVE-5060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758113#comment-13758113 ] Vaibhav Gumashta commented on HIVE-5060: [~henryr] Thanks for posting this to rb. There has been some more progress on the async execution part of [HIVE-4569|https://issues.apache.org/jira/browse/HIVE-4569] here: [HIVE-4617|https://issues.apache.org/jira/browse/HIVE-4617]. I'll post my comments soon. JDBC driver assumes executeStatement is synchronous --- Key: HIVE-5060 URL: https://issues.apache.org/jira/browse/HIVE-5060 Project: Hive Issue Type: Bug Components: JDBC Affects Versions: 0.11.0 Reporter: Henry Robinson Fix For: 0.11.1, 0.12.0 Attachments: 0001-HIVE-5060-JDBC-driver-assumes-executeStatement-is-sy.patch, HIVE-5060.patch The JDBC driver seems to assume that {{ExecuteStatement}} is a synchronous call when performing updates via {{executeUpdate}}, where the following comment on the RPC in the Thrift file indicates otherwise: {code} // ExecuteStatement() // // Execute a statement. // The returned OperationHandle can be used to check on the // status of the statement, and to fetch results once the // statement has finished executing. {code} I understand that Hive's implementation of {{ExecuteStatement}} is blocking (see https://issues.apache.org/jira/browse/HIVE-4569), but presumably other implementations of the HiveServer2 API (and I'm talking specifically about Impala here, but others might have a similar concern) should be free to return a pollable {{OperationHandle}} per the specification. The JDBC driver's {{executeUpdate}} is as follows: {code} public int executeUpdate(String sql) throws SQLException { execute(sql); return 0; } {code} {{execute(sql)}} discards the {{OperationHandle}} that it gets from the server after determining whether there are results to be fetched. This is problematic for us, because Impala will cancel queries that are running when a session executes, but there's no easy way to be sure that an {{INSERT}} statement has completed before terminating a session on the client. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4895: - Attachment: (was: HIVE-4895.patch) Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4895: - Attachment: (was: HIVE-4895.move.patch) Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4895: - Attachment: (was: HIVE-4895.update.patch) Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4895: - Attachment: HIVE-4895.patch Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4895.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4895: - Attachment: HIVE-4895.rename.patch HIVE-4895.move.patch Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4895.move.patch, HIVE-4895.patch, HIVE-4895.rename.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4895: - Status: Patch Available (was: Open) Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4895.move.patch, HIVE-4895.patch, HIVE-4895.rename.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-1511) Hive plan serialization is slow
[ https://issues.apache.org/jira/browse/HIVE-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758131#comment-13758131 ] Ashutosh Chauhan commented on HIVE-1511: [~brocknoland] I think rewriting setColumnNameList so that it doesn't require RowSchema is way to go forward. Because, RowSchema is designed to be used at compile time. For execution, we rely on OIs to carry all the type information. Not needing RowSchema to be serializable will make compile time and execution time data structures neatly decoupled. Hive plan serialization is slow --- Key: HIVE-1511 URL: https://issues.apache.org/jira/browse/HIVE-1511 Project: Hive Issue Type: Improvement Affects Versions: 0.7.0, 0.11.0 Reporter: Ning Zhang Assignee: Mohammad Kamrul Islam Attachments: failedPlan.xml, generated_plan.xml, HIVE-1511.10.patch, HIVE-1511.11.patch, HIVE-1511.4.patch, HIVE-1511.5.patch, HIVE-1511.6.patch, HIVE-1511.7.patch, HIVE-1511.8.patch, HIVE-1511.9.patch, HIVE-1511.patch, HIVE-1511-wip2.patch, HIVE-1511-wip3.patch, HIVE-1511-wip4.patch, HIVE-1511.wip.9.patch, HIVE-1511-wip.patch, KryoHiveTest.java, run.sh As reported by Edward Capriolo: For reference I did this as a test case SELECT * FROM src where key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR key=0 OR ...(100 more of these) No OOM but I gave up after the test case did not go anywhere for about 2 minutes. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
Re: [ANNOUNCE] New Hive Committer - Yin Huai
Congratulations, Yin Cheers, Zhuoluo (Clark) Yang 2013/9/5 Yin Huai huaiyin@gmail.com Thanks everybody! This is awesome! On Wed, Sep 4, 2013 at 1:58 PM, Daniel Dai da...@hortonworks.com wrote: Congratulation! On Wed, Sep 4, 2013 at 10:39 AM, yongqiang he heyongqiang...@gmail.com wrote: Congrats! On Wed, Sep 4, 2013 at 10:23 AM, Jason Dere jd...@hortonworks.com wrote: Yin, congrats! Jason On Sep 4, 2013, at 7:54 AM, Eugene Koifman ekoif...@hortonworks.com wrote: Congrats! On Wed, Sep 4, 2013 at 5:23 AM, Brock Noland br...@cloudera.com wrote: Congrats Yin!! On Wed, Sep 4, 2013 at 4:14 AM, Lefty Leverenz leftylever...@gmail.com wrote: Bravo, Yin! -- Lefty On Wed, Sep 4, 2013 at 4:17 AM, Sushanth Sowmyan khorg...@gmail.com wrote: Congrats, Yin! :) On Sep 4, 2013 1:13 AM, Alexander Alten-Lorenz wget.n...@gmail.com wrote: Amazing news, congratz Yin! Well deserved! On Sep 4, 2013, at 6:49 AM, Carl Steinbach c...@apache.org wrote: The Apache Hive PMC has voted to make Yin Huai a committer on the Apache Hive project. Please join me in congratulating Yin! Thanks. Carl -- Alexander Alten-Lorenz http://mapredit.blogspot.com German Hadoop LinkedIn Group: http://goo.gl/N8pCF -- Apache MRUnit - Unit testing MapReduce - http://mrunit.apache.org -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
[jira] [Commented] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758152#comment-13758152 ] Eugene Koifman commented on HIVE-4895: -- Latest patches attached Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4895.move.patch, HIVE-4895.patch, HIVE-4895.rename.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5203) FunctionRegistry.getMethodInternal() should prefer method arguments with closer affinity to the original argument types
[ https://issues.apache.org/jira/browse/HIVE-5203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758157#comment-13758157 ] Edward Capriolo commented on HIVE-5203: --- I was thinking you could use mock up a class with a few methods with signatures you are looking to hit. {code} MyTestClass { eval(String) eval(Text) eval(int) } {code} Then use that code in a test case and maybe use mockito to show that the logic is finding the method you want. Really as long as it continues to work the same way (current tests) we know it is correct, but it would be nice to have a more direct way to show how this logic works (with a unit test). Just a thought. FunctionRegistry.getMethodInternal() should prefer method arguments with closer affinity to the original argument types --- Key: HIVE-5203 URL: https://issues.apache.org/jira/browse/HIVE-5203 Project: Hive Issue Type: Bug Components: Types, UDF Reporter: Jason Dere Assignee: Jason Dere Attachments: HIVE-5203.1.patch When the function registry is trying to determine the best version of UDF evaluate() to use based on a set of arguments passed in, it should prefer methods where the argument types are more related to the original types. For example if varchar is used with UDFFromUnixTime(), varchar is convertible to both the double and string versions of evaluate() for that UDF. In this case we would prefer that the function registry select the string version over the double version, since varchar and string are both string types. This doesn't really affect any of the existing types, but comes into play with the addition of the varchar type (HIVE-4844). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-5158) allow getting all partitions for table to also use direct SQL path
[ https://issues.apache.org/jira/browse/HIVE-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashutosh Chauhan updated HIVE-5158: --- Status: Open (was: Patch Available) Latest patch is resulting in ton of failures in my test runs. allow getting all partitions for table to also use direct SQL path -- Key: HIVE-5158 URL: https://issues.apache.org/jira/browse/HIVE-5158 Project: Hive Issue Type: Improvement Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Attachments: HIVE-5158.D12573.1.patch, HIVE-5158.D12573.2.patch, HIVE-5158.D12573.3.patch, HIVE-5158.D12573.4.patch, HIVE-5158.D12573.5.patch While testing some queries I noticed that getPartitions can be very slow (which happens e.g. in non-strict mode with no partition column filter); with a table with many partitions it can take 10-12s easily. SQL perf path can also be used for this path. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
Re: [jira] [Commented] (HIVE-5149) ReduceSinkDeDuplication can pick the wrong partitioning columns
Hi Yin Huai, This patch doesn't apply cleanly in Hive 11. For example, ReduceSinkDeDuplication.java is in ql/src/java/org/apache/hadoop/hive/ql/optimizer/ in Hive 11 while the patch assumes the file is in ql/src/java/org/apache/hadoop/hive/ql/optimizer/correlation/. It seems like there are other changes on this and related files between Hive 11 and this patch. Could you back port this to 11? The bug breaks correctness in 11, and most users aren't ready to move to Hive. Thanks, Eric On Wed, Sep 4, 2013 at 10:45 AM, Hudson (JIRA) j...@apache.org wrote: [ https://issues.apache.org/jira/browse/HIVE-5149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758030#comment-13758030] Hudson commented on HIVE-5149: -- SUCCESS: Integrated in Hive-trunk-h0.21 #2309 (See [ https://builds.apache.org/job/Hive-trunk-h0.21/2309/]) HIVE-5149 : ReduceSinkDeDuplication can pick the wrong partitioning columns (Yin Huai via Ashutosh Chauhan) (hashutosh: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1519805) * /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/correlation/ReduceSinkDeDuplication.java * /hive/trunk/ql/src/test/results/clientpositive/groupby2.q.out * /hive/trunk/ql/src/test/results/clientpositive/groupby2_map_skew.q.out * /hive/trunk/ql/src/test/results/clientpositive/groupby_cube1.q.out * /hive/trunk/ql/src/test/results/clientpositive/groupby_rollup1.q.out * /hive/trunk/ql/src/test/results/clientpositive/reduce_deduplicate_extended.q.out ReduceSinkDeDuplication can pick the wrong partitioning columns --- Key: HIVE-5149 URL: https://issues.apache.org/jira/browse/HIVE-5149 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.11.0, 0.12.0 Reporter: Yin Huai Assignee: Yin Huai Priority: Blocker Fix For: 0.12.0 Attachments: HIVE-5149.1.patch, HIVE-5149.2.patch, HIVE-5149.3.patch https://mail-archives.apache.org/mod_mbox/hive-user/201308.mbox/%3CCAG6Lhyex5XPwszpihKqkPRpzri2k=m4qgc+cpar5yvr8sjt...@mail.gmail.com%3E -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HIVE-5214) Dynamic partitions don't inherit groupname from table's directory
Thiruvel Thirumoolan created HIVE-5214: -- Summary: Dynamic partitions don't inherit groupname from table's directory Key: HIVE-5214 URL: https://issues.apache.org/jira/browse/HIVE-5214 Project: Hive Issue Type: Bug Components: Authorization, Security Affects Versions: 0.12.0 Reporter: Thiruvel Thirumoolan When dynamic partitions are created, the files/partitions don't inherit the group name. The query (say, insert overwrite table select *) uses the scratch directory for creating the temporary data. The temporary data's perm/group is inherited from scratch directory. Finally, the MoveTask does a rename of the temporary dir to be the target partition directory and an explicit group/perm change does not happen. HIVE-3756 fixed it for Load data, dynamic partitions has to be handled. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5206) Support parameterized primitive types
[ https://issues.apache.org/jira/browse/HIVE-5206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758212#comment-13758212 ] Jason Dere commented on HIVE-5206: -- Correction - the justification is for decimal, char, varchar, so for a combination of string/numeric types. The right way depends on whether the type is a qualified type and requires parameters. Not going to indulge the what if fantasies here, but would like to point out that decimal/char/varchar are pretty fundamental SQL types which many Hive users are interested in, and I hope this work helps to facilitate that effort. If someone wants to take up the torch on unifying integral types as a part of this work, that is great too if that is the right thing to do. Support parameterized primitive types - Key: HIVE-5206 URL: https://issues.apache.org/jira/browse/HIVE-5206 Project: Hive Issue Type: Improvement Components: Types Reporter: Jason Dere Assignee: Jason Dere Attachments: HIVE-5206.1.patch, HIVE-5206.D12693.1.patch Support for parameterized types is needed for char/varchar/decimal support. This adds a type parameters value to the PrimitiveTypeEntry/PrimitiveTypeInfo/PrimitiveObjectInspector objects. NO PRECOMMIT TESTS - dependent on HIVE-5203/HIVE-5204 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5131) JDBC client's hive variables are not passed to HS2
[ https://issues.apache.org/jira/browse/HIVE-5131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758215#comment-13758215 ] Xuefu Zhang commented on HIVE-5131: --- The test failures seemed random and unrelated. Nevertheless, manually reran them and they both passed locally with the patch. JDBC client's hive variables are not passed to HS2 -- Key: HIVE-5131 URL: https://issues.apache.org/jira/browse/HIVE-5131 Project: Hive Issue Type: Bug Components: JDBC Affects Versions: 0.11.0 Reporter: Xuefu Zhang Assignee: Xuefu Zhang Fix For: 0.12.0 Attachments: HIVE-5131.1.patch, HIVE-5131.patch, HIVE-5131.patch Related to HIVE-2914. However, HIVE-2914 seems addressing Hive CLI only. JDBC clients suffer the same problem. This was identified in HIVE-4568. I decided it might be better to separate issue from a different issue. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
Re: [jira] [Commented] (HIVE-5149) ReduceSinkDeDuplication can pick the wrong partitioning columns
Sure. Let me take a look at 0.11 branch. On Wed, Sep 4, 2013 at 2:53 PM, Eric Chu e...@rocketfuel.com wrote: Hi Yin Huai, This patch doesn't apply cleanly in Hive 11. For example, ReduceSinkDeDuplication.java is in ql/src/java/org/apache/hadoop/hive/ql/optimizer/ in Hive 11 while the patch assumes the file is in ql/src/java/org/apache/hadoop/hive/ql/optimizer/correlation/. It seems like there are other changes on this and related files between Hive 11 and this patch. Could you back port this to 11? The bug breaks correctness in 11, and most users aren't ready to move to Hive. Thanks, Eric On Wed, Sep 4, 2013 at 10:45 AM, Hudson (JIRA) j...@apache.org wrote: [ https://issues.apache.org/jira/browse/HIVE-5149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758030#comment-13758030 ] Hudson commented on HIVE-5149: -- SUCCESS: Integrated in Hive-trunk-h0.21 #2309 (See [ https://builds.apache.org/job/Hive-trunk-h0.21/2309/]) HIVE-5149 : ReduceSinkDeDuplication can pick the wrong partitioning columns (Yin Huai via Ashutosh Chauhan) (hashutosh: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1519805) * /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/correlation/ReduceSinkDeDuplication.java * /hive/trunk/ql/src/test/results/clientpositive/groupby2.q.out * /hive/trunk/ql/src/test/results/clientpositive/groupby2_map_skew.q.out * /hive/trunk/ql/src/test/results/clientpositive/groupby_cube1.q.out * /hive/trunk/ql/src/test/results/clientpositive/groupby_rollup1.q.out * /hive/trunk/ql/src/test/results/clientpositive/reduce_deduplicate_extended.q.out ReduceSinkDeDuplication can pick the wrong partitioning columns --- Key: HIVE-5149 URL: https://issues.apache.org/jira/browse/HIVE-5149 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.11.0, 0.12.0 Reporter: Yin Huai Assignee: Yin Huai Priority: Blocker Fix For: 0.12.0 Attachments: HIVE-5149.1.patch, HIVE-5149.2.patch, HIVE-5149.3.patch https://mail-archives.apache.org/mod_mbox/hive-user/201308.mbox/%3CCAG6Lhyex5XPwszpihKqkPRpzri2k=m4qgc+cpar5yvr8sjt...@mail.gmail.com%3E -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5206) Support parameterized primitive types
[ https://issues.apache.org/jira/browse/HIVE-5206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758229#comment-13758229 ] Xuefu Zhang commented on HIVE-5206: --- I think it might be a good idea to file a separate, followup JIRA addressing Ed's suggestion about unifying integers. I can see that there is a slight difference between different types of integers and other parameterized types. Existing integer types can be imaged to be integer type with the number of bytes as a parameter. However, the possible values for that parameter is enumerated and limited. It's hard to image that one day Hive will need to support int(3). For char/varchar/decimal, however, there are continuous values on the parameters. Having said that, unifying integer types may make the code unify and add ease to understand. To me, it seems a nice thing to have. Support parameterized primitive types - Key: HIVE-5206 URL: https://issues.apache.org/jira/browse/HIVE-5206 Project: Hive Issue Type: Improvement Components: Types Reporter: Jason Dere Assignee: Jason Dere Attachments: HIVE-5206.1.patch, HIVE-5206.D12693.1.patch Support for parameterized types is needed for char/varchar/decimal support. This adds a type parameters value to the PrimitiveTypeEntry/PrimitiveTypeInfo/PrimitiveObjectInspector objects. NO PRECOMMIT TESTS - dependent on HIVE-5203/HIVE-5204 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-5214) Dynamic partitions/insert overwrite don't inherit groupname from table's directory
[ https://issues.apache.org/jira/browse/HIVE-5214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thiruvel Thirumoolan updated HIVE-5214: --- Description: When dynamic partitions are created or insert overwrite without partitions, the files/partition-dirs don't inherit the group name. The query (say, insert overwrite table select *) uses the scratch directory for creating the temporary data. The temporary data's perm/group is inherited from scratch directory. Finally, the MoveTask does a rename of the temporary dir/files to be the target directory and an explicit group/perm change does not happen. HIVE-3756 fixed it for Load data, dynamic partitions/inserts have to be handled. was: When dynamic partitions are created, the files/partitions don't inherit the group name. The query (say, insert overwrite table select *) uses the scratch directory for creating the temporary data. The temporary data's perm/group is inherited from scratch directory. Finally, the MoveTask does a rename of the temporary dir to be the target partition directory and an explicit group/perm change does not happen. HIVE-3756 fixed it for Load data, dynamic partitions has to be handled. Summary: Dynamic partitions/insert overwrite don't inherit groupname from table's directory (was: Dynamic partitions don't inherit groupname from table's directory) Dynamic partitions/insert overwrite don't inherit groupname from table's directory -- Key: HIVE-5214 URL: https://issues.apache.org/jira/browse/HIVE-5214 Project: Hive Issue Type: Bug Components: Authorization, Security Affects Versions: 0.12.0 Reporter: Thiruvel Thirumoolan When dynamic partitions are created or insert overwrite without partitions, the files/partition-dirs don't inherit the group name. The query (say, insert overwrite table select *) uses the scratch directory for creating the temporary data. The temporary data's perm/group is inherited from scratch directory. Finally, the MoveTask does a rename of the temporary dir/files to be the target directory and an explicit group/perm change does not happen. HIVE-3756 fixed it for Load data, dynamic partitions/inserts have to be handled. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
Re: [jira] [Commented] (HIVE-5149) ReduceSinkDeDuplication can pick the wrong partitioning columns
Hi Eric, Can you try https://issues.apache.org/jira/secure/attachment/12601446/HIVE-5149-branch-0.11.patch? Thanks, Yin On Wed, Sep 4, 2013 at 3:14 PM, Yin Huai huaiyin@gmail.com wrote: Sure. Let me take a look at 0.11 branch. On Wed, Sep 4, 2013 at 2:53 PM, Eric Chu e...@rocketfuel.com wrote: Hi Yin Huai, This patch doesn't apply cleanly in Hive 11. For example, ReduceSinkDeDuplication.java is in ql/src/java/org/apache/hadoop/hive/ql/optimizer/ in Hive 11 while the patch assumes the file is in ql/src/java/org/apache/hadoop/hive/ql/optimizer/correlation/. It seems like there are other changes on this and related files between Hive 11 and this patch. Could you back port this to 11? The bug breaks correctness in 11, and most users aren't ready to move to Hive. Thanks, Eric On Wed, Sep 4, 2013 at 10:45 AM, Hudson (JIRA) j...@apache.org wrote: [ https://issues.apache.org/jira/browse/HIVE-5149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758030#comment-13758030 ] Hudson commented on HIVE-5149: -- SUCCESS: Integrated in Hive-trunk-h0.21 #2309 (See [ https://builds.apache.org/job/Hive-trunk-h0.21/2309/]) HIVE-5149 : ReduceSinkDeDuplication can pick the wrong partitioning columns (Yin Huai via Ashutosh Chauhan) (hashutosh: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1519805 ) * /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/correlation/ReduceSinkDeDuplication.java * /hive/trunk/ql/src/test/results/clientpositive/groupby2.q.out * /hive/trunk/ql/src/test/results/clientpositive/groupby2_map_skew.q.out * /hive/trunk/ql/src/test/results/clientpositive/groupby_cube1.q.out * /hive/trunk/ql/src/test/results/clientpositive/groupby_rollup1.q.out * /hive/trunk/ql/src/test/results/clientpositive/reduce_deduplicate_extended.q.out ReduceSinkDeDuplication can pick the wrong partitioning columns --- Key: HIVE-5149 URL: https://issues.apache.org/jira/browse/HIVE-5149 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.11.0, 0.12.0 Reporter: Yin Huai Assignee: Yin Huai Priority: Blocker Fix For: 0.12.0 Attachments: HIVE-5149.1.patch, HIVE-5149.2.patch, HIVE-5149.3.patch https://mail-archives.apache.org/mod_mbox/hive-user/201308.mbox/%3CCAG6Lhyex5XPwszpihKqkPRpzri2k=m4qgc+cpar5yvr8sjt...@mail.gmail.com%3E -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5205) Javadoc warnings in HCatalog prevent Hive from building under OpenJDK7
[ https://issues.apache.org/jira/browse/HIVE-5205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758235#comment-13758235 ] Konstantin Boudnik commented on HIVE-5205: -- looks like it has been been fixed in the trunk as a part of 0129f3642660e8696646ac3d4aa414ef062e2118 Hence, the patch is only good for Hive 0.11 Javadoc warnings in HCatalog prevent Hive from building under OpenJDK7 -- Key: HIVE-5205 URL: https://issues.apache.org/jira/browse/HIVE-5205 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.11.0 Reporter: Konstantin Boudnik Assignee: Konstantin Boudnik Fix For: 0.12.0 Attachments: HIVE-5205.patch when building Hive with OpenJDK7 the following warning message makes the build fail: [javadoc] /var/lib/jenkins/workspace/Shark-Hive-0.11-OJDK7/hcatalog/storage-handlers/hbase/src/java/org/apache/hcatalog/hbase/snapshot/RevisionManagerFactory.java:81: warning - @return tag has no arguments. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-5149) ReduceSinkDeDuplication can pick the wrong partitioning columns
[ https://issues.apache.org/jira/browse/HIVE-5149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated HIVE-5149: --- Attachment: HIVE-5149-branch-0.11.patch HIVE-5149-branch-0.11.patch is a 0.11 port ReduceSinkDeDuplication can pick the wrong partitioning columns --- Key: HIVE-5149 URL: https://issues.apache.org/jira/browse/HIVE-5149 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.11.0, 0.12.0 Reporter: Yin Huai Assignee: Yin Huai Priority: Blocker Fix For: 0.12.0 Attachments: HIVE-5149.1.patch, HIVE-5149.2.patch, HIVE-5149.3.patch, HIVE-5149-branch-0.11.patch https://mail-archives.apache.org/mod_mbox/hive-user/201308.mbox/%3CCAG6Lhyex5XPwszpihKqkPRpzri2k=m4qgc+cpar5yvr8sjt...@mail.gmail.com%3E -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
Re: [jira] [Commented] (HIVE-5149) ReduceSinkDeDuplication can pick the wrong partitioning columns
Hi Yin, I was able to apply the patch to 11 w/out problems. Thanks. On Wed, Sep 4, 2013 at 12:41 PM, Yin Huai huaiyin@gmail.com wrote: Hi Eric, Can you try https://issues.apache.org/jira/secure/attachment/12601446/HIVE-5149-branch-0.11.patch? Thanks, Yin On Wed, Sep 4, 2013 at 3:14 PM, Yin Huai huaiyin@gmail.com wrote: Sure. Let me take a look at 0.11 branch. On Wed, Sep 4, 2013 at 2:53 PM, Eric Chu e...@rocketfuel.com wrote: Hi Yin Huai, This patch doesn't apply cleanly in Hive 11. For example, ReduceSinkDeDuplication.java is in ql/src/java/org/apache/hadoop/hive/ql/optimizer/ in Hive 11 while the patch assumes the file is in ql/src/java/org/apache/hadoop/hive/ql/optimizer/correlation/. It seems like there are other changes on this and related files between Hive 11 and this patch. Could you back port this to 11? The bug breaks correctness in 11, and most users aren't ready to move to Hive. Thanks, Eric On Wed, Sep 4, 2013 at 10:45 AM, Hudson (JIRA) j...@apache.org wrote: [ https://issues.apache.org/jira/browse/HIVE-5149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758030#comment-13758030 ] Hudson commented on HIVE-5149: -- SUCCESS: Integrated in Hive-trunk-h0.21 #2309 (See [ https://builds.apache.org/job/Hive-trunk-h0.21/2309/]) HIVE-5149 : ReduceSinkDeDuplication can pick the wrong partitioning columns (Yin Huai via Ashutosh Chauhan) (hashutosh: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1519805) * /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/correlation/ReduceSinkDeDuplication.java * /hive/trunk/ql/src/test/results/clientpositive/groupby2.q.out * /hive/trunk/ql/src/test/results/clientpositive/groupby2_map_skew.q.out * /hive/trunk/ql/src/test/results/clientpositive/groupby_cube1.q.out * /hive/trunk/ql/src/test/results/clientpositive/groupby_rollup1.q.out * /hive/trunk/ql/src/test/results/clientpositive/reduce_deduplicate_extended.q.out ReduceSinkDeDuplication can pick the wrong partitioning columns --- Key: HIVE-5149 URL: https://issues.apache.org/jira/browse/HIVE-5149 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.11.0, 0.12.0 Reporter: Yin Huai Assignee: Yin Huai Priority: Blocker Fix For: 0.12.0 Attachments: HIVE-5149.1.patch, HIVE-5149.2.patch, HIVE-5149.3.patch https://mail-archives.apache.org/mod_mbox/hive-user/201308.mbox/%3CCAG6Lhyex5XPwszpihKqkPRpzri2k=m4qgc+cpar5yvr8sjt...@mail.gmail.com%3E -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-5199) Read Only Custom SerDe works with HDP 1.1 but not with HDP 1.3
[ https://issues.apache.org/jira/browse/HIVE-5199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hari Sankar Sivarama Subramaniyan updated HIVE-5199: Attachment: HIVE-5199.patch.txt Adding code changes to avoid nested structures within complex datatypes to throw exception Read Only Custom SerDe works with HDP 1.1 but not with HDP 1.3 -- Key: HIVE-5199 URL: https://issues.apache.org/jira/browse/HIVE-5199 Project: Hive Issue Type: Bug Reporter: Hari Sankar Sivarama Subramaniyan Assignee: Hari Sankar Sivarama Subramaniyan Priority: Critical Attachments: HIVE-5199.patch.txt Custom serdes which used to work in HDP 1.1 is no longer working with HDP 1.3. The issue happens when the partition serde is not of settable type in HDP 1.3. The below exception happens via FetchOperator as well as MapOperator. Inside FetchOperator consider the following call: getRecordReader()-ObjectInspectorConverters. getConverter() The output object inspector is of settable type(because it is generated via ObjectInspectorConverters.getConvertedOI()) where as the input object inspector that gets passed as serde.getObjectorInspector() and is non-settable. Inside getConverter(), the (inputOI.equals(outputOI)) check fails and the switch statement tries to cast the non-settable object inspector to a settable object inspector. The stack trace as follows: 2013-08-28 17:57:25,307 ERROR CliDriver (SessionState.java:printError(432)) - Failed with exception java.io.IOException:java.lang.ClassCastException: com.skype.data.whaleshark.hadoop.hive.proto.ProtoMapObjectInspector cannot be cast to org.apache.hadoop.hive.serde2.objectinspector.SettableMapObjectInspector java.io.IOException: java.lang.ClassCastException: com.skype.data.whaleshark.hadoop.hive.proto.ProtoMapObjectInspector cannot be cast to org.apache.hadoop.hive.serde2.objectinspector.SettableMapObjectInspector at org.apache.hadoop.hive.ql.exec.FetchOperator.getNextRow(FetchOperator.java:544) at org.apache.hadoop.hive.ql.exec.FetchOperator.pushRow(FetchOperator.java:488) at org.apache.hadoop.hive.ql.exec.FetchTask.fetch(FetchTask.java:136) at org.apache.hadoop.hive.ql.Driver.getResults(Driver.java:1412) at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:271) at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:216) at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:413) at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:756) at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:614) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:601) at org.apache.hadoop.util.RunJar.main(RunJar.java:160) Caused by: java.lang.ClassCastException: com.skype.data.whaleshark.hadoop.hive.proto.ProtoMapObjectInspector cannot be cast to org.apache.hadoop.hive.serde2.objectinspector.SettableMapObjectInspector at org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorConverters.getConverter(ObjectInspectorConverters.java:144) at org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorConverters$StructConverter.init(ObjectInspectorConverters.java:307) at org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorConverters.getConverter(ObjectInspectorConverters.java:138) at org.apache.hadoop.hive.ql.exec.FetchOperator.getRecordReader(FetchOperator.java:406) at org.apache.hadoop.hive.ql.exec.FetchOperator.getNextRow(FetchOperator.java:508) -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5199) Read Only Custom SerDe works with HDP 1.1 but not with HDP 1.3
[ https://issues.apache.org/jira/browse/HIVE-5199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758269#comment-13758269 ] Brock Noland commented on HIVE-5199: Hari, Thanks for the contribution! On small suggestion, in this JIRA project we discuss the Apache Hive project, not a distribution, therefore it's useful to discuss a regression in 0.9., 0.10, or 0.11 which caused this. Read Only Custom SerDe works with HDP 1.1 but not with HDP 1.3 -- Key: HIVE-5199 URL: https://issues.apache.org/jira/browse/HIVE-5199 Project: Hive Issue Type: Bug Reporter: Hari Sankar Sivarama Subramaniyan Assignee: Hari Sankar Sivarama Subramaniyan Priority: Critical Attachments: HIVE-5199.patch.txt Custom serdes which used to work in HDP 1.1 is no longer working with HDP 1.3. The issue happens when the partition serde is not of settable type in HDP 1.3. The below exception happens via FetchOperator as well as MapOperator. Inside FetchOperator consider the following call: getRecordReader()-ObjectInspectorConverters. getConverter() The output object inspector is of settable type(because it is generated via ObjectInspectorConverters.getConvertedOI()) where as the input object inspector that gets passed as serde.getObjectorInspector() and is non-settable. Inside getConverter(), the (inputOI.equals(outputOI)) check fails and the switch statement tries to cast the non-settable object inspector to a settable object inspector. The stack trace as follows: 2013-08-28 17:57:25,307 ERROR CliDriver (SessionState.java:printError(432)) - Failed with exception java.io.IOException:java.lang.ClassCastException: com.skype.data.whaleshark.hadoop.hive.proto.ProtoMapObjectInspector cannot be cast to org.apache.hadoop.hive.serde2.objectinspector.SettableMapObjectInspector java.io.IOException: java.lang.ClassCastException: com.skype.data.whaleshark.hadoop.hive.proto.ProtoMapObjectInspector cannot be cast to org.apache.hadoop.hive.serde2.objectinspector.SettableMapObjectInspector at org.apache.hadoop.hive.ql.exec.FetchOperator.getNextRow(FetchOperator.java:544) at org.apache.hadoop.hive.ql.exec.FetchOperator.pushRow(FetchOperator.java:488) at org.apache.hadoop.hive.ql.exec.FetchTask.fetch(FetchTask.java:136) at org.apache.hadoop.hive.ql.Driver.getResults(Driver.java:1412) at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:271) at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:216) at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:413) at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:756) at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:614) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:601) at org.apache.hadoop.util.RunJar.main(RunJar.java:160) Caused by: java.lang.ClassCastException: com.skype.data.whaleshark.hadoop.hive.proto.ProtoMapObjectInspector cannot be cast to org.apache.hadoop.hive.serde2.objectinspector.SettableMapObjectInspector at org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorConverters.getConverter(ObjectInspectorConverters.java:144) at org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorConverters$StructConverter.init(ObjectInspectorConverters.java:307) at org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorConverters.getConverter(ObjectInspectorConverters.java:138) at org.apache.hadoop.hive.ql.exec.FetchOperator.getRecordReader(FetchOperator.java:406) at org.apache.hadoop.hive.ql.exec.FetchOperator.getNextRow(FetchOperator.java:508) -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-5199) Read Only Custom SerDe works with HDP 1.1 but not with HDP 1.3
[ https://issues.apache.org/jira/browse/HIVE-5199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hari Sankar Sivarama Subramaniyan updated HIVE-5199: Status: Patch Available (was: Open) Read Only Custom SerDe works with HDP 1.1 but not with HDP 1.3 -- Key: HIVE-5199 URL: https://issues.apache.org/jira/browse/HIVE-5199 Project: Hive Issue Type: Bug Reporter: Hari Sankar Sivarama Subramaniyan Assignee: Hari Sankar Sivarama Subramaniyan Priority: Critical Attachments: HIVE-5199.patch.txt Custom serdes which used to work in HDP 1.1 is no longer working with HDP 1.3. The issue happens when the partition serde is not of settable type in HDP 1.3. The below exception happens via FetchOperator as well as MapOperator. Inside FetchOperator consider the following call: getRecordReader()-ObjectInspectorConverters. getConverter() The output object inspector is of settable type(because it is generated via ObjectInspectorConverters.getConvertedOI()) where as the input object inspector that gets passed as serde.getObjectorInspector() and is non-settable. Inside getConverter(), the (inputOI.equals(outputOI)) check fails and the switch statement tries to cast the non-settable object inspector to a settable object inspector. The stack trace as follows: 2013-08-28 17:57:25,307 ERROR CliDriver (SessionState.java:printError(432)) - Failed with exception java.io.IOException:java.lang.ClassCastException: com.skype.data.whaleshark.hadoop.hive.proto.ProtoMapObjectInspector cannot be cast to org.apache.hadoop.hive.serde2.objectinspector.SettableMapObjectInspector java.io.IOException: java.lang.ClassCastException: com.skype.data.whaleshark.hadoop.hive.proto.ProtoMapObjectInspector cannot be cast to org.apache.hadoop.hive.serde2.objectinspector.SettableMapObjectInspector at org.apache.hadoop.hive.ql.exec.FetchOperator.getNextRow(FetchOperator.java:544) at org.apache.hadoop.hive.ql.exec.FetchOperator.pushRow(FetchOperator.java:488) at org.apache.hadoop.hive.ql.exec.FetchTask.fetch(FetchTask.java:136) at org.apache.hadoop.hive.ql.Driver.getResults(Driver.java:1412) at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:271) at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:216) at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:413) at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:756) at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:614) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:601) at org.apache.hadoop.util.RunJar.main(RunJar.java:160) Caused by: java.lang.ClassCastException: com.skype.data.whaleshark.hadoop.hive.proto.ProtoMapObjectInspector cannot be cast to org.apache.hadoop.hive.serde2.objectinspector.SettableMapObjectInspector at org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorConverters.getConverter(ObjectInspectorConverters.java:144) at org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorConverters$StructConverter.init(ObjectInspectorConverters.java:307) at org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorConverters.getConverter(ObjectInspectorConverters.java:138) at org.apache.hadoop.hive.ql.exec.FetchOperator.getRecordReader(FetchOperator.java:406) at org.apache.hadoop.hive.ql.exec.FetchOperator.getNextRow(FetchOperator.java:508) -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5131) JDBC client's hive variables are not passed to HS2
[ https://issues.apache.org/jira/browse/HIVE-5131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758297#comment-13758297 ] Thejas M Nair commented on HIVE-5131: - [~navis] Do you also want to take a look at this patch, since this is very similar to your change in HIVE-4354 . The key difference I see in that patch is that in embedded mode it starts the hiveserver2 with the modified hiveconf. Did you have any use case in mind for that ? JDBC client's hive variables are not passed to HS2 -- Key: HIVE-5131 URL: https://issues.apache.org/jira/browse/HIVE-5131 Project: Hive Issue Type: Bug Components: JDBC Affects Versions: 0.11.0 Reporter: Xuefu Zhang Assignee: Xuefu Zhang Fix For: 0.12.0 Attachments: HIVE-5131.1.patch, HIVE-5131.patch, HIVE-5131.patch Related to HIVE-2914. However, HIVE-2914 seems addressing Hive CLI only. JDBC clients suffer the same problem. This was identified in HIVE-4568. I decided it might be better to separate issue from a different issue. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5210) WebHCatJTShim implementations are missing Apache license headers
[ https://issues.apache.org/jira/browse/HIVE-5210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758300#comment-13758300 ] Sushanth Sowmyan commented on HIVE-5210: Good catch. I came to this jira to respond that HCat is currently frozen today pending the commit of HIVE-4895, but this patch is not affected by that since it lies in shims/. +1 from me as well. WebHCatJTShim implementations are missing Apache license headers Key: HIVE-5210 URL: https://issues.apache.org/jira/browse/HIVE-5210 Project: Hive Issue Type: Bug Components: HCatalog Reporter: Jarek Jarcec Cecho Assignee: Jarek Jarcec Cecho Fix For: 0.12.0 Attachments: HIVE-5210.patch During investigation of SQOOP-1190, I've noticed that {{WebHCatJTShim}} implementaion files do not have license headers. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-5203) FunctionRegistry.getMethodInternal() should prefer method arguments with closer affinity to the original argument types
[ https://issues.apache.org/jira/browse/HIVE-5203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Dere updated HIVE-5203: - Attachment: HIVE-5203.2.patch Attaching HIVE-5203.2.patch. This breaks out the type affinity-based selection to a separate method for better testability, and adds tests. FunctionRegistry.getMethodInternal() should prefer method arguments with closer affinity to the original argument types --- Key: HIVE-5203 URL: https://issues.apache.org/jira/browse/HIVE-5203 Project: Hive Issue Type: Bug Components: Types, UDF Reporter: Jason Dere Assignee: Jason Dere Attachments: HIVE-5203.1.patch, HIVE-5203.2.patch When the function registry is trying to determine the best version of UDF evaluate() to use based on a set of arguments passed in, it should prefer methods where the argument types are more related to the original types. For example if varchar is used with UDFFromUnixTime(), varchar is convertible to both the double and string versions of evaluate() for that UDF. In this case we would prefer that the function registry select the string version over the double version, since varchar and string are both string types. This doesn't really affect any of the existing types, but comes into play with the addition of the varchar type (HIVE-4844). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5203) FunctionRegistry.getMethodInternal() should prefer method arguments with closer affinity to the original argument types
[ https://issues.apache.org/jira/browse/HIVE-5203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758326#comment-13758326 ] Jason Dere commented on HIVE-5203: -- Managed to add some tests by moving the new functionality to a separate method, though we'd never hit those scenarios in real life. Also posted the changes at https://reviews.facebook.net/D12711 FunctionRegistry.getMethodInternal() should prefer method arguments with closer affinity to the original argument types --- Key: HIVE-5203 URL: https://issues.apache.org/jira/browse/HIVE-5203 Project: Hive Issue Type: Bug Components: Types, UDF Reporter: Jason Dere Assignee: Jason Dere Attachments: HIVE-5203.1.patch, HIVE-5203.2.patch When the function registry is trying to determine the best version of UDF evaluate() to use based on a set of arguments passed in, it should prefer methods where the argument types are more related to the original types. For example if varchar is used with UDFFromUnixTime(), varchar is convertible to both the double and string versions of evaluate() for that UDF. In this case we would prefer that the function registry select the string version over the double version, since varchar and string are both string types. This doesn't really affect any of the existing types, but comes into play with the addition of the varchar type (HIVE-4844). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4989) Consolidate and simplify vectorization code and test generation
[ https://issues.apache.org/jira/browse/HIVE-4989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jitendra Nath Pandey updated HIVE-4989: --- Status: Patch Available (was: Reopened) Consolidate and simplify vectorization code and test generation --- Key: HIVE-4989 URL: https://issues.apache.org/jira/browse/HIVE-4989 Project: Hive Issue Type: Sub-task Affects Versions: vectorization-branch Reporter: Tony Murphy Assignee: Tony Murphy Fix For: vectorization-branch Attachments: HIVE-4989.1-vectorization.patch, HIVE-4989.2-vectorization.patch, HIVE-4989.revert.patch, HIVE-4989-vectorization.patch The current code generation is unwieldy to use and prone to errors. This change consolidates all the code and test generation into a single location, and removes the need to manually place files which can lead to missing or incomplete code or tests. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4989) Consolidate and simplify vectorization code and test generation
[ https://issues.apache.org/jira/browse/HIVE-4989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jitendra Nath Pandey updated HIVE-4989: --- Attachment: HIVE-4989.2-vectorization.patch Re-uploading Tony's original patch rebased against the latest state of the vectorization branch. Consolidate and simplify vectorization code and test generation --- Key: HIVE-4989 URL: https://issues.apache.org/jira/browse/HIVE-4989 Project: Hive Issue Type: Sub-task Affects Versions: vectorization-branch Reporter: Tony Murphy Assignee: Tony Murphy Fix For: vectorization-branch Attachments: HIVE-4989.1-vectorization.patch, HIVE-4989.2-vectorization.patch, HIVE-4989.revert.patch, HIVE-4989-vectorization.patch The current code generation is unwieldy to use and prone to errors. This change consolidates all the code and test generation into a single location, and removes the need to manually place files which can lead to missing or incomplete code or tests. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758378#comment-13758378 ] Sushanth Sowmyan commented on HIVE-4895: Thanks for the patches, Eugene. I'm having a look at them. Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4895.move.patch, HIVE-4895.patch, HIVE-4895.rename.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-5210) WebHCatJTShim implementations are missing Apache license headers
[ https://issues.apache.org/jira/browse/HIVE-5210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sushanth Sowmyan updated HIVE-5210: --- Component/s: Shims WebHCatJTShim implementations are missing Apache license headers Key: HIVE-5210 URL: https://issues.apache.org/jira/browse/HIVE-5210 Project: Hive Issue Type: Bug Components: HCatalog, Shims Reporter: Jarek Jarcec Cecho Assignee: Jarek Jarcec Cecho Fix For: 0.12.0 Attachments: HIVE-5210.patch During investigation of SQOOP-1190, I've noticed that {{WebHCatJTShim}} implementaion files do not have license headers. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HIVE-5215) The optional methods in JDBC API implementations should throw a SQLFeatureNotSupportedException instead of SQLException
Vaibhav Gumashta created HIVE-5215: -- Summary: The optional methods in JDBC API implementations should throw a SQLFeatureNotSupportedException instead of SQLException Key: HIVE-5215 URL: https://issues.apache.org/jira/browse/HIVE-5215 Project: Hive Issue Type: Bug Components: JDBC Reporter: Vaibhav Gumashta Priority: Minor Fix For: 0.12.0 The optional methods in JDBC API implementations should throw a SQLFeatureNotSupportedException instead of SQLException. This is per JDBC standards. From the jdbc 4.1 specs: If a JDBC driver does not support or only provides partial support for an optional feature, the corresponding DatabaseMetaData method must indicate the feature is not supported. Any methods for a feature that is not implemented are required to throw a SQLFeatureNotSupportedException. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4896) create binary backwards compatibility layer hcatalog 0.12 and 0.11
[ https://issues.apache.org/jira/browse/HIVE-4896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4896: - Attachment: HIVE-4896.patch HIVE-4896 should be committed after HIVE-4895. HIVE-4896.patch contains all changes for 4895 4896. It's here for ptests to pick it up and run. Patches that need to be reviewed will be attached shortly. create binary backwards compatibility layer hcatalog 0.12 and 0.11 -- Key: HIVE-4896 URL: https://issues.apache.org/jira/browse/HIVE-4896 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4896.patch Original Estimate: 72h Remaining Estimate: 72h 1. create a 011 dir in each HCat submodule to hold these classes 2. update build to place these classes into jars 3. figure out how to test this The goal is for users who were using public API (e.g. HCatLoader) in their programs to be able to upgrade to hive 0.12 and have their programs still work without change. The compatibility layer will have a sunset clause to be removed in 2 releases. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-5158) allow getting all partitions for table to also use direct SQL path
[ https://issues.apache.org/jira/browse/HIVE-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HIVE-5158: --- Status: Patch Available (was: Open) I cannot repro so far... I will try on different box, but let's try to have hiveqa run allow getting all partitions for table to also use direct SQL path -- Key: HIVE-5158 URL: https://issues.apache.org/jira/browse/HIVE-5158 Project: Hive Issue Type: Improvement Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Attachments: HIVE-5158.D12573.1.patch, HIVE-5158.D12573.2.patch, HIVE-5158.D12573.3.patch, HIVE-5158.D12573.4.patch, HIVE-5158.D12573.5.patch While testing some queries I noticed that getPartitions can be very slow (which happens e.g. in non-strict mode with no partition column filter); with a table with many partitions it can take 10-12s easily. SQL perf path can also be used for this path. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4896) create binary backwards compatibility layer hcatalog 0.12 and 0.11
[ https://issues.apache.org/jira/browse/HIVE-4896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4896: - Attachment: HIVE-4896.v011.patch create binary backwards compatibility layer hcatalog 0.12 and 0.11 -- Key: HIVE-4896 URL: https://issues.apache.org/jira/browse/HIVE-4896 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4896.patch, HIVE-4896.v011.patch Original Estimate: 72h Remaining Estimate: 72h 1. create a 011 dir in each HCat submodule to hold these classes 2. update build to place these classes into jars 3. figure out how to test this The goal is for users who were using public API (e.g. HCatLoader) in their programs to be able to upgrade to hive 0.12 and have their programs still work without change. The compatibility layer will have a sunset clause to be removed in 2 releases. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4896) create binary backwards compatibility layer hcatalog 0.12 and 0.11
[ https://issues.apache.org/jira/browse/HIVE-4896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4896: - Status: Patch Available (was: Open) create binary backwards compatibility layer hcatalog 0.12 and 0.11 -- Key: HIVE-4896 URL: https://issues.apache.org/jira/browse/HIVE-4896 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4896.patch, HIVE-4896.v011.patch Original Estimate: 72h Remaining Estimate: 72h 1. create a 011 dir in each HCat submodule to hold these classes 2. update build to place these classes into jars 3. figure out how to test this The goal is for users who were using public API (e.g. HCatLoader) in their programs to be able to upgrade to hive 0.12 and have their programs still work without change. The compatibility layer will have a sunset clause to be removed in 2 releases. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5199) Read Only Custom SerDe works with HDP 1.1 but not with HDP 1.3
[ https://issues.apache.org/jira/browse/HIVE-5199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758412#comment-13758412 ] Hive QA commented on HIVE-5199: --- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12601447/HIVE-5199.patch.txt {color:red}ERROR:{color} -1 due to 24 failed/errored test(s), 2909 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_groupby_sort_11 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_partition_vs_table_metadata org.apache.hcatalog.mapreduce.TestHCatPartitioned.testHCatPartitionedTable org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_metadataonly1 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_partition_wise_fileformat9 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_decimal_precision org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_partition_wise_fileformat11 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_partition_serde_format org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_partition_wise_fileformat16 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_partition_date2 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_fileformat_mix org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_partition_wise_fileformat10 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_groupby_neg_float org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_partInit org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_partition_wise_fileformat12 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_partition_wise_fileformat org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_partition_wise_fileformat2 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_partition_wise_fileformat8 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_partition_wise_fileformat13 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_partition_wise_fileformat14 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_combine3 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_partition_date org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_partition_wise_fileformat15 org.apache.hcatalog.mapreduce.TestHCatExternalPartitioned.testHCatPartitionedTable {noformat} Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/615/testReport Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/615/console Messages: {noformat} Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests failed with: TestsFailedException: 24 tests failed {noformat} This message is automatically generated. Read Only Custom SerDe works with HDP 1.1 but not with HDP 1.3 -- Key: HIVE-5199 URL: https://issues.apache.org/jira/browse/HIVE-5199 Project: Hive Issue Type: Bug Reporter: Hari Sankar Sivarama Subramaniyan Assignee: Hari Sankar Sivarama Subramaniyan Priority: Critical Attachments: HIVE-5199.patch.txt Custom serdes which used to work in HDP 1.1 is no longer working with HDP 1.3. The issue happens when the partition serde is not of settable type in HDP 1.3. The below exception happens via FetchOperator as well as MapOperator. Inside FetchOperator consider the following call: getRecordReader()-ObjectInspectorConverters. getConverter() The output object inspector is of settable type(because it is generated via ObjectInspectorConverters.getConvertedOI()) where as the input object inspector that gets passed as serde.getObjectorInspector() and is non-settable. Inside getConverter(), the (inputOI.equals(outputOI)) check fails and the switch statement tries to cast the non-settable object inspector to a settable object inspector. The stack trace as follows: 2013-08-28 17:57:25,307 ERROR CliDriver (SessionState.java:printError(432)) - Failed with exception java.io.IOException:java.lang.ClassCastException: com.skype.data.whaleshark.hadoop.hive.proto.ProtoMapObjectInspector cannot be cast to org.apache.hadoop.hive.serde2.objectinspector.SettableMapObjectInspector java.io.IOException: java.lang.ClassCastException: com.skype.data.whaleshark.hadoop.hive.proto.ProtoMapObjectInspector cannot be cast to org.apache.hadoop.hive.serde2.objectinspector.SettableMapObjectInspector at org.apache.hadoop.hive.ql.exec.FetchOperator.getNextRow(FetchOperator.java:544) at org.apache.hadoop.hive.ql.exec.FetchOperator.pushRow(FetchOperator.java:488) at org.apache.hadoop.hive.ql.exec.FetchTask.fetch(FetchTask.java:136) at org.apache.hadoop.hive.ql.Driver.getResults(Driver.java:1412) at
[jira] [Updated] (HIVE-4896) create binary backwards compatibility layer hcatalog 0.12 and 0.11
[ https://issues.apache.org/jira/browse/HIVE-4896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4896: - Attachment: HIVE-4896.changes.patch create binary backwards compatibility layer hcatalog 0.12 and 0.11 -- Key: HIVE-4896 URL: https://issues.apache.org/jira/browse/HIVE-4896 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4896.changes.patch, HIVE-4896.patch, HIVE-4896.v011.patch Original Estimate: 72h Remaining Estimate: 72h 1. create a 011 dir in each HCat submodule to hold these classes 2. update build to place these classes into jars 3. figure out how to test this The goal is for users who were using public API (e.g. HCatLoader) in their programs to be able to upgrade to hive 0.12 and have their programs still work without change. The compatibility layer will have a sunset clause to be removed in 2 releases. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4896) create binary backwards compatibility layer hcatalog 0.12 and 0.11
[ https://issues.apache.org/jira/browse/HIVE-4896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758432#comment-13758432 ] Eugene Koifman commented on HIVE-4896: -- HIVE-4896.v011.patch - contains 0.11 version of the main/test sources from http://apache.mirrors.pair.com/hive/hive-0.11.0/hive-0.11.0.tar.gz. HIVE-4896.changes.patch - contains adjustments that had to be made. For example since shim layer was refactored in 0.12, the 0.11 classes are modified to use the new shim layer. It's best if these changes are checked into SVN the same way, as 2 checkins. It will preserve history in a way that is more useful. create binary backwards compatibility layer hcatalog 0.12 and 0.11 -- Key: HIVE-4896 URL: https://issues.apache.org/jira/browse/HIVE-4896 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4896.changes.patch, HIVE-4896.patch, HIVE-4896.v011.patch Original Estimate: 72h Remaining Estimate: 72h 1. create a 011 dir in each HCat submodule to hold these classes 2. update build to place these classes into jars 3. figure out how to test this The goal is for users who were using public API (e.g. HCatLoader) in their programs to be able to upgrade to hive 0.12 and have their programs still work without change. The compatibility layer will have a sunset clause to be removed in 2 releases. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758436#comment-13758436 ] Sushanth Sowmyan commented on HIVE-4895: So far, I have verified the following: * The move patch + the rename patch = the unified patch * The move patch is correct, and moves all the required classes to the respective correct locations. I am yet to verify the rename patch fully, and will update when I do. Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4895.move.patch, HIVE-4895.patch, HIVE-4895.rename.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758443#comment-13758443 ] Eugene Koifman commented on HIVE-4895: -- I think the easiest way to verify combined changes is to compile + run unit tests (I also ran webhcat e2e). I can't imagine this won't catch some error. Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4895.move.patch, HIVE-4895.patch, HIVE-4895.rename.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4895: - Attachment: (was: HIVE-4895.rename.patch) Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4895.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4895: - Attachment: (was: HIVE-4895.move.patch) Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4895.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4895: - Status: Patch Available (was: Open) Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4895.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4895: - Status: Open (was: Patch Available) Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4895.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4895: - Attachment: 4895.rename 4895.move renamed HIVE-4895.move.patch to 4895.move renamed HIVE-4895.update.patch to 4895.rename so that ptest ignores them Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: 4895.move, 4895.rename, HIVE-4895.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4896) create binary backwards compatibility layer hcatalog 0.12 and 0.11
[ https://issues.apache.org/jira/browse/HIVE-4896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4896: - Attachment: (was: HIVE-4896.v011.patch) create binary backwards compatibility layer hcatalog 0.12 and 0.11 -- Key: HIVE-4896 URL: https://issues.apache.org/jira/browse/HIVE-4896 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4896.patch Original Estimate: 72h Remaining Estimate: 72h 1. create a 011 dir in each HCat submodule to hold these classes 2. update build to place these classes into jars 3. figure out how to test this The goal is for users who were using public API (e.g. HCatLoader) in their programs to be able to upgrade to hive 0.12 and have their programs still work without change. The compatibility layer will have a sunset clause to be removed in 2 releases. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4896) create binary backwards compatibility layer hcatalog 0.12 and 0.11
[ https://issues.apache.org/jira/browse/HIVE-4896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4896: - Status: Open (was: Patch Available) create binary backwards compatibility layer hcatalog 0.12 and 0.11 -- Key: HIVE-4896 URL: https://issues.apache.org/jira/browse/HIVE-4896 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4896.patch Original Estimate: 72h Remaining Estimate: 72h 1. create a 011 dir in each HCat submodule to hold these classes 2. update build to place these classes into jars 3. figure out how to test this The goal is for users who were using public API (e.g. HCatLoader) in their programs to be able to upgrade to hive 0.12 and have their programs still work without change. The compatibility layer will have a sunset clause to be removed in 2 releases. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4896) create binary backwards compatibility layer hcatalog 0.12 and 0.11
[ https://issues.apache.org/jira/browse/HIVE-4896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4896: - Status: Patch Available (was: Open) create binary backwards compatibility layer hcatalog 0.12 and 0.11 -- Key: HIVE-4896 URL: https://issues.apache.org/jira/browse/HIVE-4896 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4896.patch Original Estimate: 72h Remaining Estimate: 72h 1. create a 011 dir in each HCat submodule to hold these classes 2. update build to place these classes into jars 3. figure out how to test this The goal is for users who were using public API (e.g. HCatLoader) in their programs to be able to upgrade to hive 0.12 and have their programs still work without change. The compatibility layer will have a sunset clause to be removed in 2 releases. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4896) create binary backwards compatibility layer hcatalog 0.12 and 0.11
[ https://issues.apache.org/jira/browse/HIVE-4896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4896: - Attachment: (was: HIVE-4896.changes.patch) create binary backwards compatibility layer hcatalog 0.12 and 0.11 -- Key: HIVE-4896 URL: https://issues.apache.org/jira/browse/HIVE-4896 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-4896.patch Original Estimate: 72h Remaining Estimate: 72h 1. create a 011 dir in each HCat submodule to hold these classes 2. update build to place these classes into jars 3. figure out how to test this The goal is for users who were using public API (e.g. HCatLoader) in their programs to be able to upgrade to hive 0.12 and have their programs still work without change. The compatibility layer will have a sunset clause to be removed in 2 releases. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4896) create binary backwards compatibility layer hcatalog 0.12 and 0.11
[ https://issues.apache.org/jira/browse/HIVE-4896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758493#comment-13758493 ] Eugene Koifman commented on HIVE-4896: -- To be clear: HIVE-4896.patch is not supposed to be committed. create binary backwards compatibility layer hcatalog 0.12 and 0.11 -- Key: HIVE-4896 URL: https://issues.apache.org/jira/browse/HIVE-4896 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: 4896.changes, 4896.v011, HIVE-4896.patch Original Estimate: 72h Remaining Estimate: 72h 1. create a 011 dir in each HCat submodule to hold these classes 2. update build to place these classes into jars 3. figure out how to test this The goal is for users who were using public API (e.g. HCatLoader) in their programs to be able to upgrade to hive 0.12 and have their programs still work without change. The compatibility layer will have a sunset clause to be removed in 2 releases. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4896) create binary backwards compatibility layer hcatalog 0.12 and 0.11
[ https://issues.apache.org/jira/browse/HIVE-4896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4896: - Attachment: 4896.v011 4896.changes renamed HIVE-4896.v011.patch to 4896.v011 renamed HIVE-4896.changes.patch to 4896.changes to make sure it doesn't conflict ptest framework create binary backwards compatibility layer hcatalog 0.12 and 0.11 -- Key: HIVE-4896 URL: https://issues.apache.org/jira/browse/HIVE-4896 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: 4896.changes, 4896.v011, HIVE-4896.patch Original Estimate: 72h Remaining Estimate: 72h 1. create a 011 dir in each HCat submodule to hold these classes 2. update build to place these classes into jars 3. figure out how to test this The goal is for users who were using public API (e.g. HCatLoader) in their programs to be able to upgrade to hive 0.12 and have their programs still work without change. The compatibility layer will have a sunset clause to be removed in 2 releases. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758500#comment-13758500 ] Sushanth Sowmyan commented on HIVE-4895: That's what I'm running in the background. :) But it doesn't hurt to be more thorough. I've now gone through the rename patch also, trying to make sure that the only diffs in place are those that are changing the package name. I found a couple others as well: * Changes in pom.xml to suppress extra specification of the groupId after the parent's groupId has already been specified. I guess that makes sense from the perspective of honouring the parent groupId, and that works. * Commenting out some parameters in hcatalog/src/test/e2e/templeton/build.xml. Why are these parameters being commented out here? {code} -arg value=${basedir}/tests/serverstatus.conf/ -arg value=${basedir}/tests/ddl.conf/ -arg value=${basedir}/tests/jobsubmission.conf/ -arg value=${basedir}/tests/jobsubmission2.conf/ +!--arg value=${basedir}/tests/serverstatus.conf/ +arg value=${basedir}/tests/ddl.conf/-- + arg value=${basedir}/tests/jobsubmission.conf/ +!--arg value=${basedir}/tests/jobsubmission2.conf/-- {code} Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: 4895.move, 4895.rename, HIVE-4895.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Assigned] (HIVE-5127) Upgrade xerces and xalan for WebHCat
[ https://issues.apache.org/jira/browse/HIVE-5127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman reassigned HIVE-5127: Assignee: Eugene Koifman Upgrade xerces and xalan for WebHCat Key: HIVE-5127 URL: https://issues.apache.org/jira/browse/HIVE-5127 Project: Hive Issue Type: Bug Components: WebHCat Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Currently webhcat log files are full of exceptions like this, which obscures the real output. Upgrading to more recent versions of xerces/xalan fixes this. Add the following to hive/hcatalog/webhcat/svr/pom.xml dependency groupIdxerces/groupId artifactIdxercesImpl/artifactId version2.9.1/version /dependency dependency groupIdxalan/groupId artifactIdxalan/artifactId version2.7.1/version /dependency 13/08/20 16:54:04 ERROR conf.Configuration: Failed to set setXIncludeAware(true) for parser org.apache.xerces.jaxp.DocumentBuilderFactoryImpl@48dbb335:java.lang.UnsupportedOperationException: This parser does not support specification null version null java.lang.UnsupportedOperationException: This parser does not support specification null version null at javax.xml.parsers.DocumentBuilderFactory.setXIncludeAware(DocumentBuilderFactory.java:590) at org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:1892) at org.apache.hadoop.conf.Configuration.loadResources(Configuration.java:1861) at org.apache.hadoop.conf.Configuration.getProps(Configuration.java:1778) at org.apache.hadoop.conf.Configuration.get(Configuration.java:870) at org.apache.hadoop.fs.FileSystem.getDefaultUri(FileSystem.java:171) at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:305) at org.apache.hadoop.fs.FileSystem.getLocal(FileSystem.java:288) at org.apache.hadoop.util.GenericOptionsParser.validateFiles(GenericOptionsParser.java:383) at org.apache.hadoop.util.GenericOptionsParser.processGeneralOptions(GenericOptionsParser.java:281) at org.apache.hadoop.util.GenericOptionsParser.parseGeneralOptions(GenericOptionsParser.java:422) at org.apache.hadoop.util.GenericOptionsParser.init(GenericOptionsParser.java:168) at org.apache.hadoop.util.GenericOptionsParser.init(GenericOptionsParser.java:151) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:64) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84) at org.apache.hcatalog.templeton.LauncherDelegator$1.run(LauncherDelegator.java:99) at org.apache.hcatalog.templeton.LauncherDelegator$1.run(LauncherDelegator.java:95) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:396) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1441) at org.apache.hcatalog.templeton.LauncherDelegator.queueAsUser(LauncherDelegator.java:95) at org.apache.hcatalog.templeton.LauncherDelegator.enqueueController(LauncherDelegator.java:77) at org.apache.hcatalog.templeton.JarDelegator.run(JarDelegator.java:52) at org.apache.hcatalog.templeton.StreamingDelegator.run(StreamingDelegator.java:53) at org.apache.hcatalog.templeton.Server.mapReduceStreaming(Server.java:596) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at com.sun.jersey.spi.container.JavaMethodInvokerFactory$1.invoke(JavaMethodInvokerFactory.java:60) at com.sun.jersey.server.impl.model.method.dispatch.AbstractResourceMethodDispatchProvider$TypeOutInvoker._dispatch(AbstractResourceMethodDispatchProvider.java:185) at com.sun.jersey.server.impl.model.method.dispatch.ResourceJavaMethodDispatcher.dispatch(ResourceJavaMethodDispatcher.java:75) at com.sun.jersey.server.impl.uri.rules.HttpMethodRule.accept(HttpMethodRule.java:302) at com.sun.jersey.server.impl.uri.rules.RightHandPathRule.accept(RightHandPathRule.java:147) at com.sun.jersey.server.impl.uri.rules.ResourceClassRule.accept(ResourceClassRule.java:108) at com.sun.jersey.server.impl.uri.rules.RightHandPathRule.accept(RightHandPathRule.java:147) at com.sun.jersey.server.impl.uri.rules.RootResourceClassesRule.accept(RootResourceClassesRule.java:84) at
[jira] [Updated] (HIVE-4958) AppConfig.init() loads webhcat-*.xml before core-*.xml
[ https://issues.apache.org/jira/browse/HIVE-4958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4958: - Component/s: (was: HCatalog) WebHCat AppConfig.init() loads webhcat-*.xml before core-*.xml -- Key: HIVE-4958 URL: https://issues.apache.org/jira/browse/HIVE-4958 Project: Hive Issue Type: Bug Components: WebHCat Affects Versions: 0.12.0 Reporter: Eugene Koifman This method first loads webhcat-*.xml and then core-*xml, mapred-*.xml, etc. Shouldn't it be in the opposite order? -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4725) templeton.hadoop.queue.name property should be documented
[ https://issues.apache.org/jira/browse/HIVE-4725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4725: - Component/s: (was: HCatalog) WebHCat Documentation templeton.hadoop.queue.name property should be documented - Key: HIVE-4725 URL: https://issues.apache.org/jira/browse/HIVE-4725 Project: Hive Issue Type: Bug Components: Documentation, WebHCat Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Lefty Leverenz This is to track that changes in HIVE-4679 get documented. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-5158) allow getting all partitions for table to also use direct SQL path
[ https://issues.apache.org/jira/browse/HIVE-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phabricator updated HIVE-5158: -- Attachment: HIVE-5158.D12573.6.patch sershe updated the revision HIVE-5158 [jira] allow getting all partitions for table to also use direct SQL path. Trying to update the patch to trigger HiveQA Reviewers: ashutoshc, JIRA REVISION DETAIL https://reviews.facebook.net/D12573 CHANGE SINCE LAST DIFF https://reviews.facebook.net/D12573?vs=39447id=39513#toc MANIPHEST TASKS https://reviews.facebook.net/T63 BRANCH HIVE-5158 ARCANIST PROJECT hive AFFECTED FILES metastore/src/java/org/apache/hadoop/hive/metastore/MetaStoreDirectSql.java metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java metastore/src/test/org/apache/hadoop/hive/metastore/VerifyingObjectStore.java ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java ql/src/java/org/apache/hadoop/hive/ql/optimizer/ppr/PartitionPruner.java To: JIRA, ashutoshc, sershe allow getting all partitions for table to also use direct SQL path -- Key: HIVE-5158 URL: https://issues.apache.org/jira/browse/HIVE-5158 Project: Hive Issue Type: Improvement Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Attachments: HIVE-5158.D12573.1.patch, HIVE-5158.D12573.2.patch, HIVE-5158.D12573.3.patch, HIVE-5158.D12573.4.patch, HIVE-5158.D12573.5.patch, HIVE-5158.D12573.6.patch While testing some queries I noticed that getPartitions can be very slow (which happens e.g. in non-strict mode with no partition column filter); with a table with many partitions it can take 10-12s easily. SQL perf path can also be used for this path. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758510#comment-13758510 ] Sushanth Sowmyan commented on HIVE-4895: All hcatalog tests pass. I'm okay with going ahead with this patch once I understand what these commented out parameters do and why it was needed. Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: 4895.move, 4895.rename, HIVE-4895.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758511#comment-13758511 ] Eugene Koifman commented on HIVE-4895: -- You are right, and this proves that point :) there should not be any changes in this build.xml Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: 4895.move, 4895.rename, HIVE-4895.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Resolved] (HIVE-4894) Update maven coordinates of HCatalog artifacts
[ https://issues.apache.org/jira/browse/HIVE-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman resolved HIVE-4894. -- Resolution: Fixed this was done as part of HIVE-4895 Update maven coordinates of HCatalog artifacts -- Key: HIVE-4894 URL: https://issues.apache.org/jira/browse/HIVE-4894 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Original Estimate: 2h Remaining Estimate: 2h pom.xml files in hcatalog/ specify groupIdorg.apache.hcatalog/groupId and artifactIdhcatalog-core/artifactId Once HCat source is moved to org.apache.hive.hcatalog, groupId should match this new name artifact IDs should be change to how hive's scheme, i.e. with 'hive-' prefix. Also, remove groupId from POMs of hcat submodules as it's already specified in parent POM. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sushanth Sowmyan updated HIVE-4895: --- Attachment: HIVE-4895.2.patch Okay :) I'm uploading a new patch called HIVE-4895.2.patch with that build.xml patch taken out so that the pre-commit tests can pick that up. Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: 4895.move, 4895.rename, HIVE-4895.2.patch, HIVE-4895.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Resolved] (HIVE-4266) Refactor HCatalog code to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman resolved HIVE-4266. -- Resolution: Duplicate Dup of HIVE-4895 Refactor HCatalog code to org.apache.hive.hcatalog -- Key: HIVE-4266 URL: https://issues.apache.org/jira/browse/HIVE-4266 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.11.0 Reporter: Alan Gates Assignee: Eugene Koifman Priority: Blocker Fix For: 0.12.0 Currently HCatalog code is in packages org.apache.hcatalog. It needs to now move to org.apache.hive.hcatalog. Shell classes/interface need to be created for public facing classes so that user's code does not break. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sushanth Sowmyan updated HIVE-4895: --- Status: Open (was: Patch Available) Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: 4895.move, 4895.rename, HIVE-4895.2.patch, HIVE-4895.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sushanth Sowmyan updated HIVE-4895: --- Status: Patch Available (was: Open) Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: 4895.move, 4895.rename, HIVE-4895.2.patch, HIVE-4895.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-5213) remove hcatalog/shims directory
[ https://issues.apache.org/jira/browse/HIVE-5213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-5213: - Description: hcatalog/shims is no longer relevant. It should have been deleted as part of HIVE-4460 (it's in the .patch) but for some reason it's still in the source tree. (was: hcatalog/shims is no longer relevant. It should have been deleted as part of HIVE-4460 (it's in the .patch) but for some reason it's still in the tree.) remove hcatalog/shims directory --- Key: HIVE-5213 URL: https://issues.apache.org/jira/browse/HIVE-5213 Project: Hive Issue Type: Bug Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 hcatalog/shims is no longer relevant. It should have been deleted as part of HIVE-4460 (it's in the .patch) but for some reason it's still in the source tree. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-4895) Move all HCatalog classes to org.apache.hive.hcatalog
[ https://issues.apache.org/jira/browse/HIVE-4895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758521#comment-13758521 ] Sushanth Sowmyan commented on HIVE-4895: Although I've marked it for the precommit build to pick up, I'm +1 on this, and have verified it locally. I'll try to start getting the massive svn mv required working and ready before I commit. Move all HCatalog classes to org.apache.hive.hcatalog - Key: HIVE-4895 URL: https://issues.apache.org/jira/browse/HIVE-4895 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: 4895.move, 4895.rename, HIVE-4895.2.patch, HIVE-4895.patch Original Estimate: 24h Time Spent: 12h Remaining Estimate: 12h make sure to preserve history in SCM -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-5213) remove hcatalog/shims directory
[ https://issues.apache.org/jira/browse/HIVE-5213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-5213: - Attachment: HIVE-5213.patch remove hcatalog/shims directory --- Key: HIVE-5213 URL: https://issues.apache.org/jira/browse/HIVE-5213 Project: Hive Issue Type: Bug Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-5213.patch hcatalog/shims is no longer relevant. It should have been deleted as part of HIVE-4460 (it's in the .patch) but for some reason it's still in the source tree. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-5213) remove hcatalog/shims directory
[ https://issues.apache.org/jira/browse/HIVE-5213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-5213: - Status: Patch Available (was: Open) remove hcatalog/shims directory --- Key: HIVE-5213 URL: https://issues.apache.org/jira/browse/HIVE-5213 Project: Hive Issue Type: Bug Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: HIVE-5213.patch hcatalog/shims is no longer relevant. It should have been deleted as part of HIVE-4460 (it's in the .patch) but for some reason it's still in the source tree. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5203) FunctionRegistry.getMethodInternal() should prefer method arguments with closer affinity to the original argument types
[ https://issues.apache.org/jira/browse/HIVE-5203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758528#comment-13758528 ] Hive QA commented on HIVE-5203: --- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12601464/HIVE-5203.2.patch {color:red}ERROR:{color} -1 due to 4 failed/errored test(s), 2911 tests executed *Failed tests:* {noformat} org.apache.hcatalog.pig.TestHCatLoader.testGetInputBytes org.apache.hcatalog.pig.TestHCatLoader.testReadPartitionedBasic org.apache.hcatalog.pig.TestHCatLoader.testProjectionsBasic org.apache.hcatalog.mapreduce.TestHCatExternalDynamicPartitioned.testHCatDynamicPartitionedTableMultipleTask {noformat} Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/616/testReport Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/616/console Messages: {noformat} Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests failed with: TestsFailedException: 4 tests failed {noformat} This message is automatically generated. FunctionRegistry.getMethodInternal() should prefer method arguments with closer affinity to the original argument types --- Key: HIVE-5203 URL: https://issues.apache.org/jira/browse/HIVE-5203 Project: Hive Issue Type: Bug Components: Types, UDF Reporter: Jason Dere Assignee: Jason Dere Attachments: HIVE-5203.1.patch, HIVE-5203.2.patch When the function registry is trying to determine the best version of UDF evaluate() to use based on a set of arguments passed in, it should prefer methods where the argument types are more related to the original types. For example if varchar is used with UDFFromUnixTime(), varchar is convertible to both the double and string versions of evaluate() for that UDF. In this case we would prefer that the function registry select the string version over the double version, since varchar and string are both string types. This doesn't really affect any of the existing types, but comes into play with the addition of the varchar type (HIVE-4844). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4896) create binary backwards compatibility layer hcatalog 0.12 and 0.11
[ https://issues.apache.org/jira/browse/HIVE-4896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-4896: - Status: Open (was: Patch Available) create binary backwards compatibility layer hcatalog 0.12 and 0.11 -- Key: HIVE-4896 URL: https://issues.apache.org/jira/browse/HIVE-4896 Project: Hive Issue Type: Sub-task Components: HCatalog Affects Versions: 0.12.0 Reporter: Eugene Koifman Assignee: Eugene Koifman Fix For: 0.12.0 Attachments: 4896.changes, 4896.v011, HIVE-4896.patch Original Estimate: 72h Remaining Estimate: 72h 1. create a 011 dir in each HCat submodule to hold these classes 2. update build to place these classes into jars 3. figure out how to test this The goal is for users who were using public API (e.g. HCatLoader) in their programs to be able to upgrade to hive 0.12 and have their programs still work without change. The compatibility layer will have a sunset clause to be removed in 2 releases. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-2818) Create table checks the current database privilege
[ https://issues.apache.org/jira/browse/HIVE-2818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758540#comment-13758540 ] Mikhail Antonov commented on HIVE-2818: --- Is that fixed? Create table checks the current database privilege -- Key: HIVE-2818 URL: https://issues.apache.org/jira/browse/HIVE-2818 Project: Hive Issue Type: Bug Components: Authorization, Security Affects Versions: 0.7.1 Reporter: Benyi Wang Hive seems check the current database to determine the privilege of a statement when you use fully qualified name like 'database.table' {code} hive set hive.security.authorization.enabled=true; hive create database test_db; hive grant all on database test_db to user test_user; hive revoke all on database default from test_user; hive use default; hive create table test_db.new_table (id int); Authorization failed:No privilege 'Create' found for outputs { database:default}. Use show grant to get more details. hive use test_db; hive create table test_db.new_table (id int); {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5203) FunctionRegistry.getMethodInternal() should prefer method arguments with closer affinity to the original argument types
[ https://issues.apache.org/jira/browse/HIVE-5203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758554#comment-13758554 ] Jason Dere commented on HIVE-5203: -- Ran both TestHCatLoader and TestHCatExternalDynamicPartitioned locally and these tests pass for me. FunctionRegistry.getMethodInternal() should prefer method arguments with closer affinity to the original argument types --- Key: HIVE-5203 URL: https://issues.apache.org/jira/browse/HIVE-5203 Project: Hive Issue Type: Bug Components: Types, UDF Reporter: Jason Dere Assignee: Jason Dere Attachments: HIVE-5203.1.patch, HIVE-5203.2.patch When the function registry is trying to determine the best version of UDF evaluate() to use based on a set of arguments passed in, it should prefer methods where the argument types are more related to the original types. For example if varchar is used with UDFFromUnixTime(), varchar is convertible to both the double and string versions of evaluate() for that UDF. In this case we would prefer that the function registry select the string version over the double version, since varchar and string are both string types. This doesn't really affect any of the existing types, but comes into play with the addition of the varchar type (HIVE-4844). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-1545) Add a bunch of UDFs and UDAFs
[ https://issues.apache.org/jira/browse/HIVE-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758552#comment-13758552 ] Jonathan Chang commented on HIVE-1545: -- Ok, I've put a self-contained, compiling repo here: https://github.com/slycoder/hive-udfs What are the next steps? Add a bunch of UDFs and UDAFs - Key: HIVE-1545 URL: https://issues.apache.org/jira/browse/HIVE-1545 Project: Hive Issue Type: New Feature Components: UDF Reporter: Jonathan Chang Assignee: Jonathan Chang Priority: Minor Attachments: core.tar.gz, ext.tar.gz, UDFEndsWith.java, UDFFindInString.java, UDFLtrim.java, UDFRtrim.java, udfs.tar.gz, udfs.tar.gz, UDFStartsWith.java, UDFTrim.java Here some UD(A)Fs which can be incorporated into the Hive distribution: UDFArgMax - Find the 0-indexed index of the largest argument. e.g., ARGMAX(4, 5, 3) returns 1. UDFBucket - Find the bucket in which the first argument belongs. e.g., BUCKET(x, b_1, b_2, b_3, ...), will return the smallest i such that x b_{i} but = b_{i+1}. Returns 0 if x is smaller than all the buckets. UDFFindInArray - Finds the 1-index of the first element in the array given as the second argument. Returns 0 if not found. Returns NULL if either argument is NULL. E.g., FIND_IN_ARRAY(5, array(1,2,5)) will return 3. FIND_IN_ARRAY(5, array(1,2,3)) will return 0. UDFGreatCircleDist - Finds the great circle distance (in km) between two lat/long coordinates (in degrees). UDFLDA - Performs LDA inference on a vector given fixed topics. UDFNumberRows - Number successive rows starting from 1. Counter resets to 1 whenever any of its parameters changes. UDFPmax - Finds the maximum of a set of columns. e.g., PMAX(4, 5, 3) returns 5. UDFRegexpExtractAll - Like REGEXP_EXTRACT except that it returns all matches in an array. UDFUnescape - Returns the string unescaped (using C/Java style unescaping). UDFWhich - Given a boolean array, return the indices which are TRUE. UDFJaccard UDAFCollect - Takes all the values associated with a row and converts it into a list. Make sure to have: set hive.map.aggr = false; UDAFCollectMap - Like collect except that it takes tuples and generates a map. UDAFEntropy - Compute the entropy of a column. UDAFPearson (BROKEN!!!) - Computes the pearson correlation between two columns. UDAFTop - TOP(KEY, VAL) - returns the KEY associated with the largest value of VAL. UDAFTopN (BROKEN!!!) - Like TOP except returns a list of the keys associated with the N (passed as the third parameter) largest values of VAL. UDAFHistogram -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-5158) allow getting all partitions for table to also use direct SQL path
[ https://issues.apache.org/jira/browse/HIVE-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HIVE-5158: --- Status: Open (was: Patch Available) woops, I found the issue... pretty stupid actually allow getting all partitions for table to also use direct SQL path -- Key: HIVE-5158 URL: https://issues.apache.org/jira/browse/HIVE-5158 Project: Hive Issue Type: Improvement Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Attachments: HIVE-5158.D12573.1.patch, HIVE-5158.D12573.2.patch, HIVE-5158.D12573.3.patch, HIVE-5158.D12573.4.patch, HIVE-5158.D12573.5.patch, HIVE-5158.D12573.6.patch While testing some queries I noticed that getPartitions can be very slow (which happens e.g. in non-strict mode with no partition column filter); with a table with many partitions it can take 10-12s easily. SQL perf path can also be used for this path. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
Re: [ANNOUNCE] New Hive Committer - Yin Huai
Congratulations, Yin. I remember your hard work on ysmart ( https://issues.apache.org/jira/browse/HIVE-2206) and others( https://issues.apache.org/jira/issues/?jql=project%20%3D%20HIVE%20AND%20status%20in%20(Resolved%2C%20Closed)%20AND%20assignee%20in%20(yhuai)%20ORDER%20BY%20updated%20DESC%2C%20priority%20DESC ) 2013/9/5 Clark Yang (杨卓荦) yangzhuo...@gmail.com Congratulations, Yin Cheers, Zhuoluo (Clark) Yang 2013/9/5 Yin Huai huaiyin@gmail.com Thanks everybody! This is awesome! On Wed, Sep 4, 2013 at 1:58 PM, Daniel Dai da...@hortonworks.com wrote: Congratulation! On Wed, Sep 4, 2013 at 10:39 AM, yongqiang he heyongqiang...@gmail.com wrote: Congrats! On Wed, Sep 4, 2013 at 10:23 AM, Jason Dere jd...@hortonworks.com wrote: Yin, congrats! Jason On Sep 4, 2013, at 7:54 AM, Eugene Koifman ekoif...@hortonworks.com wrote: Congrats! On Wed, Sep 4, 2013 at 5:23 AM, Brock Noland br...@cloudera.com wrote: Congrats Yin!! On Wed, Sep 4, 2013 at 4:14 AM, Lefty Leverenz leftylever...@gmail.com wrote: Bravo, Yin! -- Lefty On Wed, Sep 4, 2013 at 4:17 AM, Sushanth Sowmyan khorg...@gmail.com wrote: Congrats, Yin! :) On Sep 4, 2013 1:13 AM, Alexander Alten-Lorenz wget.n...@gmail.com wrote: Amazing news, congratz Yin! Well deserved! On Sep 4, 2013, at 6:49 AM, Carl Steinbach c...@apache.org wrote: The Apache Hive PMC has voted to make Yin Huai a committer on the Apache Hive project. Please join me in congratulating Yin! Thanks. Carl -- Alexander Alten-Lorenz http://mapredit.blogspot.com German Hadoop LinkedIn Group: http://goo.gl/N8pCF -- Apache MRUnit - Unit testing MapReduce - http://mrunit.apache.org -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
[jira] [Commented] (HIVE-4331) Integrated StorageHandler for Hive and HCat using the HiveStorageHandler
[ https://issues.apache.org/jira/browse/HIVE-4331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758575#comment-13758575 ] Viraj Bhat commented on HIVE-4331: -- Sushanth, most of the issues stated in phabricator have been addressed in the patch I will reattach if HIVE-4869 goes through. Meanwhile I will wait for your go as to when HIVE-4869 gets committed. Integrated StorageHandler for Hive and HCat using the HiveStorageHandler Key: HIVE-4331 URL: https://issues.apache.org/jira/browse/HIVE-4331 Project: Hive Issue Type: Task Components: HBase Handler, HCatalog Affects Versions: 0.11.0, 0.12.0 Reporter: Ashutosh Chauhan Assignee: Viraj Bhat Attachments: HIVE4331_07-17.patch, StorageHandlerDesign_HIVE4331.pdf 1) Deprecate the HCatHBaseStorageHandler and RevisionManager from HCatalog. These will now continue to function but internally they will use the DefaultStorageHandler from Hive. They will be removed in future release of Hive. 2) Design a HivePassThroughFormat so that any new StorageHandler in Hive will bypass the HiveOutputFormat. We will use this class in Hive's HBaseStorageHandler instead of the HiveHBaseTableOutputFormat. 3) Write new unit tests in the HCat's storagehandler so that systems such as Pig and Map Reduce can use the Hive's HBaseStorageHandler instead of the HCatHBaseStorageHandler. 4) Make sure all the old and new unit tests pass without backward compatibility (except known issues as described in the Design Document). 5) Replace all instances of the HCat source code, which point to HCatStorageHandler to use theHiveStorageHandler including the FosterStorageHandler. I have attached the design document for the same and will attach a patch to this Jira. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-5209) JDBC support for varchar
[ https://issues.apache.org/jira/browse/HIVE-5209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Dere updated HIVE-5209: - Status: Patch Available (was: Open) JDBC support for varchar Key: HIVE-5209 URL: https://issues.apache.org/jira/browse/HIVE-5209 Project: Hive Issue Type: Improvement Components: JDBC, Types Reporter: Jason Dere Assignee: Jason Dere Attachments: HIVE-5209.1.patch, HIVE-5209.D12705.1.patch Support returning varchar length in result set metadata NO PRECOMMIT TESTS - dependent on HIVE-4844 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-5216) Need to annotate public API in HCatalog
[ https://issues.apache.org/jira/browse/HIVE-5216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugene Koifman updated HIVE-5216: - Description: need to annotate which API is considered public using something like @InterfaceAudience.Public @InterfaceStability.Evolving Currently this is what is considered (at a minimum) public API HCatLoader HCatStorer HCatInputFormat HCatOutputFormat HCatReader HCatWriter HCatRecord HCatSchema This is needed so that clients/dependent projects know which API they can rely on and which can change w/o notice. was: need to annotate which API is considered public using something like @InterfaceAudience.Public @InterfaceStability.Evolving Currently this is what is considered (at a minimum) public API HCatLoader HCatStorer HCatInputFormat HCatOutputFormat HCatReader HCatWriter HCatRecord HCatSchema Need to annotate public API in HCatalog --- Key: HIVE-5216 URL: https://issues.apache.org/jira/browse/HIVE-5216 Project: Hive Issue Type: Bug Components: HCatalog, WebHCat Affects Versions: 0.12.0 Reporter: Eugene Koifman need to annotate which API is considered public using something like @InterfaceAudience.Public @InterfaceStability.Evolving Currently this is what is considered (at a minimum) public API HCatLoader HCatStorer HCatInputFormat HCatOutputFormat HCatReader HCatWriter HCatRecord HCatSchema This is needed so that clients/dependent projects know which API they can rely on and which can change w/o notice. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HIVE-5216) Need to annotate public API in HCatalog
Eugene Koifman created HIVE-5216: Summary: Need to annotate public API in HCatalog Key: HIVE-5216 URL: https://issues.apache.org/jira/browse/HIVE-5216 Project: Hive Issue Type: Bug Components: HCatalog, WebHCat Affects Versions: 0.12.0 Reporter: Eugene Koifman need to annotate which API is considered public using something like @InterfaceAudience.Public @InterfaceStability.Evolving Currently this is what is considered (at a minimum) public API HCatLoader HCatStorer HCatInputFormat HCatOutputFormat HCatReader HCatWriter HCatRecord HCatSchema -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-5203) FunctionRegistry.getMethodInternal() should prefer method arguments with closer affinity to the original argument types
[ https://issues.apache.org/jira/browse/HIVE-5203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758613#comment-13758613 ] Edward Capriolo commented on HIVE-5203: --- Cool looks good +1 FunctionRegistry.getMethodInternal() should prefer method arguments with closer affinity to the original argument types --- Key: HIVE-5203 URL: https://issues.apache.org/jira/browse/HIVE-5203 Project: Hive Issue Type: Bug Components: Types, UDF Reporter: Jason Dere Assignee: Jason Dere Attachments: HIVE-5203.1.patch, HIVE-5203.2.patch When the function registry is trying to determine the best version of UDF evaluate() to use based on a set of arguments passed in, it should prefer methods where the argument types are more related to the original types. For example if varchar is used with UDFFromUnixTime(), varchar is convertible to both the double and string versions of evaluate() for that UDF. In this case we would prefer that the function registry select the string version over the double version, since varchar and string are both string types. This doesn't really affect any of the existing types, but comes into play with the addition of the varchar type (HIVE-4844). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira