[jira] [Commented] (HIVE-6430) MapJoin hash table has large memory overhead

2014-05-02 Thread Lefty Leverenz (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-6430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13987458#comment-13987458
 ] 

Lefty Leverenz commented on HIVE-6430:
--

Thanks for the parameter descriptions in hive-default.xml.template.  But patch 
12 has a duplicate description for hive.mapjoin.optimized.hashtable.

 MapJoin hash table has large memory overhead
 

 Key: HIVE-6430
 URL: https://issues.apache.org/jira/browse/HIVE-6430
 Project: Hive
  Issue Type: Improvement
Reporter: Sergey Shelukhin
Assignee: Sergey Shelukhin
 Attachments: HIVE-6430.01.patch, HIVE-6430.02.patch, 
 HIVE-6430.03.patch, HIVE-6430.04.patch, HIVE-6430.05.patch, 
 HIVE-6430.06.patch, HIVE-6430.07.patch, HIVE-6430.08.patch, 
 HIVE-6430.09.patch, HIVE-6430.10.patch, HIVE-6430.11.patch, 
 HIVE-6430.12.patch, HIVE-6430.12.patch, HIVE-6430.patch


 Right now, in some queries, I see that storing e.g. 4 ints (2 for key and 2 
 for row) can take several hundred bytes, which is ridiculous. I am reducing 
 the size of MJKey and MJRowContainer in other jiras, but in general we don't 
 need to have java hash table there.  We can either use primitive-friendly 
 hashtable like the one from HPPC (Apache-licenced), or some variation, to map 
 primitive keys to single row storage structure without an object per row 
 (similar to vectorization).



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-6828) Hive tez bucket map join conversion interferes with map join conversion

2014-05-02 Thread Hive QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-6828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13987519#comment-13987519
 ] 

Hive QA commented on HIVE-6828:
---



{color:red}Overall{color}: -1 at least one tests failed

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12642970/HIVE-6828.2.patch

{color:red}ERROR:{color} -1 due to 6 failed/errored test(s), 5427 tests executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_groupby2_map_skew
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_stats_list_bucket
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_stats_partscan_1_23
org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_root_dir_external_table
org.apache.hadoop.hive.cli.TestNegativeCliDriver.testNegativeCliDriver_dynamic_partitions_with_whitelist
org.apache.hadoop.hive.cli.TestNegativeCliDriver.testNegativeCliDriver_stats_partialscan_autogether
{noformat}

Test results: 
http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-Build/101/testReport
Console output: 
http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-Build/101/console

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 6 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12642970

 Hive tez bucket map join conversion interferes with map join conversion
 ---

 Key: HIVE-6828
 URL: https://issues.apache.org/jira/browse/HIVE-6828
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0, 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-6828.1.patch, HIVE-6828.2.patch


 The issue is that bucket count is used for checking the scaled down size of 
 the hash tables but is used later on to convert to the map join as well which 
 may be incorrect in cases where the entire hash table does not fit in the 
 specified size.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-5092) Fix hiveserver2 mapreduce local job on Windows

2014-05-02 Thread Hive QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-5092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13987650#comment-13987650
 ] 

Hive QA commented on HIVE-5092:
---



{color:red}Overall{color}: -1 at least one tests failed

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12642884/HIVE-5092.4.patch

{color:red}ERROR:{color} -1 due to 6 failed/errored test(s), 5428 tests executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_groupby2_map_skew
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_stats_list_bucket
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_stats_partscan_1_23
org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_root_dir_external_table
org.apache.hadoop.hive.cli.TestNegativeCliDriver.testNegativeCliDriver_dynamic_partitions_with_whitelist
org.apache.hadoop.hive.cli.TestNegativeCliDriver.testNegativeCliDriver_stats_partialscan_autogether
{noformat}

Test results: 
http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-Build/102/testReport
Console output: 
http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-Build/102/console

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 6 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12642884

 Fix hiveserver2 mapreduce local job on Windows
 --

 Key: HIVE-5092
 URL: https://issues.apache.org/jira/browse/HIVE-5092
 Project: Hive
  Issue Type: Bug
  Components: HiveServer2, Windows
Affects Versions: 0.14.0
Reporter: Daniel Dai
Assignee: Hari Sankar Sivarama Subramaniyan
 Fix For: 0.14.0

 Attachments: HIVE-5092-1.patch, HIVE-5092.2.patch, HIVE-5092.3.patch, 
 HIVE-5092.4.patch


 Hiveserver2 fail on Mapreduce local job fail. For example:
 {code}
 select /*+ MAPJOIN(v) */ registration from studenttab10k s join votertab10k v 
 on (s.name = v.name);
 {code}
 The root cause is class not found in the local hadoop job 
 (MapredLocalTask.execute). HADOOP_CLASSPATH does not include $HIVE_HOME/lib. 
 Set HADOOP_CLASSPATH correctly will fix the issue.
 However, there is one complexity in Windows. We start Hiveserver2 using 
 Windows service console (services.msc), which takes hiveserver2.xml generated 
 by hive.cmd. There is no way to pass environment variable in hiveserver2.xml 
 (weird but reality). I attach a patch which pass it through command line 
 arguments and relay to HADOOP_CLASSPATH in Hive code. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6995) GenericUDFBridge should log exception when it is unable to instantiate UDF object

2014-05-02 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-6995:
---

   Resolution: Fixed
Fix Version/s: 0.14.0
   Status: Resolved  (was: Patch Available)

Committed to trunk. Thanks, Jason!

 GenericUDFBridge should log exception when it is unable to instantiate UDF 
 object
 -

 Key: HIVE-6995
 URL: https://issues.apache.org/jira/browse/HIVE-6995
 Project: Hive
  Issue Type: Bug
  Components: UDF
Reporter: Jason Dere
Assignee: Jason Dere
 Fix For: 0.14.0

 Attachments: HIVE-6995.1.patch


 If GenericUDFBridge.initialize() is unable to create an instance of the UDF 
 class, it logs a blanket error message of The UDF implementation class udf 
 is not present in the class path.  This can make it difficult to determine 
 what the actual error is as there can be other issues besides the class not 
 being in the class path.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6868) Create table in HCatalog sets different SerDe defaults than what is set through the CLI

2014-05-02 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-6868:
---

   Resolution: Fixed
Fix Version/s: 0.14.0
   Status: Resolved  (was: Patch Available)

Committed to trunk. Thanks, Harish!

 Create table in HCatalog sets different SerDe defaults than what is set 
 through the CLI
 ---

 Key: HIVE-6868
 URL: https://issues.apache.org/jira/browse/HIVE-6868
 Project: Hive
  Issue Type: Bug
  Components: HCatalog
Reporter: Harish Butani
Assignee: Harish Butani
 Fix For: 0.14.0

 Attachments: HIVE-6868.1.patch, HIVE-6868.2.patch, HIVE-6868.3.patch


 HCatCreateTableDesc doesn't invoke the getEmptyTable function on 
 org.apache.hadoop.hive.ql.metadata.Table



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-7006) Fix ql_rewrite_gbtoidx.q output file

2014-05-02 Thread Ashutosh Chauhan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13987828#comment-13987828
 ] 

Ashutosh Chauhan commented on HIVE-7006:


+1

 Fix ql_rewrite_gbtoidx.q output file
 

 Key: HIVE-7006
 URL: https://issues.apache.org/jira/browse/HIVE-7006
 Project: Hive
  Issue Type: Bug
Reporter: Harish Butani
Assignee: Harish Butani
 Attachments: HIVE-7006.1.patch


 HIVE-4904 moved GroupBy Optimization to happen before ColumnPruner
 Now, the RewriteGBUsingIndex happens after GroupByOptimization.
 So  setting of bucketGroup flag doesn't happen for Index tables.
 Temporary fix is to update .q.out file.
 Will file a bug to look into moving RewriteGBUsingIndex before 
 GroupByOptimizer. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-6960) Set Hive pom to use Hadoop-2.4

2014-05-02 Thread Hive QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-6960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13987877#comment-13987877
 ] 

Hive QA commented on HIVE-6960:
---



{color:red}Overall{color}: -1 at least one tests failed

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12642785/HIVE-6960.1.patch

{color:red}ERROR:{color} -1 due to 12 failed/errored test(s), 5428 tests 
executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_auto_join32
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_groupby2_map_skew
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_mapjoin_test_outer
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_nullgroup3
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_ppd_join4
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_select_dummy_source
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_stats_list_bucket
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_stats_partscan_1_23
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_symlink_text_input_format
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_udf_current_database
org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_root_dir_external_table
org.apache.hadoop.hive.cli.TestNegativeCliDriver.testNegativeCliDriver_dynamic_partitions_with_whitelist
{noformat}

Test results: 
http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-Build/103/testReport
Console output: 
http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-Build/103/console

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 12 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12642785

 Set Hive pom to use Hadoop-2.4
 --

 Key: HIVE-6960
 URL: https://issues.apache.org/jira/browse/HIVE-6960
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Affects Versions: 0.14.0
Reporter: Jason Dere
Assignee: Jason Dere
 Attachments: HIVE-6960.1.patch, HIVE-6960.1.patch


 A number of the hadoop-2 unit test failures are due to HADOOP-10425, fixed in 
 Hadoop 2.4.  Perhaps we should move onto that version.
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_nullgroup3
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_ppd_join4
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_select_dummy_source
 - 
 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_symlink_text_input_format
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_udf_current_database
 - 
 org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_bucketizedhiveinputformat



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6960) Set Hive pom to use Hadoop-2.4

2014-05-02 Thread Jason Dere (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Dere updated HIVE-6960:
-

Status: Open  (was: Patch Available)

Will take a look at the q file failures, it's possible that these golden files 
were changed in HIVE-6979 to have stats values that reflected the bug in 
HADOOP-10425.

 Set Hive pom to use Hadoop-2.4
 --

 Key: HIVE-6960
 URL: https://issues.apache.org/jira/browse/HIVE-6960
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Affects Versions: 0.14.0
Reporter: Jason Dere
Assignee: Jason Dere
 Attachments: HIVE-6960.1.patch, HIVE-6960.1.patch


 A number of the hadoop-2 unit test failures are due to HADOOP-10425, fixed in 
 Hadoop 2.4.  Perhaps we should move onto that version.
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_nullgroup3
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_ppd_join4
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_select_dummy_source
 - 
 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_symlink_text_input_format
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_udf_current_database
 - 
 org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_bucketizedhiveinputformat



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-2777) ability to add and drop partitions atomically

2014-05-02 Thread Steven Wong (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13988029#comment-13988029
 ] 

Steven Wong commented on HIVE-2777:
---

ASF.LICENSE.NOT.GRANTED--HIVE-2777.D2271.1.patch was created by Netflix. We at 
Netflix intend it to be freely used according to the Apache license.

 ability to add and drop partitions atomically
 -

 Key: HIVE-2777
 URL: https://issues.apache.org/jira/browse/HIVE-2777
 Project: Hive
  Issue Type: New Feature
  Components: Metastore
Reporter: Aniket Mokashi
Assignee: Aniket Mokashi
 Attachments: ASF.LICENSE.NOT.GRANTED--HIVE-2777.D2271.1.patch


 Hive should have ability to atomically add and drop partitions. This way 
 admins can change partitions atomically without breaking the running jobs. It 
 allows admin to merge several partitions into one.
 Essentially, we would like to have an api- add_drop_partitions(String db, 
 String tbl_name, ListPartition addParts, ListListString dropParts, 
 boolean deleteData);
 This jira covers changes required for metastore and thrift.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Created] (HIVE-7008) Clean-up some old dead code

2014-05-02 Thread Ashutosh Chauhan (JIRA)
Ashutosh Chauhan created HIVE-7008:
--

 Summary: Clean-up some old dead code
 Key: HIVE-7008
 URL: https://issues.apache.org/jira/browse/HIVE-7008
 Project: Hive
  Issue Type: Task
  Components: Tests
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan


There is some code to workaround limitations in historic Hadoop (hadoop-17  
earlier). Lets get rid of those.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7008) Clean-up some old dead code

2014-05-02 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-7008:
---

Attachment: HIVE-7008.patch

 Clean-up some old dead code
 ---

 Key: HIVE-7008
 URL: https://issues.apache.org/jira/browse/HIVE-7008
 Project: Hive
  Issue Type: Task
  Components: Tests
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Attachments: HIVE-7008.patch


 There is some code to workaround limitations in historic Hadoop (hadoop-17  
 earlier). Lets get rid of those.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7008) Clean-up some old dead code

2014-05-02 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-7008:
---

Status: Patch Available  (was: Open)

 Clean-up some old dead code
 ---

 Key: HIVE-7008
 URL: https://issues.apache.org/jira/browse/HIVE-7008
 Project: Hive
  Issue Type: Task
  Components: Tests
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Attachments: HIVE-7008.patch


 There is some code to workaround limitations in historic Hadoop (hadoop-17  
 earlier). Lets get rid of those.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-7008) Clean-up some old dead code

2014-05-02 Thread Ashutosh Chauhan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13988040#comment-13988040
 ] 

Ashutosh Chauhan commented on HIVE-7008:


As a first step, mark now-obsolete classes as Deprecated and delete tests. Next 
release we should delete classes as well.

 Clean-up some old dead code
 ---

 Key: HIVE-7008
 URL: https://issues.apache.org/jira/browse/HIVE-7008
 Project: Hive
  Issue Type: Task
  Components: Tests
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Attachments: HIVE-7008.patch


 There is some code to workaround limitations in historic Hadoop (hadoop-17  
 earlier). Lets get rid of those.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


Review Request 21016: Deprecate class and remove tests.

2014-05-02 Thread Ashutosh Chauhan

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/21016/
---

Review request for hive and Jason Dere.


Bugs: HIVE-7008
https://issues.apache.org/jira/browse/HIVE-7008


Repository: hive-git


Description
---

Marked class as Deprecated.
Removed tests for now obsolete functionality.


Diffs
-

  ql/src/java/org/apache/hadoop/hive/ql/io/FlatFileInputFormat.java b9b151a 
  ql/src/test/org/apache/hadoop/hive/ql/io/RecordTestObj.java 1bb846b 
  ql/src/test/org/apache/hadoop/hive/ql/io/TestFlatFileInputFormat.java c46d754 

Diff: https://reviews.apache.org/r/21016/diff/


Testing
---


Thanks,

Ashutosh Chauhan



Re: SMB join bug

2014-05-02 Thread Thejas Nair
It is possible that you hit this issue  -
https://issues.apache.org/jira/browse/HIVE-5973
It is fixed in apache hive 0.13 release.


On Thu, May 1, 2014 at 7:10 PM, Sukhendu Chakraborty
sukhendu.chakrabo...@gmail.com wrote:
 I am seeing very different number of rows in this query output depending on
 whether I enable SMB join:

 select count(*)
 from dss.hist_hshld_profl_mc  a
   join
   dss.hshld_summary_mc b
on a.hh_key = b.hh_key
  where ('2012-02-27' between a.hshld_profl_eff_dt and a.hshld_profl_exp_dt)
   and a.hshld_exp_dt='-12-31'
and trim(a.cntry_id) = 'USA'

 The SMB join returns 60 rows (wrong value) while the regular join returns
 30million plus rows (correct value).

 Is there a known issue/jira for this? We are using CDH5.0/hive-0.12.

 -Sukhendu

-- 
CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to 
which it is addressed and may contain information that is confidential, 
privileged and exempt from disclosure under applicable law. If the reader 
of this message is not the intended recipient, you are hereby notified that 
any printing, copying, dissemination, distribution, disclosure or 
forwarding of this communication is strictly prohibited. If you have 
received this communication in error, please contact the sender immediately 
and delete it from your system. Thank You.


Re: Review Request 21016: Deprecate class and remove tests.

2014-05-02 Thread Jason Dere

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/21016/#review42045
---


Also delete JavaTestObjFlatFileInputFormat which is being used by the deleted 
test.

- Jason Dere


On May 2, 2014, 6:19 p.m., Ashutosh Chauhan wrote:
 
 ---
 This is an automatically generated e-mail. To reply, visit:
 https://reviews.apache.org/r/21016/
 ---
 
 (Updated May 2, 2014, 6:19 p.m.)
 
 
 Review request for hive and Jason Dere.
 
 
 Bugs: HIVE-7008
 https://issues.apache.org/jira/browse/HIVE-7008
 
 
 Repository: hive-git
 
 
 Description
 ---
 
 Marked class as Deprecated.
 Removed tests for now obsolete functionality.
 
 
 Diffs
 -
 
   ql/src/java/org/apache/hadoop/hive/ql/io/FlatFileInputFormat.java b9b151a 
   ql/src/test/org/apache/hadoop/hive/ql/io/RecordTestObj.java 1bb846b 
   ql/src/test/org/apache/hadoop/hive/ql/io/TestFlatFileInputFormat.java 
 c46d754 
 
 Diff: https://reviews.apache.org/r/21016/diff/
 
 
 Testing
 ---
 
 
 Thanks,
 
 Ashutosh Chauhan
 




[jira] [Commented] (HIVE-7001) fs.permissions.umask-mode is getting unset when Session is started

2014-05-02 Thread Thejas M Nair (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13988089#comment-13988089
 ] 

Thejas M Nair commented on HIVE-7001:
-

Ran parquet_ctas locally and it passed, rest of the test failures are unrelated.


 fs.permissions.umask-mode is getting unset when Session is started
 --

 Key: HIVE-7001
 URL: https://issues.apache.org/jira/browse/HIVE-7001
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.13.0
Reporter: Thejas M Nair
Assignee: Thejas M Nair
 Attachments: HIVE-7001.1.patch, HIVE-7001.2.patch, HIVE-7001.3.patch


 {code}
 hive set fs.permissions.umask-mode;
 fs.permissions.umask-mode=022
 hive show tables;
 OK
 t1
 Time taken: 0.301 seconds, Fetched: 1 row(s)
 hive set fs.permissions.umask-mode;
 fs.permissions.umask-mode is undefined
 {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-7004) Fix more unit test failures on hadoop-2

2014-05-02 Thread Hive QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13988098#comment-13988098
 ] 

Hive QA commented on HIVE-7004:
---



{color:red}Overall{color}: -1 at least one tests failed

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12642963/HIVE-7004.1.patch

{color:red}ERROR:{color} -1 due to 5 failed/errored test(s), 5495 tests executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_stats_partscan_1_23
org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver_load_dyn_part1
org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver_tez_dml
org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver_tez_union
org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_root_dir_external_table
{noformat}

Test results: 
http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-Build/105/testReport
Console output: 
http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-Build/105/console

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 5 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12642963

 Fix more unit test failures on hadoop-2
 ---

 Key: HIVE-7004
 URL: https://issues.apache.org/jira/browse/HIVE-7004
 Project: Hive
  Issue Type: Bug
  Components: Tests
Reporter: Jason Dere
Assignee: Jason Dere
 Attachments: HIVE-7004.1.patch


 Still a number of precommit failures with hadoop-2, will try to fix some of 
 them.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6901) Explain plan doesn't show operator tree for the fetch operator

2014-05-02 Thread Xuefu Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xuefu Zhang updated HIVE-6901:
--

Attachment: HIVE-6109.10.patch

 Explain plan doesn't show operator tree for the fetch operator
 --

 Key: HIVE-6901
 URL: https://issues.apache.org/jira/browse/HIVE-6901
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.12.0
Reporter: Xuefu Zhang
Assignee: Xuefu Zhang
Priority: Minor
 Attachments: HIVE-6109.10.patch, HIVE-6901.1.patch, 
 HIVE-6901.2.patch, HIVE-6901.3.patch, HIVE-6901.4.patch, HIVE-6901.5.patch, 
 HIVE-6901.6.patch, HIVE-6901.7.patch, HIVE-6901.8.patch, HIVE-6901.9.patch, 
 HIVE-6901.patch


 Explaining a simple select query that involves a MR phase doesn't show 
 processor tree for the fetch operator.
 {code}
 hive explain select d from test;
 OK
 STAGE DEPENDENCIES:
   Stage-1 is a root stage
   Stage-0 is a root stage
 STAGE PLANS:
   Stage: Stage-1
 Map Reduce
   Map Operator Tree:
 ...
   Stage: Stage-0
 Fetch Operator
   limit: -1
 {code}
 It would be nice if the operator tree is shown even if there is only one node.
 Please note that in local execution, the operator tree is complete:
 {code}
 hive explain select * from test;
 OK
 STAGE DEPENDENCIES:
   Stage-0 is a root stage
 STAGE PLANS:
   Stage: Stage-0
 Fetch Operator
   limit: -1
   Processor Tree:
 TableScan
   alias: test
   Statistics: Num rows: 8 Data size: 34 Basic stats: COMPLETE Column 
 stats: NONE
   Select Operator
 expressions: d (type: int)
 outputColumnNames: _col0
 Statistics: Num rows: 8 Data size: 34 Basic stats: COMPLETE 
 Column stats: NONE
 ListSink
 {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7001) fs.permissions.umask-mode is getting unset when Session is started

2014-05-02 Thread Thejas M Nair (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thejas M Nair updated HIVE-7001:


   Resolution: Fixed
Fix Version/s: 0.14.0
   Status: Resolved  (was: Patch Available)

Patch committed to trunk.
Thanks for the reviews Ashutosh  Vikram!


 fs.permissions.umask-mode is getting unset when Session is started
 --

 Key: HIVE-7001
 URL: https://issues.apache.org/jira/browse/HIVE-7001
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.13.0
Reporter: Thejas M Nair
Assignee: Thejas M Nair
 Fix For: 0.14.0

 Attachments: HIVE-7001.1.patch, HIVE-7001.2.patch, HIVE-7001.3.patch


 {code}
 hive set fs.permissions.umask-mode;
 fs.permissions.umask-mode=022
 hive show tables;
 OK
 t1
 Time taken: 0.301 seconds, Fetched: 1 row(s)
 hive set fs.permissions.umask-mode;
 fs.permissions.umask-mode is undefined
 {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6946) Make it easier to run WebHCat e2e tests

2014-05-02 Thread Thejas M Nair (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thejas M Nair updated HIVE-6946:


   Resolution: Fixed
Fix Version/s: 0.14.0
   Status: Resolved  (was: Patch Available)

Patch committed to trunk.
Thanks for the contribution Eugene!


 Make it easier to run WebHCat e2e tests
 ---

 Key: HIVE-6946
 URL: https://issues.apache.org/jira/browse/HIVE-6946
 Project: Hive
  Issue Type: Bug
  Components: WebHCat
Affects Versions: 0.14.0
Reporter: Eugene Koifman
Assignee: Eugene Koifman
 Fix For: 0.14.0

 Attachments: HIVE-6946.2.patch, HIVE-6946.3.patch, HIVE-6946.patch


 Right now hcatalog/src/test/e2e/templeton/README.txt explains the steps to 
 set up WebHCat e2e tests but it's cumbersome and error prone.  Need to make 
 some improvements here.
 The high level goal here is to simplify code-compile-test loop for WebHCat.  
 NO PRECOMMIT TESTS



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6960) Set Hive pom to use Hadoop-2.4

2014-05-02 Thread Jason Dere (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Dere updated HIVE-6960:
-

Attachment: HIVE-6960.2.patch

patch v2 updates a number of diffs that were changed by HIVE-6979

 Set Hive pom to use Hadoop-2.4
 --

 Key: HIVE-6960
 URL: https://issues.apache.org/jira/browse/HIVE-6960
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Affects Versions: 0.14.0
Reporter: Jason Dere
Assignee: Jason Dere
 Attachments: HIVE-6960.1.patch, HIVE-6960.1.patch, HIVE-6960.2.patch


 A number of the hadoop-2 unit test failures are due to HADOOP-10425, fixed in 
 Hadoop 2.4.  Perhaps we should move onto that version.
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_nullgroup3
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_ppd_join4
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_select_dummy_source
 - 
 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_symlink_text_input_format
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_udf_current_database
 - 
 org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_bucketizedhiveinputformat



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6960) Set Hive pom to use Hadoop-2.4

2014-05-02 Thread Jason Dere (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Dere updated HIVE-6960:
-

Status: Patch Available  (was: Open)

 Set Hive pom to use Hadoop-2.4
 --

 Key: HIVE-6960
 URL: https://issues.apache.org/jira/browse/HIVE-6960
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Affects Versions: 0.14.0
Reporter: Jason Dere
Assignee: Jason Dere
 Attachments: HIVE-6960.1.patch, HIVE-6960.1.patch, HIVE-6960.2.patch


 A number of the hadoop-2 unit test failures are due to HADOOP-10425, fixed in 
 Hadoop 2.4.  Perhaps we should move onto that version.
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_nullgroup3
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_ppd_join4
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_select_dummy_source
 - 
 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_symlink_text_input_format
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_udf_current_database
 - 
 org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_bucketizedhiveinputformat



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Created] (HIVE-7009) HIVE_USER_INSTALL_DIR could not bet set to non-HDFS filesystem

2014-05-02 Thread Chuan Liu (JIRA)
Chuan Liu created HIVE-7009:
---

 Summary: HIVE_USER_INSTALL_DIR could not bet set to non-HDFS 
filesystem
 Key: HIVE-7009
 URL: https://issues.apache.org/jira/browse/HIVE-7009
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0
Reporter: Chuan Liu


In {{hive/ql/exec/tez/DagUtils.java}}, we enforce the user path get from 
{{HIVE_USER_INSTALL_DIR}} to be HDFS. This makes it impossible to run Hive+Tez 
jobs on non-HDFS filesystem, e.g. WASB. Relevant code are as follows:
{noformat}
  public Path getDefaultDestDir(Configuration conf) throws LoginException, 
IOException {
UserGroupInformation ugi = ShimLoader.getHadoopShims().getUGIForConf(conf);
String userName = ShimLoader.getHadoopShims().getShortUserName(ugi);
String userPathStr = HiveConf.getVar(conf, 
HiveConf.ConfVars.HIVE_USER_INSTALL_DIR);
Path userPath = new Path(userPathStr);
FileSystem fs = userPath.getFileSystem(conf);
if (!(fs instanceof DistributedFileSystem)) {
  throw new IOException(ErrorMsg.INVALID_HDFS_URI.format(userPathStr));
}
{noformat}

Exceptions running jobs with defaultFs configured to WASB.
{noformat}
2014-05-01 00:21:39,847 ERROR exec.Task (TezTask.java:execute(192)) - Failed to 
execute tez graph.
java.io.IOException: 
wasb://hdi31-chuan...@clhdistorage.blob.core.windows.net/user is not a hdfs uri
at 
org.apache.hadoop.hive.ql.exec.tez.DagUtils.getDefaultDestDir(DagUtils.java:662)
at 
org.apache.hadoop.hive.ql.exec.tez.DagUtils.getHiveJarDirectory(DagUtils.java:759)
at 
org.apache.hadoop.hive.ql.exec.tez.TezSessionState.createJarLocalResource(TezSessionState.java:321)
at 
org.apache.hadoop.hive.ql.exec.tez.TezSessionState.open(TezSessionState.java:159)
at org.apache.hadoop.hive.ql.exec.tez.TezTask.execute(TezTask.java:154)
at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:153)
at 
org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:85)
at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1504)
at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1271)
at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1089)
at org.apache.hadoop.hive.ql.Driver.run(Driver.java:912)
at org.apache.hadoop.hive.ql.Driver.run(Driver.java:902)
at 
org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:268)
at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:220)
at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:423)
at 
org.apache.hadoop.hive.cli.CliDriver.executeDriver(CliDriver.java:793)
at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:686)
at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:625)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:601)
at org.apache.hadoop.util.RunJar.main(RunJar.java:212)
{noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Created] (HIVE-7010) templeton/v1/queue REST method has been removed

2014-05-02 Thread Eugene Koifman (JIRA)
Eugene Koifman created HIVE-7010:


 Summary: templeton/v1/queue REST method has been removed
 Key: HIVE-7010
 URL: https://issues.apache.org/jira/browse/HIVE-7010
 Project: Hive
  Issue Type: Bug
  Components: Documentation, WebHCat
Affects Versions: 0.14.0
Reporter: Eugene Koifman


deprecated queue REST method was removed from WebHCat in HIVE-6432.  jobs 
is the replacement.

https://cwiki.apache.org/confluence/display/Hive/WebHCat+Reference needs to be 
updated



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-7004) Fix more unit test failures on hadoop-2

2014-05-02 Thread Ashutosh Chauhan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13988250#comment-13988250
 ] 

Ashutosh Chauhan commented on HIVE-7004:


+1

 Fix more unit test failures on hadoop-2
 ---

 Key: HIVE-7004
 URL: https://issues.apache.org/jira/browse/HIVE-7004
 Project: Hive
  Issue Type: Bug
  Components: Tests
Reporter: Jason Dere
Assignee: Jason Dere
 Attachments: HIVE-7004.1.patch


 Still a number of precommit failures with hadoop-2, will try to fix some of 
 them.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-6960) Set Hive pom to use Hadoop-2.4

2014-05-02 Thread Ashutosh Chauhan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-6960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13988256#comment-13988256
 ] 

Ashutosh Chauhan commented on HIVE-6960:


+1

 Set Hive pom to use Hadoop-2.4
 --

 Key: HIVE-6960
 URL: https://issues.apache.org/jira/browse/HIVE-6960
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Affects Versions: 0.14.0
Reporter: Jason Dere
Assignee: Jason Dere
 Attachments: HIVE-6960.1.patch, HIVE-6960.1.patch, HIVE-6960.2.patch


 A number of the hadoop-2 unit test failures are due to HADOOP-10425, fixed in 
 Hadoop 2.4.  Perhaps we should move onto that version.
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_nullgroup3
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_ppd_join4
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_select_dummy_source
 - 
 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_symlink_text_input_format
 - org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_udf_current_database
 - 
 org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_bucketizedhiveinputformat



--
This message was sent by Atlassian JIRA
(v6.2#6252)


Review Request 21026: Remove HDFS check in tez/DagUtils

2014-05-02 Thread Chuan Liu

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/21026/
---

Review request for hive.


Bugs: HIVE-7009
https://issues.apache.org/jira/browse/HIVE-7009


Repository: hive-git


Description
---

Removed all the DFS checks in the change. They should not impact any 
functionality.


Diffs
-

  ql/src/java/org/apache/hadoop/hive/ql/exec/tez/DagUtils.java abbf38f 

Diff: https://reviews.apache.org/r/21026/diff/


Testing
---

Manually tested on a cluster deployed in Windows Azure environment.


Thanks,

Chuan Liu



[jira] [Updated] (HIVE-7009) HIVE_USER_INSTALL_DIR could not bet set to non-HDFS filesystem

2014-05-02 Thread Chuan Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chuan Liu updated HIVE-7009:


Attachment: HIVE-7009.patch

Attach a patch that remove HDFS checks in tez/DagUtils.java.

 HIVE_USER_INSTALL_DIR could not bet set to non-HDFS filesystem
 --

 Key: HIVE-7009
 URL: https://issues.apache.org/jira/browse/HIVE-7009
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0
Reporter: Chuan Liu
 Attachments: HIVE-7009.patch


 In {{hive/ql/exec/tez/DagUtils.java}}, we enforce the user path get from 
 {{HIVE_USER_INSTALL_DIR}} to be HDFS. This makes it impossible to run 
 Hive+Tez jobs on non-HDFS filesystem, e.g. WASB. Relevant code are as follows:
 {noformat}
   public Path getDefaultDestDir(Configuration conf) throws LoginException, 
 IOException {
 UserGroupInformation ugi = 
 ShimLoader.getHadoopShims().getUGIForConf(conf);
 String userName = ShimLoader.getHadoopShims().getShortUserName(ugi);
 String userPathStr = HiveConf.getVar(conf, 
 HiveConf.ConfVars.HIVE_USER_INSTALL_DIR);
 Path userPath = new Path(userPathStr);
 FileSystem fs = userPath.getFileSystem(conf);
 if (!(fs instanceof DistributedFileSystem)) {
   throw new IOException(ErrorMsg.INVALID_HDFS_URI.format(userPathStr));
 }
 {noformat}
 Exceptions running jobs with defaultFs configured to WASB.
 {noformat}
 2014-05-01 00:21:39,847 ERROR exec.Task (TezTask.java:execute(192)) - Failed 
 to execute tez graph.
 java.io.IOException: 
 wasb://hdi31-chuan...@clhdistorage.blob.core.windows.net/user is not a hdfs 
 uri
   at 
 org.apache.hadoop.hive.ql.exec.tez.DagUtils.getDefaultDestDir(DagUtils.java:662)
   at 
 org.apache.hadoop.hive.ql.exec.tez.DagUtils.getHiveJarDirectory(DagUtils.java:759)
   at 
 org.apache.hadoop.hive.ql.exec.tez.TezSessionState.createJarLocalResource(TezSessionState.java:321)
   at 
 org.apache.hadoop.hive.ql.exec.tez.TezSessionState.open(TezSessionState.java:159)
   at org.apache.hadoop.hive.ql.exec.tez.TezTask.execute(TezTask.java:154)
   at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:153)
   at 
 org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:85)
   at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1504)
   at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1271)
   at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1089)
   at org.apache.hadoop.hive.ql.Driver.run(Driver.java:912)
   at org.apache.hadoop.hive.ql.Driver.run(Driver.java:902)
   at 
 org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:268)
   at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:220)
   at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:423)
   at 
 org.apache.hadoop.hive.cli.CliDriver.executeDriver(CliDriver.java:793)
   at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:686)
   at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:625)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
   at java.lang.reflect.Method.invoke(Method.java:601)
   at org.apache.hadoop.util.RunJar.main(RunJar.java:212)
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6684) Beeline does not accept comments that are preceded by spaces

2014-05-02 Thread Jeremy Beard (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeremy Beard updated HIVE-6684:
---

Attachment: HIVE-6684.1.patch

Added patch with unit test - please review.

 Beeline does not accept comments that are preceded by spaces
 

 Key: HIVE-6684
 URL: https://issues.apache.org/jira/browse/HIVE-6684
 Project: Hive
  Issue Type: Bug
  Components: CLI
Affects Versions: 0.10.0
Reporter: Jeremy Beard
 Attachments: HIVE-6684.1.patch


 Beeline throws an error if single-line comments are indented with spaces. 
 This works in the embedded Hive CLI.
 For example:
 SELECT
-- this is the field we want
field
 FROM
table;
 Error: Error while processing statement: FAILED: ParseException line 1:71 
 cannot recognize input near 'EOF' 'EOF' 'EOF' in select clause 
 (state=42000,code=4)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6684) Beeline does not accept comments that are preceded by spaces

2014-05-02 Thread Jeremy Beard (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeremy Beard updated HIVE-6684:
---

Fix Version/s: 0.14.0
   Status: Patch Available  (was: Open)

 Beeline does not accept comments that are preceded by spaces
 

 Key: HIVE-6684
 URL: https://issues.apache.org/jira/browse/HIVE-6684
 Project: Hive
  Issue Type: Bug
  Components: CLI
Affects Versions: 0.10.0
Reporter: Jeremy Beard
 Fix For: 0.14.0

 Attachments: HIVE-6684.1.patch


 Beeline throws an error if single-line comments are indented with spaces. 
 This works in the embedded Hive CLI.
 For example:
 SELECT
-- this is the field we want
field
 FROM
table;
 Error: Error while processing statement: FAILED: ParseException line 1:71 
 cannot recognize input near 'EOF' 'EOF' 'EOF' in select clause 
 (state=42000,code=4)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-6915) Hive Hbase queries fail on secure Tez cluster

2014-05-02 Thread Craig Condit (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-6915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13988328#comment-13988328
 ] 

Craig Condit commented on HIVE-6915:


Patch applied here, get a different error now:
{noformat}
Vertex failed, vertexName=Map 1, vertexId=vertex_1392942637536_9375_1_00, 
diagnostics=[Vertex Input: hosting_scheduled_jobs initializer failed., 
org.apache.hadoop.hbase.security.AccessDeniedException: Token generation only 
allowed for Kerberos authenticated clients
at 
org.apache.hadoop.hbase.security.token.TokenProvider.getAuthenticationToken(TokenProvider.java:122)
at 
org.apache.hadoop.hbase.protobuf.generated.AuthenticationProtos$AuthenticationService$1.getAuthenticationToken(AuthenticationProtos.java:4267)
at 
org.apache.hadoop.hbase.protobuf.generated.AuthenticationProtos$AuthenticationService.callMethod(AuthenticationProtos.java:4387)
at 
org.apache.hadoop.hbase.regionserver.HRegion.execService(HRegion.java:5088)
at 
org.apache.hadoop.hbase.regionserver.HRegionServer.execService(HRegionServer.java:3197)
at 
org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:26933)
at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2146)
at 
org.apache.hadoop.hbase.ipc.RpcServer$Handler.run(RpcServer.java:1851)
]
14/05/02 15:06:21 ERROR tez.TezJobMonitor: Vertex failed, vertexName=Map 1, 
vertexId=vertex_1392942637536_9375_1_00, diagnostics=[Vertex Input: 
hosting_scheduled_jobs initializer failed., 
org.apache.hadoop.hbase.security.AccessDeniedException: Token generation only 
allowed for Kerberos authenticated clients
at 
org.apache.hadoop.hbase.security.token.TokenProvider.getAuthenticationToken(TokenProvider.java:122)
at 
org.apache.hadoop.hbase.protobuf.generated.AuthenticationProtos$AuthenticationService$1.getAuthenticationToken(AuthenticationProtos.java:4267)
at 
org.apache.hadoop.hbase.protobuf.generated.AuthenticationProtos$AuthenticationService.callMethod(AuthenticationProtos.java:4387)
at 
org.apache.hadoop.hbase.regionserver.HRegion.execService(HRegion.java:5088)
at 
org.apache.hadoop.hbase.regionserver.HRegionServer.execService(HRegionServer.java:3197)
at 
org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:26933)
at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2146)
at 
org.apache.hadoop.hbase.ipc.RpcServer$Handler.run(RpcServer.java:1851)
]
DAG failed due to vertex failure. failedVertices:1 killedVertices:0
14/05/02 15:06:21 ERROR tez.TezJobMonitor: DAG failed due to vertex failure. 
failedVertices:1 killedVertices:0
FAILED: Execution Error, return code 2 from 
org.apache.hadoop.hive.ql.exec.tez.TezTask
14/05/02 15:06:21 ERROR ql.Driver: FAILED: Execution Error, return code 2 from 
org.apache.hadoop.hive.ql.exec.tez.TezTask
{noformat}

 Hive Hbase queries fail on secure Tez cluster
 -

 Key: HIVE-6915
 URL: https://issues.apache.org/jira/browse/HIVE-6915
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0
 Environment: Kerberos secure Tez cluster
Reporter: Deepesh Khandelwal
Assignee: Siddharth Seth
 Attachments: HIVE-6915.1.patch


 Hive queries reading and writing to HBase are currently failing with the 
 following exception in a secure Tez cluster:
 {noformat}
 2014-04-14 13:47:05,644 FATAL [InputInitializer [Map 1] #0] 
 org.apache.hadoop.ipc.RpcClient: SASL authentication failed. The most likely 
 cause is missing or invalid credentials. Consider 'kinit'.
 javax.security.sasl.SaslException: GSS initiate failed [Caused by 
 GSSException: No valid credentials provided (Mechanism level: Failed to find 
 any Kerberos tgt)]
   at 
 com.sun.security.sasl.gsskerb.GssKrb5Client.evaluateChallenge(GssKrb5Client.java:212)
   at 
 org.apache.hadoop.hbase.security.HBaseSaslRpcClient.saslConnect(HBaseSaslRpcClient.java:152)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection.setupSaslConnection(RpcClient.java:792)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection.access$800(RpcClient.java:349)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection$2.run(RpcClient.java:918)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection$2.run(RpcClient.java:915)
   at java.security.AccessController.doPrivileged(Native Method)
   at javax.security.auth.Subject.doAs(Subject.java:415)
   at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1557)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection.setupIOstreams(RpcClient.java:915)
   at 
 

[jira] [Updated] (HIVE-6684) Beeline does not accept comments that are preceded by spaces

2014-05-02 Thread Jeremy Beard (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeremy Beard updated HIVE-6684:
---

Fix Version/s: (was: 0.14.0)

 Beeline does not accept comments that are preceded by spaces
 

 Key: HIVE-6684
 URL: https://issues.apache.org/jira/browse/HIVE-6684
 Project: Hive
  Issue Type: Bug
  Components: CLI
Affects Versions: 0.10.0
Reporter: Jeremy Beard
 Attachments: HIVE-6684.1.patch


 Beeline throws an error if single-line comments are indented with spaces. 
 This works in the embedded Hive CLI.
 For example:
 SELECT
-- this is the field we want
field
 FROM
table;
 Error: Error while processing statement: FAILED: ParseException line 1:71 
 cannot recognize input near 'EOF' 'EOF' 'EOF' in select clause 
 (state=42000,code=4)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-7006) Fix ql_rewrite_gbtoidx.q output file

2014-05-02 Thread Hive QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13988368#comment-13988368
 ] 

Hive QA commented on HIVE-7006:
---



{color:red}Overall{color}: -1 at least one tests failed

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12642980/HIVE-7006.1.patch

{color:red}ERROR:{color} -1 due to 6 failed/errored test(s), 5429 tests executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_groupby2_map_skew
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_stats_list_bucket
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_stats_partscan_1_23
org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_root_dir_external_table
org.apache.hadoop.hive.cli.TestNegativeCliDriver.testNegativeCliDriver_dynamic_partitions_with_whitelist
org.apache.hadoop.hive.cli.TestNegativeCliDriver.testNegativeCliDriver_stats_partialscan_autogether
{noformat}

Test results: 
http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-Build/107/testReport
Console output: 
http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-Build/107/console

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 6 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12642980

 Fix ql_rewrite_gbtoidx.q output file
 

 Key: HIVE-7006
 URL: https://issues.apache.org/jira/browse/HIVE-7006
 Project: Hive
  Issue Type: Bug
Reporter: Harish Butani
Assignee: Harish Butani
 Attachments: HIVE-7006.1.patch


 HIVE-4904 moved GroupBy Optimization to happen before ColumnPruner
 Now, the RewriteGBUsingIndex happens after GroupByOptimization.
 So  setting of bucketGroup flag doesn't happen for Index tables.
 Temporary fix is to update .q.out file.
 Will file a bug to look into moving RewriteGBUsingIndex before 
 GroupByOptimizer. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-6828) Hive tez bucket map join conversion interferes with map join conversion

2014-05-02 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-6828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13988376#comment-13988376
 ] 

Gunther Hagleitner commented on HIVE-6828:
--

No new test failures (build 100 had the same errors - plus one more).


 Hive tez bucket map join conversion interferes with map join conversion
 ---

 Key: HIVE-6828
 URL: https://issues.apache.org/jira/browse/HIVE-6828
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0, 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-6828.1.patch, HIVE-6828.2.patch


 The issue is that bucket count is used for checking the scaled down size of 
 the hash tables but is used later on to convert to the map join as well which 
 may be incorrect in cases where the entire hash table does not fit in the 
 specified size.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-2777) ability to add and drop partitions atomically

2014-05-02 Thread Xinyu Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xinyu Wang updated HIVE-2777:
-

Affects Version/s: 0.13.0
   Status: Patch Available  (was: Open)

This is a rebased patch on top of hive branch-0.13. Please review.

 ability to add and drop partitions atomically
 -

 Key: HIVE-2777
 URL: https://issues.apache.org/jira/browse/HIVE-2777
 Project: Hive
  Issue Type: New Feature
  Components: Metastore
Affects Versions: 0.13.0
Reporter: Aniket Mokashi
Assignee: Aniket Mokashi
 Attachments: ASF.LICENSE.NOT.GRANTED--HIVE-2777.D2271.1.patch


 Hive should have ability to atomically add and drop partitions. This way 
 admins can change partitions atomically without breaking the running jobs. It 
 allows admin to merge several partitions into one.
 Essentially, we would like to have an api- add_drop_partitions(String db, 
 String tbl_name, ListPartition addParts, ListListString dropParts, 
 boolean deleteData);
 This jira covers changes required for metastore and thrift.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6996) FS based stats broken with indexed tables

2014-05-02 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-6996:
---

Status: Open  (was: Patch Available)

 FS based stats broken with indexed tables
 -

 Key: HIVE-6996
 URL: https://issues.apache.org/jira/browse/HIVE-6996
 Project: Hive
  Issue Type: Bug
  Components: Indexing, Statistics
Affects Versions: 0.13.0
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Attachments: HIVE-6996.patch






--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-2777) ability to add and drop partitions atomically

2014-05-02 Thread Xinyu Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xinyu Wang updated HIVE-2777:
-

Attachment: hive-2777.patch

 ability to add and drop partitions atomically
 -

 Key: HIVE-2777
 URL: https://issues.apache.org/jira/browse/HIVE-2777
 Project: Hive
  Issue Type: New Feature
  Components: Metastore
Affects Versions: 0.13.0
Reporter: Aniket Mokashi
Assignee: Aniket Mokashi
 Attachments: ASF.LICENSE.NOT.GRANTED--HIVE-2777.D2271.1.patch, 
 hive-2777.patch


 Hive should have ability to atomically add and drop partitions. This way 
 admins can change partitions atomically without breaking the running jobs. It 
 allows admin to merge several partitions into one.
 Essentially, we would like to have an api- add_drop_partitions(String db, 
 String tbl_name, ListPartition addParts, ListListString dropParts, 
 boolean deleteData);
 This jira covers changes required for metastore and thrift.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6996) FS based stats broken with indexed tables

2014-05-02 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-6996:
---

Attachment: HIVE-6996.2.patch

Fixed failing tests.

 FS based stats broken with indexed tables
 -

 Key: HIVE-6996
 URL: https://issues.apache.org/jira/browse/HIVE-6996
 Project: Hive
  Issue Type: Bug
  Components: Indexing, Statistics
Affects Versions: 0.13.0
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Attachments: HIVE-6996.2.patch, HIVE-6996.patch






--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6996) FS based stats broken with indexed tables

2014-05-02 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-6996:
---

Status: Patch Available  (was: Open)

 FS based stats broken with indexed tables
 -

 Key: HIVE-6996
 URL: https://issues.apache.org/jira/browse/HIVE-6996
 Project: Hive
  Issue Type: Bug
  Components: Indexing, Statistics
Affects Versions: 0.13.0
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Attachments: HIVE-6996.2.patch, HIVE-6996.patch






--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6828) Hive tez bucket map join conversion interferes with map join conversion

2014-05-02 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-6828:
-

   Resolution: Fixed
Fix Version/s: 0.14.0
   Status: Resolved  (was: Patch Available)

Committed to trunk. Thanks Vikram!

 Hive tez bucket map join conversion interferes with map join conversion
 ---

 Key: HIVE-6828
 URL: https://issues.apache.org/jira/browse/HIVE-6828
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0, 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Fix For: 0.14.0

 Attachments: HIVE-6828.1.patch, HIVE-6828.2.patch


 The issue is that bucket count is used for checking the scaled down size of 
 the hash tables but is used later on to convert to the map join as well which 
 may be incorrect in cases where the entire hash table does not fit in the 
 specified size.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-6915) Hive Hbase queries fail on secure Tez cluster

2014-05-02 Thread Craig Condit (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-6915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13988400#comment-13988400
 ] 

Craig Condit commented on HIVE-6915:


Better stack trace:
{noformat}
org.apache.hadoop.hbase.security.AccessDeniedException: 
org.apache.hadoop.hbase.security.AccessDeniedException: Token generation only 
allowed for Kerberos authenticated clients
at 
org.apache.hadoop.hbase.security.token.TokenProvider.getAuthenticationToken(TokenProvider.java:122)
at 
org.apache.hadoop.hbase.protobuf.generated.AuthenticationProtos$AuthenticationService$1.getAuthenticationToken(AuthenticationProtos.java:4267)
at 
org.apache.hadoop.hbase.protobuf.generated.AuthenticationProtos$AuthenticationService.callMethod(AuthenticationProtos.java:4387)
at 
org.apache.hadoop.hbase.regionserver.HRegion.execService(HRegion.java:5088)
at 
org.apache.hadoop.hbase.regionserver.HRegionServer.execService(HRegionServer.java:3197)
at 
org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:26933)
at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2146)
at 
org.apache.hadoop.hbase.ipc.RpcServer$Handler.run(RpcServer.java:1851)

at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at 
sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57)
at 
sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
at 
org.apache.hadoop.ipc.RemoteException.instantiateException(RemoteException.java:106)
at 
org.apache.hadoop.ipc.RemoteException.unwrapRemoteException(RemoteException.java:95)
at 
org.apache.hadoop.hbase.protobuf.ProtobufUtil.getRemoteException(ProtobufUtil.java:235)
at 
org.apache.hadoop.hbase.protobuf.ProtobufUtil.execService(ProtobufUtil.java:1348)
at 
org.apache.hadoop.hbase.ipc.RegionCoprocessorRpcChannel$1.call(RegionCoprocessorRpcChannel.java:87)
at 
org.apache.hadoop.hbase.ipc.RegionCoprocessorRpcChannel$1.call(RegionCoprocessorRpcChannel.java:84)
at 
org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:116)
at 
org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:94)
at 
org.apache.hadoop.hbase.ipc.RegionCoprocessorRpcChannel.callExecService(RegionCoprocessorRpcChannel.java:90)
at 
org.apache.hadoop.hbase.ipc.CoprocessorRpcChannel.callBlockingMethod(CoprocessorRpcChannel.java:67)
at 
org.apache.hadoop.hbase.protobuf.generated.AuthenticationProtos$AuthenticationService$BlockingStub.getAuthenticationToken(AuthenticationProtos.java:4512)
at 
org.apache.hadoop.hbase.security.token.TokenUtil.obtainToken(TokenUtil.java:60)
at 
org.apache.hadoop.hbase.security.token.TokenUtil$3.run(TokenUtil.java:174)
at 
org.apache.hadoop.hbase.security.token.TokenUtil$3.run(TokenUtil.java:172)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
at 
org.apache.hadoop.hbase.security.token.TokenUtil.obtainTokenForJob(TokenUtil.java:171)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.hadoop.hbase.util.Methods.call(Methods.java:39)
at 
org.apache.hadoop.hbase.security.User$SecureHadoopUser.obtainAuthTokenForJob(User.java:314)
at 
org.apache.hadoop.hbase.mapred.TableMapReduceUtil.initCredentials(TableMapReduceUtil.java:181)
at 
org.apache.hadoop.hive.hbase.HiveHBaseTableInputFormat.getSplits(HiveHBaseTableInputFormat.java:416)
at 
org.apache.hadoop.hive.ql.io.HiveInputFormat.addSplitsForGroup(HiveInputFormat.java:291)
at 
org.apache.hadoop.hive.ql.io.HiveInputFormat.getSplits(HiveInputFormat.java:372)
at 
org.apache.hadoop.mapred.split.TezGroupedSplitsInputFormat.getSplits(TezGroupedSplitsInputFormat.java:68)
at 
org.apache.tez.mapreduce.hadoop.MRHelpers.generateOldSplits(MRHelpers.java:263)
at 
org.apache.tez.mapreduce.common.MRInputAMSplitGenerator.initialize(MRInputAMSplitGenerator.java:139)
at 
org.apache.tez.dag.app.dag.RootInputInitializerRunner$InputInitializerCallable$1.run(RootInputInitializerRunner.java:146)
at 

Re: Review Request 21016: Deprecate class and remove tests.

2014-05-02 Thread Ashutosh Chauhan

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/21016/
---

(Updated May 2, 2014, 11:24 p.m.)


Review request for hive and Jason Dere.


Changes
---

Incorporated Jason feedback to delete another test file.


Bugs: HIVE-7008
https://issues.apache.org/jira/browse/HIVE-7008


Repository: hive-git


Description
---

Marked class as Deprecated.
Removed tests for now obsolete functionality.


Diffs (updated)
-

  ql/src/java/org/apache/hadoop/hive/ql/io/FlatFileInputFormat.java b9b151a 
  ql/src/test/org/apache/hadoop/hive/ql/io/JavaTestObjFlatFileInputFormat.java 
830ecef 
  ql/src/test/org/apache/hadoop/hive/ql/io/RecordTestObj.java 1bb846b 
  ql/src/test/org/apache/hadoop/hive/ql/io/TestFlatFileInputFormat.java c46d754 

Diff: https://reviews.apache.org/r/21016/diff/


Testing
---


Thanks,

Ashutosh Chauhan



[jira] [Updated] (HIVE-7008) Clean-up some old dead code

2014-05-02 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-7008:
---

Status: Open  (was: Patch Available)

 Clean-up some old dead code
 ---

 Key: HIVE-7008
 URL: https://issues.apache.org/jira/browse/HIVE-7008
 Project: Hive
  Issue Type: Task
  Components: Tests
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Attachments: HIVE-7008.1.patch, HIVE-7008.patch


 There is some code to workaround limitations in historic Hadoop (hadoop-17  
 earlier). Lets get rid of those.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7008) Clean-up some old dead code

2014-05-02 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-7008:
---

Attachment: HIVE-7008.1.patch

Incorporated Jason's RB feedback.

 Clean-up some old dead code
 ---

 Key: HIVE-7008
 URL: https://issues.apache.org/jira/browse/HIVE-7008
 Project: Hive
  Issue Type: Task
  Components: Tests
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Attachments: HIVE-7008.1.patch, HIVE-7008.patch


 There is some code to workaround limitations in historic Hadoop (hadoop-17  
 earlier). Lets get rid of those.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7008) Clean-up some old dead code

2014-05-02 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-7008:
---

Status: Patch Available  (was: Open)

 Clean-up some old dead code
 ---

 Key: HIVE-7008
 URL: https://issues.apache.org/jira/browse/HIVE-7008
 Project: Hive
  Issue Type: Task
  Components: Tests
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Attachments: HIVE-7008.1.patch, HIVE-7008.patch


 There is some code to workaround limitations in historic Hadoop (hadoop-17  
 earlier). Lets get rid of those.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-5823) Support for DECIMAL primitive type in AvroSerDe

2014-05-02 Thread Xuefu Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-5823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xuefu Zhang updated HIVE-5823:
--

Attachment: HIVE-5823.4.patch

Patch #4 updated with review comment.

 Support for DECIMAL primitive type in AvroSerDe
 ---

 Key: HIVE-5823
 URL: https://issues.apache.org/jira/browse/HIVE-5823
 Project: Hive
  Issue Type: New Feature
  Components: Serializers/Deserializers
Affects Versions: 0.12.0
Reporter: Mariano Dominguez
Assignee: Xuefu Zhang
  Labels: avro, serde
 Attachments: HIVE-5823.1.patch, HIVE-5823.2.patch, HIVE-5823.3.patch, 
 HIVE-5823.4.patch, HIVE-5823.patch, dec.avro


 This new feature request would be tied to AVRO-1402.
 Adding DECIMAL support would be particularly interesting when converting 
 types from Avro to Hive, since DECIMAL is already a supported data type in 
 Hive.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-7008) Clean-up some old dead code

2014-05-02 Thread Jason Dere (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13988430#comment-13988430
 ] 

Jason Dere commented on HIVE-7008:
--

+1

 Clean-up some old dead code
 ---

 Key: HIVE-7008
 URL: https://issues.apache.org/jira/browse/HIVE-7008
 Project: Hive
  Issue Type: Task
  Components: Tests
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Attachments: HIVE-7008.1.patch, HIVE-7008.patch


 There is some code to workaround limitations in historic Hadoop (hadoop-17  
 earlier). Lets get rid of those.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6915) Hive Hbase queries fail on secure Tez cluster

2014-05-02 Thread Craig Condit (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Craig Condit updated HIVE-6915:
---

Attachment: HIVE-6915.2.patch

New patch version. This version calls TableMapReduceUtil.initCredentials() only 
in the case where the current user is logged in via Kerberos.

 Hive Hbase queries fail on secure Tez cluster
 -

 Key: HIVE-6915
 URL: https://issues.apache.org/jira/browse/HIVE-6915
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0
 Environment: Kerberos secure Tez cluster
Reporter: Deepesh Khandelwal
Assignee: Siddharth Seth
 Attachments: HIVE-6915.1.patch, HIVE-6915.2.patch


 Hive queries reading and writing to HBase are currently failing with the 
 following exception in a secure Tez cluster:
 {noformat}
 2014-04-14 13:47:05,644 FATAL [InputInitializer [Map 1] #0] 
 org.apache.hadoop.ipc.RpcClient: SASL authentication failed. The most likely 
 cause is missing or invalid credentials. Consider 'kinit'.
 javax.security.sasl.SaslException: GSS initiate failed [Caused by 
 GSSException: No valid credentials provided (Mechanism level: Failed to find 
 any Kerberos tgt)]
   at 
 com.sun.security.sasl.gsskerb.GssKrb5Client.evaluateChallenge(GssKrb5Client.java:212)
   at 
 org.apache.hadoop.hbase.security.HBaseSaslRpcClient.saslConnect(HBaseSaslRpcClient.java:152)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection.setupSaslConnection(RpcClient.java:792)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection.access$800(RpcClient.java:349)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection$2.run(RpcClient.java:918)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection$2.run(RpcClient.java:915)
   at java.security.AccessController.doPrivileged(Native Method)
   at javax.security.auth.Subject.doAs(Subject.java:415)
   at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1557)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection.setupIOstreams(RpcClient.java:915)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection.writeRequest(RpcClient.java:1065)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection.tracedWriteRequest(RpcClient.java:1032)
   at org.apache.hadoop.hbase.ipc.RpcClient.call(RpcClient.java:1474)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient.callBlockingMethod(RpcClient.java:1684)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$BlockingRpcChannelImplementation.callBlockingMethod(RpcClient.java:1737)
   at 
 org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$BlockingStub.execService(ClientProtos.java:29288)
   at 
 org.apache.hadoop.hbase.protobuf.ProtobufUtil.execService(ProtobufUtil.java:1562)
   at 
 org.apache.hadoop.hbase.ipc.RegionCoprocessorRpcChannel$1.call(RegionCoprocessorRpcChannel.java:87)
   at 
 org.apache.hadoop.hbase.ipc.RegionCoprocessorRpcChannel$1.call(RegionCoprocessorRpcChannel.java:84)
   at 
 org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:121)
   at 
 org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:97)
   at 
 org.apache.hadoop.hbase.ipc.RegionCoprocessorRpcChannel.callExecService(RegionCoprocessorRpcChannel.java:90)
   at 
 org.apache.hadoop.hbase.ipc.CoprocessorRpcChannel.callBlockingMethod(CoprocessorRpcChannel.java:67)
   at 
 org.apache.hadoop.hbase.protobuf.generated.AuthenticationProtos$AuthenticationService$BlockingStub.getAuthenticationToken(AuthenticationProtos.java:4512)
   at 
 org.apache.hadoop.hbase.security.token.TokenUtil.obtainToken(TokenUtil.java:60)
   at 
 org.apache.hadoop.hbase.security.token.TokenUtil$3.run(TokenUtil.java:174)
   at 
 org.apache.hadoop.hbase.security.token.TokenUtil$3.run(TokenUtil.java:172)
   at java.security.AccessController.doPrivileged(Native Method)
   at javax.security.auth.Subject.doAs(Subject.java:415)
   at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1557)
   at 
 org.apache.hadoop.hbase.security.token.TokenUtil.obtainTokenForJob(TokenUtil.java:171)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
   at java.lang.reflect.Method.invoke(Method.java:601)
   at org.apache.hadoop.hbase.util.Methods.call(Methods.java:39)
   at 
 org.apache.hadoop.hbase.security.User$SecureHadoopUser.obtainAuthTokenForJob(User.java:334)
   at 
 org.apache.hadoop.hbase.mapred.TableMapReduceUtil.initCredentials(TableMapReduceUtil.java:201)
   at 

[jira] [Updated] (HIVE-6826) Hive-tez has issues when different partitions work off of different input types

2014-05-02 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-6826:
-

Status: Open  (was: Patch Available)

 Hive-tez has issues when different partitions work off of different input 
 types
 ---

 Key: HIVE-6826
 URL: https://issues.apache.org/jira/browse/HIVE-6826
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0, 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-6826.1.patch, HIVE-6826.2.patch


 create table test (key int, value string) partitioned by (p int) stored as 
 textfile;
 insert into table test partition (p=1) select * from src limit 10;
 alter table test set fileformat orc;
 insert into table test partition (p=2) select * from src limit 10;
 describe test;
 select * from test where p=1 and key  0;
 select * from test where p=2 and key  0;
 select * from test where key  0;
 throws a classcast exception



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6826) Hive-tez has issues when different partitions work off of different input types

2014-05-02 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-6826:
-

Attachment: HIVE-6826.2.patch

Address Gunther's comments.

 Hive-tez has issues when different partitions work off of different input 
 types
 ---

 Key: HIVE-6826
 URL: https://issues.apache.org/jira/browse/HIVE-6826
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0, 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-6826.1.patch, HIVE-6826.2.patch


 create table test (key int, value string) partitioned by (p int) stored as 
 textfile;
 insert into table test partition (p=1) select * from src limit 10;
 alter table test set fileformat orc;
 insert into table test partition (p=2) select * from src limit 10;
 describe test;
 select * from test where p=1 and key  0;
 select * from test where p=2 and key  0;
 select * from test where key  0;
 throws a classcast exception



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6826) Hive-tez has issues when different partitions work off of different input types

2014-05-02 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-6826:
-

Status: Patch Available  (was: Open)

 Hive-tez has issues when different partitions work off of different input 
 types
 ---

 Key: HIVE-6826
 URL: https://issues.apache.org/jira/browse/HIVE-6826
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0, 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-6826.1.patch, HIVE-6826.2.patch


 create table test (key int, value string) partitioned by (p int) stored as 
 textfile;
 insert into table test partition (p=1) select * from src limit 10;
 alter table test set fileformat orc;
 insert into table test partition (p=2) select * from src limit 10;
 describe test;
 select * from test where p=1 and key  0;
 select * from test where p=2 and key  0;
 select * from test where key  0;
 throws a classcast exception



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6826) Hive-tez has issues when different partitions work off of different input types

2014-05-02 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-6826:
-

Status: Open  (was: Patch Available)

 Hive-tez has issues when different partitions work off of different input 
 types
 ---

 Key: HIVE-6826
 URL: https://issues.apache.org/jira/browse/HIVE-6826
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0, 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-6826.1.patch, HIVE-6826.2.patch


 create table test (key int, value string) partitioned by (p int) stored as 
 textfile;
 insert into table test partition (p=1) select * from src limit 10;
 alter table test set fileformat orc;
 insert into table test partition (p=2) select * from src limit 10;
 describe test;
 select * from test where p=1 and key  0;
 select * from test where p=2 and key  0;
 select * from test where key  0;
 throws a classcast exception



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6826) Hive-tez has issues when different partitions work off of different input types

2014-05-02 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-6826:
-

Attachment: (was: HIVE-6826.2.patch)

 Hive-tez has issues when different partitions work off of different input 
 types
 ---

 Key: HIVE-6826
 URL: https://issues.apache.org/jira/browse/HIVE-6826
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0, 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-6826.1.patch


 create table test (key int, value string) partitioned by (p int) stored as 
 textfile;
 insert into table test partition (p=1) select * from src limit 10;
 alter table test set fileformat orc;
 insert into table test partition (p=2) select * from src limit 10;
 describe test;
 select * from test where p=1 and key  0;
 select * from test where p=2 and key  0;
 select * from test where key  0;
 throws a classcast exception



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6826) Hive-tez has issues when different partitions work off of different input types

2014-05-02 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-6826:
-

Status: Patch Available  (was: Open)

 Hive-tez has issues when different partitions work off of different input 
 types
 ---

 Key: HIVE-6826
 URL: https://issues.apache.org/jira/browse/HIVE-6826
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0, 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-6826.1.patch, HIVE-6826.2.patch


 create table test (key int, value string) partitioned by (p int) stored as 
 textfile;
 insert into table test partition (p=1) select * from src limit 10;
 alter table test set fileformat orc;
 insert into table test partition (p=2) select * from src limit 10;
 describe test;
 select * from test where p=1 and key  0;
 select * from test where p=2 and key  0;
 select * from test where key  0;
 throws a classcast exception



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-6826) Hive-tez has issues when different partitions work off of different input types

2014-05-02 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-6826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-6826:
-

Attachment: HIVE-6826.2.patch

 Hive-tez has issues when different partitions work off of different input 
 types
 ---

 Key: HIVE-6826
 URL: https://issues.apache.org/jira/browse/HIVE-6826
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0, 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-6826.1.patch, HIVE-6826.2.patch


 create table test (key int, value string) partitioned by (p int) stored as 
 textfile;
 insert into table test partition (p=1) select * from src limit 10;
 alter table test set fileformat orc;
 insert into table test partition (p=2) select * from src limit 10;
 describe test;
 select * from test where p=1 and key  0;
 select * from test where p=2 and key  0;
 select * from test where key  0;
 throws a classcast exception



--
This message was sent by Atlassian JIRA
(v6.2#6252)


Re: Review Request 21026: Remove HDFS check in tez/DagUtils

2014-05-02 Thread Mohammad Islam

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/21026/#review42080
---



ql/src/java/org/apache/hadoop/hive/ql/exec/tez/DagUtils.java
https://reviews.apache.org/r/21026/#comment75826

Just a proposal: in place of throwing exception, you could LOG info/debug 
something telling what type of FS is utilized. This might help to debug if 
someone used  (intentionally or mistakenly) unsupported FS . 




- Mohammad Islam


On May 2, 2014, 9:16 p.m., Chuan Liu wrote:
 
 ---
 This is an automatically generated e-mail. To reply, visit:
 https://reviews.apache.org/r/21026/
 ---
 
 (Updated May 2, 2014, 9:16 p.m.)
 
 
 Review request for hive.
 
 
 Bugs: HIVE-7009
 https://issues.apache.org/jira/browse/HIVE-7009
 
 
 Repository: hive-git
 
 
 Description
 ---
 
 Removed all the DFS checks in the change. They should not impact any 
 functionality.
 
 
 Diffs
 -
 
   ql/src/java/org/apache/hadoop/hive/ql/exec/tez/DagUtils.java abbf38f 
 
 Diff: https://reviews.apache.org/r/21026/diff/
 
 
 Testing
 ---
 
 Manually tested on a cluster deployed in Windows Azure environment.
 
 
 Thanks,
 
 Chuan Liu
 




Re: Apache Hive 0.13.1

2014-05-02 Thread Sushanth Sowmyan
Hi Folks,

It is now past 6pm PDT May 2nd 2014, so I have locked down the table
of jiras for inclusion into 0.13.1. The jiras included are as listed
in :

https://cwiki.apache.org/confluence/display/Hive/Hive+0.13.1+Release+tracking

The jiras being included are :

HIVE-6919, HIVE-6921, HIVE-6957, HIVE-6945, HIVE-6862,
HIVE-6883, HIVE-6961, HIVE-4576, HIVE-6952, HIVE-6828
HIVE-6898, HIVE-6783, HIVE-6955, HIVE-6877, HIVE-6985
HIVE-7001

The inclusion requestors for the above are :

thejas, ashutoshc, sershe, ekoifman, prasanth_j, hagleitn, szehon,
rhbutani and amareshwari.

If you are in this list, I might be pinging you in the next couple of
days for help if the patch does not cleanly apply, or causes
additional failures, and I appreciate any help you can tend.

There is one jira on that list, HIVE-6826, which is in the list, and
will still be included if it moves from Patch-available to Committed
in 0.14 by Monday when I roll out the RC, which I have assurances from
vikram that it will, given that it's waiting on a test result.

If anyone has any further breaking jiras that they absolutely want in
this release, please do still bring it up, I've created a new table
for them in the release tracking wiki page. I do, however, raise the
bar for inclusion by asking two other committers to also +1 such an
inclusion request at this stage.

Hopefully, we'll see an RC and a release pretty soon! :)

Thanks all!
-Sushanth


On Thu, May 1, 2014 at 2:34 PM, Sushanth Sowmyan khorg...@gmail.com wrote:
 Hi All,

 A quick reminder - per the timeline discussed so far, the list of
 jiras for inclusion in 0.13.1 RC0 freezes tomorrow (May 2nd) evening
 at 6pm PDT. Please make sure you ask for inclusion or edit the wiki
 with any further requests so that we can get this release done in a
 timely manner.

 Thanks,
 -Sushanth


 On Mon, Apr 28, 2014 at 1:39 PM, Sushanth Sowmyan khorg...@gmail.com wrote:
 Good question - I think that if the number of patches is small (  ~20
 which I expect because this is a stabilization release soon after
 0.13.0), I'd prefer to commit them myself, so that I can retain order
 of patch application and make sure they all apply cleanly. I'm also
 actually using this as an opportunity towards testing out an
 semi-automated script that reads the requested jiras to do this - I
 figure it's useful to test something like that out on a smaller
 release that'll make further releases easier.

 On Mon, Apr 28, 2014 at 11:42 AM, Thejas Nair the...@hortonworks.com wrote:
 Sushanth,
 How do you want to go about committing the patches into 0.13 branch ?
 Will you be able to commit the requested patches into 0.13.1 yourself, or
 do you want requestors to discuss it with you and then commit the patches
 themselves (if the requestor is a committer).

 Thanks,
 Thejas



 On Sat, Apr 26, 2014 at 2:19 PM, Sushanth Sowmyan khorg...@gmail.comwrote:

 Added.

 If others have difficulty editing( I can't figure out how to change editing
 privileges, but it seems to indicate that others can edit) the page, I'll
 accept replies to this thread as well and can add it in.
  On Apr 25, 2014 6:25 PM, Sergey Shelukhin ser...@hortonworks.com
 wrote:

  I don't have access to edit this page (or cannot figure out the UI).
  Username sershe.
  Can you add
  HIVE-6961 : Drop partitions treats partition columns as strings (area -
  metastore)
 
 
  On Fri, Apr 25, 2014 at 4:20 PM, Sushanth Sowmyan khorg...@gmail.com
  wrote:
 
   I've created the following wiki link :
  
  
 
 https://cwiki.apache.org/confluence/display/Hive/Hive+0.13.1+Release+tracking
  
   People should be able to request additional jiras by adding it to the
   list. I think it might make sense to halt addition of requests to the
   list 3 days before the RC is cut, so as to prevent an endless-tail
   scenario, unless the bug in question is a breaking severe issue,
   where, yes, after discussion, we can vote to add it to the list. That
   also gives us time to run a full suite of tests on a stable build
   before we cut the RC.
  
   I propose that the first RC (RC0) be built on Monday May 5th at 6pm
   PDT, and the jira list on the wiki be closed to open/easy additions at
   6pm PDT on Friday May 2nd.
  
  
   On Fri, Apr 25, 2014 at 2:40 PM, Gunther Hagleitner
   ghagleit...@hortonworks.com wrote:
Sorry - HIVE-6824 isn't needed. Just the other 3. My bad.
   
Thanks,
Gunther.
   
   
On Fri, Apr 25, 2014 at 2:10 PM, Gunther Hagleitner 
ghagleit...@hortonworks.com wrote:
   
I'd like to request to include these Tez fixes:
   
HIVE-6824, HIVE-6826, HIVE-6828, HIVE-6898
   
Thanks,
Gunther.
   
   
On Fri, Apr 25, 2014 at 11:59 AM, Sushanth Sowmyan 
  khorg...@gmail.com
   wrote:
   
True, I was counting two weeks from today, but 0.13 has already
 been
out for a week. I'm amenable to having an RC1 out on May 5th. If
 any
further issues appear that block, then we can deal with them in an
RC2/etc modification to 

[jira] [Created] (HIVE-7011) HiveInputFormat's split generation isn't thread safe

2014-05-02 Thread Gunther Hagleitner (JIRA)
Gunther Hagleitner created HIVE-7011:


 Summary: HiveInputFormat's split generation isn't thread safe
 Key: HIVE-7011
 URL: https://issues.apache.org/jira/browse/HIVE-7011
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0, 0.14.0
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner


Tez will do split generation in parallel. Need to protect the inputformat cache 
against concurrent access.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7011) HiveInputFormat's split generation isn't thread safe

2014-05-02 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-7011:
-

Attachment: HIVE-7011.1.patch

 HiveInputFormat's split generation isn't thread safe
 

 Key: HIVE-7011
 URL: https://issues.apache.org/jira/browse/HIVE-7011
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0, 0.14.0
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-7011.1.patch


 Tez will do split generation in parallel. Need to protect the inputformat 
 cache against concurrent access.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-7011) HiveInputFormat's split generation isn't thread safe

2014-05-02 Thread Gunther Hagleitner (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-7011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gunther Hagleitner updated HIVE-7011:
-

Status: Patch Available  (was: Open)

 HiveInputFormat's split generation isn't thread safe
 

 Key: HIVE-7011
 URL: https://issues.apache.org/jira/browse/HIVE-7011
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.13.0, 0.14.0
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-7011.1.patch


 Tez will do split generation in parallel. Need to protect the inputformat 
 cache against concurrent access.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-7008) Clean-up some old dead code

2014-05-02 Thread Hive QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13988535#comment-13988535
 ] 

Hive QA commented on HIVE-7008:
---



{color:red}Overall{color}: -1 at least one tests failed

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12643087/HIVE-7008.patch

{color:red}ERROR:{color} -1 due to 6 failed/errored test(s), 5428 tests executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_groupby2_map_skew
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_stats_list_bucket
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_stats_partscan_1_23
org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_root_dir_external_table
org.apache.hadoop.hive.cli.TestNegativeCliDriver.testNegativeCliDriver_dynamic_partitions_with_whitelist
org.apache.hadoop.hive.cli.TestNegativeCliDriver.testNegativeCliDriver_stats_partialscan_autogether
{noformat}

Test results: 
http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-Build/108/testReport
Console output: 
http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-Build/108/console

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 6 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12643087

 Clean-up some old dead code
 ---

 Key: HIVE-7008
 URL: https://issues.apache.org/jira/browse/HIVE-7008
 Project: Hive
  Issue Type: Task
  Components: Tests
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Attachments: HIVE-7008.1.patch, HIVE-7008.patch


 There is some code to workaround limitations in historic Hadoop (hadoop-17  
 earlier). Lets get rid of those.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-6684) Beeline does not accept comments that are preceded by spaces

2014-05-02 Thread Xuefu Zhang (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-6684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13988547#comment-13988547
 ] 

Xuefu Zhang commented on HIVE-6684:
---

Patch looks good. Minor nit though, is that in the line
{code}
+return line.trim().startsWith(#) || line.trim().startsWith(--);
{code}
line.trim() may be called twice, which is unnecessary. Better if just to trim 
once.

 Beeline does not accept comments that are preceded by spaces
 

 Key: HIVE-6684
 URL: https://issues.apache.org/jira/browse/HIVE-6684
 Project: Hive
  Issue Type: Bug
  Components: CLI
Affects Versions: 0.10.0
Reporter: Jeremy Beard
 Attachments: HIVE-6684.1.patch


 Beeline throws an error if single-line comments are indented with spaces. 
 This works in the embedded Hive CLI.
 For example:
 SELECT
-- this is the field we want
field
 FROM
table;
 Error: Error while processing statement: FAILED: ParseException line 1:71 
 cannot recognize input near 'EOF' 'EOF' 'EOF' in select clause 
 (state=42000,code=4)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HIVE-5823) Support for DECIMAL primitive type in AvroSerDe

2014-05-02 Thread Xuefu Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-5823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xuefu Zhang updated HIVE-5823:
--

Attachment: HIVE-5823.5.patch

 Support for DECIMAL primitive type in AvroSerDe
 ---

 Key: HIVE-5823
 URL: https://issues.apache.org/jira/browse/HIVE-5823
 Project: Hive
  Issue Type: New Feature
  Components: Serializers/Deserializers
Affects Versions: 0.12.0
Reporter: Mariano Dominguez
Assignee: Xuefu Zhang
  Labels: avro, serde
 Attachments: HIVE-5823.1.patch, HIVE-5823.2.patch, HIVE-5823.3.patch, 
 HIVE-5823.4.patch, HIVE-5823.5.patch, HIVE-5823.patch, dec.avro


 This new feature request would be tied to AVRO-1402.
 Adding DECIMAL support would be particularly interesting when converting 
 types from Avro to Hive, since DECIMAL is already a supported data type in 
 Hive.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


Re: Review Request 20826: HIVE-5823: Support for DECIMAL primitive type in AvroSerDe

2014-05-02 Thread Xuefu Zhang

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/20826/
---

(Updated May 3, 2014, 3:44 a.m.)


Review request for hive.


Bugs: HIVE-5823
https://issues.apache.org/jira/browse/HIVE-5823


Repository: hive-git


Description
---

Support decimal type in Avro storage. The implemenation is based on 
specifications detailed in AVRO-1402.


Diffs (updated)
-

  data/files/dec.txt PRE-CREATION 
  ql/src/java/org/apache/hadoop/hive/ql/exec/TableScanOperator.java ecce21e 
  ql/src/java/org/apache/hadoop/hive/ql/io/avro/AvroGenericRecordReader.java 
52a22e5 
  ql/src/test/queries/clientpositive/avro_decimal.q PRE-CREATION 
  ql/src/test/queries/clientpositive/avro_schema_literal.q d77f310 
  ql/src/test/results/clientpositive/avro_decimal.q.out PRE-CREATION 
  ql/src/test/results/clientpositive/avro_schema_literal.q.out ca945d5 
  serde/src/java/org/apache/hadoop/hive/serde2/avro/AvroDeserializer.java 
a28861f 
  
serde/src/java/org/apache/hadoop/hive/serde2/avro/AvroGenericRecordWritable.java
 8beffd7 
  serde/src/java/org/apache/hadoop/hive/serde2/avro/AvroSerDe.java 92799ed 
  serde/src/java/org/apache/hadoop/hive/serde2/avro/AvroSerdeUtils.java 9d58d13 
  serde/src/java/org/apache/hadoop/hive/serde2/avro/AvroSerializer.java b2c58c7 
  serde/src/java/org/apache/hadoop/hive/serde2/avro/SchemaToTypeInfo.java 
251f04f 
  serde/src/test/org/apache/hadoop/hive/serde2/avro/TestAvroSerializer.java 
b3559ea 
  
serde/src/test/org/apache/hadoop/hive/serde2/avro/TestGenericAvroRecordWritable.java
 a0e5018 
  serde/src/test/org/apache/hadoop/hive/serde2/avro/Utils.java d5730fa 

Diff: https://reviews.apache.org/r/20826/diff/


Testing
---

Unit tests are added. Test suite passed.


Thanks,

Xuefu Zhang