[jira] [Created] (HIVE-11546) Projected columns read size should be scaled to split size for ORC Splits

2015-08-13 Thread Prasanth Jayachandran (JIRA)
Prasanth Jayachandran created HIVE-11546:


 Summary: Projected columns read size should be scaled to split 
size for ORC Splits
 Key: HIVE-11546
 URL: https://issues.apache.org/jira/browse/HIVE-11546
 Project: Hive
  Issue Type: Bug
Affects Versions: 1.3.0, 2.0.0
Reporter: Prasanth Jayachandran
Assignee: Prasanth Jayachandran


HIVE-10114 added projected columns data size information to OrcSplit which Tez 
Split Grouper can take advantage off (HIVE-7428). The annotated data size was 
for entire file and was not scaled to the split size. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


Build failed in Jenkins: HIVE-TRUNK-JAVA8 #101

2015-08-13 Thread hiveqa
See 
http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/HIVE-TRUNK-JAVA8/101/changes

Changes:

[jcamacho] HIVE-11437: CBO: Calcite Operator To Hive Operator (Calcite Return 
Path) : dealing with insert into (Pengcheng Xiong, reviewed by Jesus Camacho 
Rodriguez)

[amareshwari] HIVE-11376 : Removes legacy code wrt skipping files with codec to 
HiveInputFormat from CombineHiveInputFormat (Rajat Khandelwal, reviewed by 
Amareshwari)

[gopalv] HIVE-11457: Vectorization: Improve GenVectorCode string equals 
intrinsic (Gopal V, reviewed by Matt McCline)

[harisankar] HIVE-11387: CBO: Calcite Operator To Hive Operator (Calcite Return 
Path) : fix reduce_deduplicate optimization (Pengcheng Xiong, reviewed by Jesus 
Camacho Rodriguez, Hari Subramaniyan)

[gunther] HIVE-11398: Parse wide OR and wide AND trees to flat OR/AND trees 
(Jesus Camacho Rodriguez via Gopal V)

[dong1.chen] HIVE-11498: HIVE Authorization v2 should not check permission for 
dummy entity (Dapeng Sun via Dong Chen)

[amareshwari] HIVE-11278 : Fix Partition.setOutputFormatClass to set class name 
properly (Rajat Khandelwal, reviewed by Amareshwari)

[Alan Gates] HIVE-11511 Output the message of orcfiledump when ORC files are 
not specified (Shinichi Yamashita via gates)

[Chao Sun] HIVE-11340 - Create ORC based table using like clause doesn't copy 
compression property (Yongzhi Chen, reviewed by Chao Sun)

[pxiong] HIVE-11416: CBO: Calcite Operator To Hive Operator (Calcite Return 
Path): Groupby Optimizer assumes the schema can match after removing RS and GBY 
(reviewed by Jesus Camacho Rodriguez)

[jdere] HIVE-11449: Capacity must be a power of two error when 
HybridHashTableContainer memory threshold is too low (Jason Dere, reviewed by 
Sergey Shelukhin)

[pxiong] HIVE-11348: Support START TRANSACTION/COMMIT/ROLLBACK commands: 
support SQL2011 reserved keywords (Pengcheng Xiong reviewed by Eugene Koifman)

[daijy] HIVE-11442: Remove commons-configuration.jar from Hive distribution

[pxiong] HIVE-11480: CBO: Calcite Operator To Hive Operator (Calcite Return 
Path): char/varchar as input to GenericUDAF (Pengcheng Xiong, reviewed by Jesus 
Camacho Rodriguez)

[gopalv] HIVE-11462: Constant fold struct() UDF (Gopal V, reviewed by Hari 
Sankar Sivarama Subramaniyan)

--
[...truncated 325 lines...]
A
ptest2/src/main/java/org/apache/hive/ptest/execution/context/CloudComputeService.java
Aptest2/src/main/java/org/apache/hive/ptest/execution/conf
Aptest2/src/main/java/org/apache/hive/ptest/execution/conf/Context.java
A
ptest2/src/main/java/org/apache/hive/ptest/execution/conf/TestConfiguration.java
Aptest2/src/main/java/org/apache/hive/ptest/execution/conf/TestParser.java
Aptest2/src/main/java/org/apache/hive/ptest/execution/conf/Host.java
A
ptest2/src/main/java/org/apache/hive/ptest/execution/conf/ExecutionContextConfiguration.java
Aptest2/src/main/java/org/apache/hive/ptest/execution/conf/TestBatch.java
A
ptest2/src/main/java/org/apache/hive/ptest/execution/conf/UnitTestBatch.java
A
ptest2/src/main/java/org/apache/hive/ptest/execution/conf/QFileTestBatch.java
Aptest2/src/main/java/org/apache/hive/ptest/execution/JIRAService.java
Aptest2/src/main/java/org/apache/hive/ptest/execution/Drone.java
Aptest2/src/main/java/org/apache/hive/ptest/api
Aptest2/src/main/java/org/apache/hive/ptest/api/client
Aptest2/src/main/java/org/apache/hive/ptest/api/client/PTestClient.java
Aptest2/src/main/java/org/apache/hive/ptest/api/response
A
ptest2/src/main/java/org/apache/hive/ptest/api/response/TestLogResponse.java
A
ptest2/src/main/java/org/apache/hive/ptest/api/response/TestStatusResponse.java
Aptest2/src/main/java/org/apache/hive/ptest/api/response/TestStatus.java
A
ptest2/src/main/java/org/apache/hive/ptest/api/response/TestStopResponse.java
A
ptest2/src/main/java/org/apache/hive/ptest/api/response/TestListResponse.java
A
ptest2/src/main/java/org/apache/hive/ptest/api/response/GenericResponse.java
A
ptest2/src/main/java/org/apache/hive/ptest/api/response/TestStartResponse.java
Aptest2/src/main/java/org/apache/hive/ptest/api/Status.java
Aptest2/src/main/java/org/apache/hive/ptest/api/server
A
ptest2/src/main/java/org/apache/hive/ptest/api/server/ExecutionController.java
Aptest2/src/main/java/org/apache/hive/ptest/api/server/TestExecutor.java
Aptest2/src/main/java/org/apache/hive/ptest/api/server/Test.java
Aptest2/src/main/java/org/apache/hive/ptest/api/server/TestLogger.java
Aptest2/src/main/java/org/apache/hive/ptest/api/request
Aptest2/src/main/java/org/apache/hive/ptest/api/request/TestListRequest.java
A
ptest2/src/main/java/org/apache/hive/ptest/api/request/TestStartRequest.java
Aptest2/src/main/java/org/apache/hive/ptest/api/request/TestLogRequest.java
A
ptest2/src/main/java/org/apache/hive/ptest/api/request/TestStatusRequest.java
A

Re: [DISCUSS] Hive and HBase dependency

2015-08-13 Thread Alan Gates
On the hbase-metastore branch I've actually already moved to HBase 1.1.  
I'm +1 for moving to 1.1 or 1.0 on master and staying at 0.98 on branch-1.


Alan.


kulkarni.swar...@gmail.com mailto:kulkarni.swar...@gmail.com
August 12, 2015 at 8:43
Hi all,

It seems like our current dependency on HBase is a little fuzzy to say the
least. And with increased features relying on HBase(HBase integration,
HBase metastore etc), I think it would be worth giving a thought into how
we want to manage this dependency. I have also seen regressions[1][2] come
up recently due to this dependency not managed properly. Plus we need to
think about moving to HBase 1.0 soon as well to take advantage of the
backwards compatibility guarantees that HBase is providing.

Our current HBase dependency is 0.98.9. Also with out current bifurcation
of branches to create a 1.x branch for stability and 2.x for bleeding 
edge,

I propose that we still keep the version to 0.98.9 on the 1.x branch and
move to HBase 1.0 in our 2.0 branch. In that way we can start taking
advantage of the latest updates to the HBase API in our 2.x branch and
still keep 1.x backwards compatible by avoiding a direct jump to HBase 
1.0.

If we decide to go this route, we might need to revert back some of the
compatibility breaking changes[2] that sneaked into 1.x and move them over
to 2.x.

Thoughts?

Thanks,
Swarnim


[1] https://issues.apache.org/jira/browse/HIVE-10990
[2] https://issues.apache.org/jira/browse/HIVE-8898



Hive-0.14 - Build # 1042 - Still Failing

2015-08-13 Thread Apache Jenkins Server
Changes for Build #1021

Changes for Build #1022

Changes for Build #1023

Changes for Build #1024

Changes for Build #1025

Changes for Build #1026

Changes for Build #1027

Changes for Build #1028

Changes for Build #1029

Changes for Build #1030

Changes for Build #1031

Changes for Build #1032

Changes for Build #1033

Changes for Build #1034

Changes for Build #1035

Changes for Build #1036

Changes for Build #1037

Changes for Build #1038

Changes for Build #1039

Changes for Build #1040

Changes for Build #1041

Changes for Build #1042



No tests ran.

The Apache Jenkins build system has built Hive-0.14 (build #1042)

Status: Still Failing

Check console output at https://builds.apache.org/job/Hive-0.14/1042/ to view 
the results.

[jira] [Created] (HIVE-11548) HCatLoader should support predicate pushdown.

2015-08-13 Thread Mithun Radhakrishnan (JIRA)
Mithun Radhakrishnan created HIVE-11548:
---

 Summary: HCatLoader should support predicate pushdown.
 Key: HIVE-11548
 URL: https://issues.apache.org/jira/browse/HIVE-11548
 Project: Hive
  Issue Type: New Feature
  Components: HCatalog
Reporter: Mithun Radhakrishnan
Assignee: Mithun Radhakrishnan


When one uses {{HCatInputFormat}}/{{HCatLoader}} to read from file-formats that 
support predicate pushdown (such as ORC, with 
{{hive.optimize.index.filter=true}}), one sees that the predicates aren't 
actually pushed down into the storage layer.

The forthcoming patch should allow for filter-pushdown, if any of the 
partitions being scanned with {{HCatLoader}} support the functionality. The 
patch should technically allow the same for users of {{HCatInputFormat}}, but I 
don't currently have a neat interface to build a compound predicate-expression. 
Will add this separately, if required.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


too many 1.*.* unreleased versions on the JIRA

2015-08-13 Thread Sergey Shelukhin
On the JIRA, we currently have 1.1.0 marked as unreleased even though
1.2.0 is released (and 1.1.1 is also present); then, we have both 1.0.1
and 1.0.2, plus 1.2.1 and 1.2.2 showing in unreleased.
I poked around and cannot see where this can be changed. Release managers
for respective releases should probably clean this up, anyway :)




[jira] [Created] (HIVE-11552) implement basic methods for getting/putting file metadata

2015-08-13 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HIVE-11552:
---

 Summary: implement basic methods for getting/putting file metadata
 Key: HIVE-11552
 URL: https://issues.apache.org/jira/browse/HIVE-11552
 Project: Hive
  Issue Type: Bug
Affects Versions: hbase-metastore-branch
Reporter: Sergey Shelukhin
Assignee: Sergey Shelukhin






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HIVE-11554) Exchange partition outputs missing from post execute hooks

2015-08-13 Thread Paul Yang (JIRA)
Paul Yang created HIVE-11554:


 Summary: Exchange partition outputs missing from post execute hooks
 Key: HIVE-11554
 URL: https://issues.apache.org/jira/browse/HIVE-11554
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 1.2.0, 1.0.0, 0.14.0, 0.13.0, 0.12.0
Reporter: Paul Yang


The pre/post execute hook interface has fields that indicate which Hive objects 
were read / written to as a result of running the query. For the exchange 
partition operation, these fields (ReadEntity and WriteEntity) are empty. 

This is an important issue as the hook interface may be configured to perform 
critical warehouse operations.

See
{noformat}
ql/src/test/results/clientpositive/exchange_partition3.q.out
{noformat}

{noformat}
POSTHOOK: query: -- This will exchange both partitions hr=1 and hr=2
ALTER TABLE exchange_part_test1 EXCHANGE PARTITION (ds='2013-04-05') WITH TABLE 
exchange_part_test2
POSTHOOK: type: null
{noformat}

The post hook should not say null.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HIVE-11553) use basic file metadata cache in ETLSplitStrategy-related paths

2015-08-13 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HIVE-11553:
---

 Summary: use basic file metadata cache in ETLSplitStrategy-related 
paths
 Key: HIVE-11553
 URL: https://issues.apache.org/jira/browse/HIVE-11553
 Project: Hive
  Issue Type: Bug
Affects Versions: hbase-metastore-branch
Reporter: Sergey Shelukhin
Assignee: Sergey Shelukhin






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HIVE-11551) Branch 1.0 : test cases regression after cherry-pick HIVE-9397

2015-08-13 Thread Pengcheng Xiong (JIRA)
Pengcheng Xiong created HIVE-11551:
--

 Summary: Branch 1.0 : test cases regression after cherry-pick 
HIVE-9397
 Key: HIVE-11551
 URL: https://issues.apache.org/jira/browse/HIVE-11551
 Project: Hive
  Issue Type: Bug
Reporter: Pengcheng Xiong
Assignee: Pengcheng Xiong


insert_into1.q,stats_only_null.q,metadata_only_queries_with_filters.q,insert_into2.q,metadata_only_queries.q



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HIVE-11549) Hide Hive configuration from spark driver launching process

2015-08-13 Thread Xuefu Zhang (JIRA)
Xuefu Zhang created HIVE-11549:
--

 Summary: Hide Hive configuration from spark driver launching 
process
 Key: HIVE-11549
 URL: https://issues.apache.org/jira/browse/HIVE-11549
 Project: Hive
  Issue Type: Bug
  Components: Spark
Affects Versions: 1.2.0
Reporter: Xuefu Zhang
Assignee: Xuefu Zhang


Hive uses Spark application submission script, spark-submit, to launch remote 
spark driver. Starting from Spark 1.4, this script also does a lot of things 
that Hive doesn't need, for instance, accessing metastore for delegation 
tokens. Hive on Spark doesn't need this, and one way to do this is hide Hive 
configuration from being visible by that script.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HIVE-11550) ACID queries pollute HiveConf

2015-08-13 Thread Eugene Koifman (JIRA)
Eugene Koifman created HIVE-11550:
-

 Summary: ACID queries pollute HiveConf
 Key: HIVE-11550
 URL: https://issues.apache.org/jira/browse/HIVE-11550
 Project: Hive
  Issue Type: Bug
  Components: Transactions
Affects Versions: 1.0.0
Reporter: Eugene Koifman
Assignee: Eugene Koifman


HiveConf is a SessionState level object.  Some ACID related logic makes changes 
to it (which are meant to be per query) but become permanent.

See SemanticAnalyzer.checkAcidConstraints()

[~alangates], do you know of other cases or ideas on how to deal with this 
differently?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HIVE-11547) beeline does not continue running the script after an error occurs while beeline --force=true is already set.

2015-08-13 Thread Wei Huang (JIRA)
Wei Huang created HIVE-11547:


 Summary: beeline does not continue running the script after an 
error occurs while beeline --force=true is already set.
 Key: HIVE-11547
 URL: https://issues.apache.org/jira/browse/HIVE-11547
 Project: Hive
  Issue Type: Bug
  Components: Beeline
Affects Versions: 1.2.0
 Environment: HDP 2.3 on Virtual box 
Reporter: Wei Huang


If you execute beeline to run a SQL script file, using the following command

 beeline -f query file name

the beeline exists after the first error. i.e. when a test query fails beeline 
quits to the CLI.
The beeline --force=true seems to have a bug and it does not continue running 
the script after an error occurs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)