[jira] [Created] (HIVE-20688) Update Committer List

2018-10-03 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-20688:
--

 Summary: Update Committer List
 Key: HIVE-20688
 URL: https://issues.apache.org/jira/browse/HIVE-20688
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


Please update committer list:
Name: Janaki Lahorani
Apache ID: janaki
Organization: Cloudera



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-20570) Union ALL with hive.optimize.union.remove=true has incorrect plan

2018-09-16 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-20570:
--

 Summary: Union ALL with hive.optimize.union.remove=true has 
incorrect plan
 Key: HIVE-20570
 URL: https://issues.apache.org/jira/browse/HIVE-20570
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani


When hive.optimize.union.remove=true and a select query is run with group by, 
the final fetch is waiting only for one of the branches and not both.

Test Case:
{code}
create table if not exists test_table(column1 string, column2 int);
insert into test_table values('a',1),('b',2);

set hive.optimize.union.remove=true;
set mapred.input.dir.recursive=true;

explain
select column1 from test_table group by column1
union all
select column1 from test_table group by column1;
{code}

In the below the two stages correspond to the two parts of union all.  But the 
final fetch operator (Stage 0) only depends on one of the stages, but it should 
depend on both.
Plan:
{code}
STAGE DEPENDENCIES:
  Stage-1 is a root stage
  Stage-2 is a root stage
*  Stage-0 depends on stages: Stage-1*

STAGE PLANS:
  Stage: Stage-1
Map Reduce
  Map Operator Tree:
  TableScan
alias: test_table
Statistics: Num rows: 2 Data size: 6 Basic stats: COMPLETE Column 
stats: NONE
Select Operator
  expressions: column1 (type: string)
  outputColumnNames: column1
  Statistics: Num rows: 2 Data size: 6 Basic stats: COMPLETE Column 
stats: NONE
  Group By Operator
keys: column1 (type: string)
mode: hash
outputColumnNames: _col0
Statistics: Num rows: 2 Data size: 6 Basic stats: COMPLETE 
Column stats: NONE
Reduce Output Operator
  key expressions: _col0 (type: string)
  sort order: +
  Map-reduce partition columns: _col0 (type: string)
  Statistics: Num rows: 2 Data size: 6 Basic stats: COMPLETE 
Column stats: NONE
  Execution mode: vectorized
  Reduce Operator Tree:
Group By Operator
  keys: KEY._col0 (type: string)
  mode: mergepartial
  outputColumnNames: _col0
  Statistics: Num rows: 1 Data size: 3 Basic stats: COMPLETE Column 
stats: NONE
  File Output Operator
compressed: false
Statistics: Num rows: 1 Data size: 3 Basic stats: COMPLETE Column 
stats: NONE
table:
input format: org.apache.hadoop.mapred.SequenceFileInputFormat
output format: 
org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe

  Stage: Stage-2
Map Reduce
  Map Operator Tree:
  TableScan
alias: test_table
Statistics: Num rows: 2 Data size: 6 Basic stats: COMPLETE Column 
stats: NONE
Select Operator
  expressions: column1 (type: string)
  outputColumnNames: column1
  Statistics: Num rows: 2 Data size: 6 Basic stats: COMPLETE Column 
stats: NONE
  Group By Operator
keys: column1 (type: string)
mode: hash
outputColumnNames: _col0
Statistics: Num rows: 2 Data size: 6 Basic stats: COMPLETE 
Column stats: NONE
Reduce Output Operator
  key expressions: _col0 (type: string)
  sort order: +
  Map-reduce partition columns: _col0 (type: string)
  Statistics: Num rows: 2 Data size: 6 Basic stats: COMPLETE 
Column stats: NONE
  Execution mode: vectorized
  Reduce Operator Tree:
Group By Operator
  keys: KEY._col0 (type: string)
  mode: mergepartial
  outputColumnNames: _col0
  Statistics: Num rows: 1 Data size: 3 Basic stats: COMPLETE Column 
stats: NONE
  File Output Operator
compressed: false
Statistics: Num rows: 1 Data size: 3 Basic stats: COMPLETE Column 
stats: NONE
table:
input format: org.apache.hadoop.mapred.SequenceFileInputFormat
output format: 
org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe

  Stage: Stage-0
Fetch Operator
  limit: -1
  Processor Tree:
ListSink
{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-20562) Intermittent test failures from Druid tests

2018-09-14 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-20562:
--

 Summary: Intermittent test failures from Druid tests
 Key: HIVE-20562
 URL: https://issues.apache.org/jira/browse/HIVE-20562
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani


Druid tests are failing intermittently in Hive Pre-commit jobs.

The typical failures include:
org.apache.hadoop.hive.cli.TestMiniDruidCliDriver.testCliDriver[druidmini_dynamic_partition]
 (batchId=193)
org.apache.hadoop.hive.cli.TestMiniDruidCliDriver.testCliDriver[druidmini_expressions]
 (batchId=193)
org.apache.hadoop.hive.cli.TestMiniDruidCliDriver.testCliDriver[druidmini_test1]
 (batchId=193)
org.apache.hadoop.hive.cli.TestMiniDruidCliDriver.testCliDriver[druidmini_test_alter]
 (batchId=193)
org.apache.hadoop.hive.cli.TestMiniDruidCliDriver.testCliDriver[druidmini_test_insert]
 (batchId=193)

The test log shows the following:
Exception: org.skife.jdbi.v2.exceptions.UnableToObtainConnectionException: 
java.sql.SQLException: Cannot create PoolableConnectionFactory 
(java.net.ConnectException : Error connecting to server localhost on port 
60,000 with message Connection refused.)
org.apache.hadoop.hive.ql.metadata.HiveException: 
org.skife.jdbi.v2.exceptions.UnableToObtainConnectionException: 
java.sql.SQLException: Cannot create PoolableConnectionFactory 
(java.net.ConnectException : Error connecting to server localhost on port 
60,000 with message Connection refused.)
at org.apache.hadoop.hive.ql.metadata.Hive.dropTable(Hive.java:1077)
at 
org.apache.hadoop.hive.ql.QTestUtil.clearTablesCreatedDuringTests(QTestUtil.java:958)
at 
org.apache.hadoop.hive.ql.QTestUtil.clearTestSideEffects(QTestUtil.java:1039)
at 
org.apache.hadoop.hive.cli.control.CoreCliDriver$5.invokeInternal(CoreCliDriver.java:135)
at 
org.apache.hadoop.hive.cli.control.CoreCliDriver$5.invokeInternal(CoreCliDriver.java:131)
at 
org.apache.hadoop.hive.util.ElapsedTimeLoggingWrapper.invoke(ElapsedTimeLoggingWrapper.java:33)
at 
org.apache.hadoop.hive.cli.control.CoreCliDriver.tearDown(CoreCliDriver.java:138)
at 
org.apache.hadoop.hive.cli.control.CliAdapter$2$1.evaluate(CliAdapter.java:94)

The following search shows many Hive Jiras with patches where Druid tests are 
failing.
https://issues.apache.org/jira/issues/?jql=project%20%3D%20HIVE%20AND%20text%20~%20druidmini%20ORDER%20BY%20key%20DESC




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-20527) Intern table descriptors from spark task

2018-09-10 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-20527:
--

 Summary: Intern table descriptors from spark task
 Key: HIVE-20527
 URL: https://issues.apache.org/jira/browse/HIVE-20527
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


Table descriptors from MR tasks and Tez tasks are interned.  This fix is to 
intern table desc from spark tasks as well.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-20526) Add test case for HIVE-20489

2018-09-10 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-20526:
--

 Summary: Add test case for HIVE-20489
 Key: HIVE-20526
 URL: https://issues.apache.org/jira/browse/HIVE-20526
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


Add a test case for the issue discussed in HIVE-20489.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-20489) Explain plan of query hangs

2018-08-30 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-20489:
--

 Summary: Explain plan of query hangs
 Key: HIVE-20489
 URL: https://issues.apache.org/jira/browse/HIVE-20489
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani


Explain on a query that joins 47 views, in effect around 94 joins after view 
expansion seems to take forever. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-20437) Handle schema evolution from float, double and decimal

2018-08-21 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-20437:
--

 Summary: Handle schema evolution from float, double and decimal
 Key: HIVE-20437
 URL: https://issues.apache.org/jira/browse/HIVE-20437
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


When data created as float, double or decimal in parquet format is read back 
using some other type, errors are seen.  Parquet should behave just like any 
other format.  If the value is valid for the new type, data is retuned 
otherwise null has to be returned.  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-19940) Push predicates with deterministic UDFs with RBO

2018-06-18 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-19940:
--

 Summary: Push predicates with deterministic UDFs with RBO
 Key: HIVE-19940
 URL: https://issues.apache.org/jira/browse/HIVE-19940
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


With RBO, predicates with any UDF doesn't get pushed down.  It makes sense to 
not pushdown the predicates with non-deterministic function as the meaning of 
the query changes after the predicate is resolved to use the function.  But 
pushing a deterministic function is beneficial.

Test Case:
{code}
set hive.cbo.enable=false;

CREATE TABLE `testb`(
   `cola` string COMMENT '',
   `colb` string COMMENT '',
   `colc` string COMMENT '')
PARTITIONED BY (
   `part1` string,
   `part2` string,
   `part3` string)

STORED AS AVRO;

CREATE TABLE `testa`(
   `col1` string COMMENT '',
   `col2` string COMMENT '',
   `col3` string COMMENT '',
   `col4` string COMMENT '',
   `col5` string COMMENT '')
PARTITIONED BY (
   `part1` string,
   `part2` string,
   `part3` string)
STORED AS AVRO;

insert into testA partition (part1='US', part2='ABC', part3='123')
values ('12.34', '100', '200', '300', 'abc'),
('12.341', '1001', '2001', '3001', 'abcd');

insert into testA partition (part1='UK', part2='DEF', part3='123')
values ('12.34', '100', '200', '300', 'abc'),
('12.341', '1001', '2001', '3001', 'abcd');

insert into testA partition (part1='US', part2='DEF', part3='200')
values ('12.34', '100', '200', '300', 'abc'),
('12.341', '1001', '2001', '3001', 'abcd');

insert into testA partition (part1='CA', part2='ABC', part3='300')
values ('12.34', '100', '200', '300', 'abc'),
('12.341', '1001', '2001', '3001', 'abcd');

insert into testB partition (part1='CA', part2='ABC', part3='300')
values ('600', '700', 'abc'), ('601', '701', 'abcd');

insert into testB partition (part1='CA', part2='ABC', part3='400')
values ( '600', '700', 'abc'), ( '601', '701', 'abcd');

insert into testB partition (part1='UK', part2='PQR', part3='500')
values ('600', '700', 'abc'), ('601', '701', 'abcd');

insert into testB partition (part1='US', part2='DEF', part3='200')
values ( '600', '700', 'abc'), ('601', '701', 'abcd');

insert into testB partition (part1='US', part2='PQR', part3='123')
values ( '600', '700', 'abc'), ('601', '701', 'abcd');

-- views with deterministic functions
create view viewDeterministicUDFA partitioned on (vpart1, vpart2, vpart3) as 
select
 cast(col1 as decimal(38,18)) as vcol1,
 cast(col2 as decimal(38,18)) as vcol2,
 cast(col3 as decimal(38,18)) as vcol3,
 cast(col4 as decimal(38,18)) as vcol4,
 cast(col5 as char(10)) as vcol5,
 cast(part1 as char(2)) as vpart1,
 cast(part2 as char(3)) as vpart2,
 cast(part3 as char(3)) as vpart3
 from testa
where part1 in ('US', 'CA');

create view viewDeterministicUDFB partitioned on (vpart1, vpart2, vpart3) as 
select
 cast(cola as decimal(38,18)) as vcolA,
 cast(colb as decimal(38,18)) as vcolB,
 cast(colc as char(10)) as vcolC,
 cast(part1 as char(2)) as vpart1,
 cast(part2 as char(3)) as vpart2,
 cast(part3 as char(3)) as vpart3
 from testb
where part1 in ('US', 'CA');

explain
select vcol1, vcol2, vcol3, vcola, vcolb
from viewDeterministicUDFA a inner join viewDeterministicUDFB b
on a.vpart1 = b.vpart1
and a.vpart2 = b.vpart2
and a.vpart3 = b.vpart3
and a.vpart1 = 'US'
and a.vpart2 = 'DEF'
and a.vpart3 = '200';
{code}

Plan where the CAST is not pushed down.
{code}
STAGE PLANS:
  Stage: Stage-1
Map Reduce
  Map Operator Tree:
  TableScan
alias: testa
filterExpr: (part1) IN ('US', 'CA') (type: boolean)
Statistics: Num rows: 6 Data size: 13740 Basic stats: COMPLETE 
Column stats: NONE
Select Operator
  expressions: CAST( col1 AS decimal(38,18)) (type: 
decimal(38,18)), CAST( col2 AS decimal(38,18)) (type: decimal(38,18)), CAST( 
col3 AS decimal(38,18)) (type: decimal(38,18)), CAST( part1 AS CHAR(2)) (type: 
char(2)), CAST( part2 AS CHAR(3)) (type: char(3)), CAST( part3 AS CHAR(3)) 
(type: char(3))
  outputColumnNames: _col0, _col1, _col2, _col5, _col6, _col7
  Statistics: Num rows: 6 Data size: 13740 Basic stats: COMPLETE 
Column stats: NONE
  Filter Operator
predicate: ((_col5 = 'US') and (_col6 = 'DEF') and (_col7 = 
'200')) (type: boolean)
Statistics: Num rows: 1 Data size: 2290 Basic stats: COMPLETE 
Column stats: NONE
Reduce Output Operator
  key expressions: 'US' (type: char(2)), 'DEF' (type: char(3)), 
'200' (type: char(3))
  sort order: +++
  Map-reduce partition columns: 'US' (type: char(2)), 'DEF' 
(type: char(3)), '200' (type: char(3))
  Statistics: Num rows: 1 Data size: 2290 Basic stats: COMPLETE 
Column stats: NONE
  value expressions: _col0 (type: decimal(38,18

[jira] [Created] (HIVE-19889) Wrong results due to PPD of non deterministic functions with CBO

2018-06-13 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-19889:
--

 Summary: Wrong results due to PPD of non deterministic functions 
with CBO
 Key: HIVE-19889
 URL: https://issues.apache.org/jira/browse/HIVE-19889
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


The following query can give wrong results when CBO is on:
select * from (
select part1,randum123
from (SELECT *, cast(rand() as double) AS randum123 FROM testA where part1='CA' 
and part2 = 'ABC') a
where randum123 <= 0.5) s where s.randum123 > 0.25 limit 20;

The plan of the query is as follows:
STAGE PLANS:
  Stage: Stage-1
Map Reduce
  Map Operator Tree:
  TableScan
alias: testa
Statistics: Num rows: 2 Data size: 4580 Basic stats: COMPLETE 
Column stats: NONE
Filter Operator
  predicate: ((rand() <= 0.5D) and (rand() > 0.25D)) (type: boolean)
  Statistics: Num rows: 1 Data size: 2290 Basic stats: COMPLETE 
Column stats: NONE
  Select Operator
expressions: 'CA' (type: string), rand() (type: double)
outputColumnNames: _col0, _col1
Statistics: Num rows: 1 Data size: 2290 Basic stats: COMPLETE 
Column stats: NONE
Limit
  Number of rows: 20
  Statistics: Num rows: 1 Data size: 2290 Basic stats: COMPLETE 
Column stats: NONE
  File Output Operator
compressed: false
Statistics: Num rows: 1 Data size: 2290 Basic stats: 
COMPLETE Column stats: NONE
table:
input format: 
org.apache.hadoop.mapred.SequenceFileInputFormat
output format: 
org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
serde: 
org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe

  Stage: Stage-0
Fetch Operator
  limit: 20
  Processor Tree:
ListSink

The relevant part in the plan is the filter:
Filter Operator
  predicate: ((rand() <= 0.5D) and (rand() > 0.25D)) (type: boolean)

The predicates s.randum123 > 0.25 and s.randum123 > 0.25 were pushed down.  And 
randum123 was resolved to rand().  This is bad because it will result in 
invocation of rand() two times and rand() UDF is non-deterministic.  Both the 
rand calls can generate values that can satisfy the predicates independently, 
but not together, whereas the original intention of the query is to give 
results when rand falls between 0.25 and 0.5.

A sample result:
CA  0.9191984370369802
CA  0.397933021566812
where the condition was not satisfied.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-19662) Upgrade Avro to 1.8.2

2018-05-22 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-19662:
--

 Summary: Upgrade Avro to 1.8.2
 Key: HIVE-19662
 URL: https://issues.apache.org/jira/browse/HIVE-19662
 Project: Hive
  Issue Type: Improvement
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


Upgrade Avro to 1.8.2 and use the version from root pom where Avro is 
referenced.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-19421) Upgrade versions of Jetty and Jackson

2018-05-04 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-19421:
--

 Summary: Upgrade versions of Jetty and Jackson
 Key: HIVE-19421
 URL: https://issues.apache.org/jira/browse/HIVE-19421
 Project: Hive
  Issue Type: Improvement
Reporter: Janaki Lahorani


Move Jackson up to 2.9.5
Move Jetty up to 9.3.20.v20170531



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-19317) Handle schema evolution from int like types to decimal

2018-04-26 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-19317:
--

 Summary: Handle schema evolution from int like types to decimal
 Key: HIVE-19317
 URL: https://issues.apache.org/jira/browse/HIVE-19317
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


If int like type is changed to decimal on parquet data, select results in 
errors.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-19228) Remove commons-httpclient 3.x usage

2018-04-17 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-19228:
--

 Summary: Remove commons-httpclient 3.x usage
 Key: HIVE-19228
 URL: https://issues.apache.org/jira/browse/HIVE-19228
 Project: Hive
  Issue Type: Improvement
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


Commons-httpclient is not supported well anymore.  Remove dependency and move 
to Apache HTTP client.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-19182) Code clean up of HiveMetaStoreChecker.java

2018-04-11 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-19182:
--

 Summary: Code clean up of HiveMetaStoreChecker.java
 Key: HIVE-19182
 URL: https://issues.apache.org/jira/browse/HIVE-19182
 Project: Hive
  Issue Type: Improvement
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


HiveMetaStoreChecker.java has code like:
result.getTablesNotOnFs().add(table.getTableName());
result.getPartitionsNotOnFs().add(pr);
result.getPartitionsNotInMs().add(pr);

It is not a good practice to change internal states like this.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-19174) Enhance HMS to support drop of partitions with partition names as input

2018-04-11 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-19174:
--

 Summary: Enhance HMS to support drop of partitions with partition 
names as input
 Key: HIVE-19174
 URL: https://issues.apache.org/jira/browse/HIVE-19174
 Project: Hive
  Issue Type: Improvement
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-19160) Insert data into decimal column fails with Null Pointer Exception

2018-04-10 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-19160:
--

 Summary: Insert data into decimal column fails with Null Pointer 
Exception
 Key: HIVE-19160
 URL: https://issues.apache.org/jira/browse/HIVE-19160
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


drop table if exists testDecimal;
create table testDecimal
(cIdTINYINT,
 cBigIntDECIMAL,
 cInt   DECIMAL,
 cSmallInt  DECIMAL,
 cTinyint   DECIMAL);

insert into testDecimal values
(1,
 1234567890123456789,
 1234567890,
 12345,
 123);

insert into testDecimal values
(2,
 1,
 2,
 3,
 4);

The second insert fails with null pointer exception.

2018-04-10T15:23:23,080 ERROR [5dba40ef-be49-4187-8a72-afbb46c41ecc main] 
metastore.RetryingHMSHandler: java.lang.NullPointerException
at 
org.apache.hadoop.hive.metastore.api.Decimal.compareTo(Decimal.java:318)
at 
org.apache.hadoop.hive.metastore.columnstats.merge.DecimalColumnStatsMerger.merge(DecimalColumnStatsMerger.java:35)
at 
org.apache.hadoop.hive.metastore.utils.MetaStoreUtils.mergeColStats(MetaStoreUtils.java:1040)
at 
org.apache.hadoop.hive.metastore.HiveMetaStore$HMSHandler.set_aggr_stats_for(HiveMetaStore.java:7166)
at sun.reflect.GeneratedMethodAccessor43.invoke(Unknown Source)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at 
org.apache.hadoop.hive.metastore.RetryingHMSHandler.invokeInternal(RetryingHMSHandler.java:147)
at 
org.apache.hadoop.hive.metastore.RetryingHMSHandler.invoke(RetryingHMSHandler.java:108)
at com.sun.proxy.$Proxy40.set_aggr_stats_for(Unknown Source)
at 
org.apache.hadoop.hive.metastore.HiveMetaStoreClient.setPartitionColumnStatistics(HiveMetaStoreClient.java:1870)
at 
org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient.setPartitionColumnStatistics(SessionHiveMetaStoreClient.java:395)
at sun.reflect.GeneratedMethodAccessor42.invoke(Unknown Source)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at 
org.apache.hadoop.hive.metastore.RetryingMetaStoreClient.invoke(RetryingMetaStoreClient.java:212)
at com.sun.proxy.$Proxy41.setPartitionColumnStatistics(Unknown Source)
at 
org.apache.hadoop.hive.ql.metadata.Hive.setPartitionColumnStatistics(Hive.java:4171)
at 
org.apache.hadoop.hive.ql.stats.ColStatsProcessor.persistColumnStats(ColStatsProcessor.java:179)
at 
org.apache.hadoop.hive.ql.stats.ColStatsProcessor.process(ColStatsProcessor.java:83)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-18718) Integer like types throws error when there is a match

2018-02-14 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-18718:
--

 Summary: Integer like types throws error when there is a match
 Key: HIVE-18718
 URL: https://issues.apache.org/jira/browse/HIVE-18718
 Project: Hive
  Issue Type: Improvement
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


If a value is saved with long type and read as int type it results in

FAILED: Execution Error, return code 2 from 
org.apache.hadoop.hive.ql.exec.mr.MapRedTask



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-18586) Upgrade Derby to 10.14.1.0

2018-01-30 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-18586:
--

 Summary: Upgrade Derby to 10.14.1.0
 Key: HIVE-18586
 URL: https://issues.apache.org/jira/browse/HIVE-18586
 Project: Hive
  Issue Type: Improvement
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-18499) Amend point lookup tests to check for data

2018-01-19 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-18499:
--

 Summary: Amend point lookup tests to check for data
 Key: HIVE-18499
 URL: https://issues.apache.org/jira/browse/HIVE-18499
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-18472) Beeline gives log4j warnings

2018-01-17 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-18472:
--

 Summary: Beeline gives log4j warnings
 Key: HIVE-18472
 URL: https://issues.apache.org/jira/browse/HIVE-18472
 Project: Hive
  Issue Type: Bug
Affects Versions: 3.0.0
Reporter: Janaki Lahorani


Starting Beeline gives the following warnings multiple times:

SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in 
[jar:file:/opt/cloudera/parcels/CDH-6.x-1.cdh6.x.p0.215261/jars/log4j-slf4j-impl-2.8.2.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in 
[jar:file:/opt/cloudera/parcels/CDH-6.x-1.cdh6.x.p0.215261/jars/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See [http://www.slf4j.org/codes.html#multiple_bindings] for an 
explanation.
SLF4J: Actual binding is of type [org.apache.logging.slf4j.Log4jLoggerFactory]
ERROR StatusLogger No log4j2 configuration file found. Using default 
configuration: logging only errors to the console. Set system property 
'org.apache.logging.log4j.simplelog.StatusLogger.level' to TRACE to show Log4j2 
internal initialization logging.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-18471) Beeline gives log4j warnings

2018-01-17 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-18471:
--

 Summary: Beeline gives log4j warnings
 Key: HIVE-18471
 URL: https://issues.apache.org/jira/browse/HIVE-18471
 Project: Hive
  Issue Type: Bug
Affects Versions: 3.0.0
Reporter: Janaki Lahorani


Starting Beeline gives the following warnings:

 

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-18393) Error returned when some other type is read as string from parquet tables

2018-01-08 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-18393:
--

 Summary: Error returned when some other type is read as string 
from parquet tables
 Key: HIVE-18393
 URL: https://issues.apache.org/jira/browse/HIVE-18393
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


TimeStamp, Decimal, Double, Float, BigInt, Int, SmallInt, Tinyint and Boolean 
when read as String, Varchar or Char should return the correct data.  Now this 
results in error for parquet tables.

Test Case:
drop table if exists testAltCol;
create table testAltCol
(cIdTINYINT,
 cTimeStamp TIMESTAMP,
 cDecimal   DECIMAL(38,18),
 cDoubleDOUBLE,
 cFloat FLOAT,
 cBigIntBIGINT,
 cInt   INT,
 cSmallInt  SMALLINT,
 cTinyint   TINYINT,
 cBoolean   BOOLEAN);

insert into testAltCol values
(1,
 '2017-11-07 09:02:49.9',
 12345678901234567890.123456789012345678,
 1.79e308,
 3.4e38,
 1234567890123456789,
 1234567890,
 12345,
 123,
 TRUE);

insert into testAltCol values
(2,
 '1400-01-01 01:01:01.1',
 1.1,
 2.2,
 3.3,
 1,
 2,
 3,
 4,
 FALSE);

insert into testAltCol values
(3,
 '1400-01-01 01:01:01.1',
 10.1,
 20.2,
 30.3,
 1234567890123456789,
 1234567890,
 12345,
 123,
 TRUE);

select cId, cTimeStamp from testAltCol order by cId;
select cId, cDecimal, cDouble, cFloat from testAltCol order by cId;
select cId, cBigInt, cInt, cSmallInt, cTinyint from testAltCol order by cId;
select cId, cBoolean from testAltCol order by cId;

drop table if exists testAltColP;
create table testAltColP stored as parquet as select * from testAltCol;

select cId, cTimeStamp from testAltColP order by cId;
select cId, cDecimal, cDouble, cFloat from testAltColP order by cId;
select cId, cBigInt, cInt, cSmallInt, cTinyint from testAltColP order by cId;
select cId, cBoolean from testAltColP order by cId;

alter table testAltColP replace columns
(cIdTINYINT,
 cTimeStamp STRING,
 cDecimal   STRING,
 cDoubleSTRING,
 cFloat STRING,
 cBigIntSTRING,
 cInt   STRING,
 cSmallInt  STRING,
 cTinyint   STRING,
 cBoolean   STRING);

select cId, cTimeStamp from testAltColP order by cId;
select cId, cDecimal, cDouble, cFloat from testAltColP order by cId;
select cId, cBigInt, cInt, cSmallInt, cTinyint from testAltColP order by cId;
select cId, cBoolean from testAltColP order by cId;

alter table testAltColP replace columns
(cIdTINYINT,
 cTimeStamp VARCHAR(100),
 cDecimal   VARCHAR(100),
 cDoubleVARCHAR(100),
 cFloat VARCHAR(100),
 cBigIntVARCHAR(100),
 cInt   VARCHAR(100),
 cSmallInt  VARCHAR(100),
 cTinyint   VARCHAR(100),
 cBoolean   VARCHAR(100));

select cId, cTimeStamp from testAltColP order by cId;
select cId, cDecimal, cDouble, cFloat from testAltColP order by cId;
select cId, cBigInt, cInt, cSmallInt, cTinyint from testAltColP order by cId;
select cId, cBoolean from testAltColP order by cId;

alter table testAltColP replace columns
(cIdTINYINT,
 cTimeStamp CHAR(100),
 cDecimal   CHAR(100),
 cDoubleCHAR(100),
 cFloat CHAR(100),
 cBigIntCHAR(100),
 cInt   CHAR(100),
 cSmallInt  CHAR(100),
 cTinyint   CHAR(100),
 cBoolean   CHAR(100));

select cId, cTimeStamp from testAltColP order by cId;
select cId, cDecimal, cDouble, cFloat from testAltColP order by cId;
select cId, cBigInt, cInt, cSmallInt, cTinyint from testAltColP order by cId;
select cId, cBoolean from testAltColP order by cId;
drop table if exists testAltColP;

Error:
FAILED: Execution Error, return code 2 from 
org.apache.hadoop.hive.ql.exec.mr.MapRedTask

Excerpt for log:
2018-01-05T15:54:05,756 ERROR [LocalJobRunner Map Task Executor #0] 
mr.ExecMapper: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime 
Error while processing row [Error getting row data with exception 
java.lang.UnsupportedOperationException: Cannot inspect 
org.apache.hadoop.hive.serde2.io.TimestampWritable
at 
org.apache.hadoop.hive.ql.io.parquet.serde.primitive.ParquetStringInspector.getPrimitiveJavaObject(ParquetStringInspector.java:77)



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-18311) Enable smb_mapjoin_8.q for cli driver

2017-12-19 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-18311:
--

 Summary: Enable smb_mapjoin_8.q for cli driver
 Key: HIVE-18311
 URL: https://issues.apache.org/jira/browse/HIVE-18311
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani






--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-18248) Clean up parameters

2017-12-07 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-18248:
--

 Summary: Clean up parameters
 Key: HIVE-18248
 URL: https://issues.apache.org/jira/browse/HIVE-18248
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


Clean up of parameters that need not change at run time.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-18166) Result of hive.query.string is encoded.

2017-11-28 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-18166:
--

 Summary: Result of hive.query.string is encoded.
 Key: HIVE-18166
 URL: https://issues.apache.org/jira/browse/HIVE-18166
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


set hive.query.string returns encoded string.
hive.query.string=%0A%0Aselect+*+from+t1




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-18147) Tests can fail with java.net.BindException: Address already in use

2017-11-25 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-18147:
--

 Summary: Tests can fail with java.net.BindException: Address 
already in use
 Key: HIVE-18147
 URL: https://issues.apache.org/jira/browse/HIVE-18147
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


Between the time a new port is picked up to start HMS or HS2, and the time when 
HMS or HS2 is started, the port can get used by some other service resulting in 
test failure.  



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-17942) HiveAlterHandler not using conf from threadlocal

2017-10-30 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-17942:
--

 Summary: HiveAlterHandler not using conf from threadlocal
 Key: HIVE-17942
 URL: https://issues.apache.org/jira/browse/HIVE-17942
 Project: Hive
  Issue Type: Bug
Affects Versions: 2.1.1
Reporter: Janaki Lahorani


When HiveAlterHandler looks for conf, it is not getting the one from thread 
local.  So, local changes are not visible.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-17832) Allow hive.metastore.disallow.incompatible.col.type.changes to be changed within a session

2017-10-18 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-17832:
--

 Summary: Allow 
hive.metastore.disallow.incompatible.col.type.changes to be changed within a 
session
 Key: HIVE-17832
 URL: https://issues.apache.org/jira/browse/HIVE-17832
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani






--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-17764) alter view fails when hive.metastore.disallow.incompatible.col.type.changes set to true

2017-10-10 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-17764:
--

 Summary: alter view fails when 
hive.metastore.disallow.incompatible.col.type.changes set to true
 Key: HIVE-17764
 URL: https://issues.apache.org/jira/browse/HIVE-17764
 Project: Hive
  Issue Type: Bug
Affects Versions: 2.1.1
Reporter: Janaki Lahorani
Assignee: Janaki Lahorani


A view is a virtual structure that derives the type information from the 
table(s) the view is based on.If the view definition is altered, the 
corresponding column types should be updated.  The relevance of the change 
depending on the previous structure of the view is irrelevant.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-17398) Support Costing/Heuristics to enable or disable DPP

2017-08-28 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-17398:
--

 Summary: Support Costing/Heuristics to enable or disable DPP
 Key: HIVE-17398
 URL: https://issues.apache.org/jira/browse/HIVE-17398
 Project: Hive
  Issue Type: Sub-task
Reporter: Janaki Lahorani






--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-17396) Support DPP with map joins where the source and target belong in the same stage

2017-08-28 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-17396:
--

 Summary: Support DPP with map joins where the source and target 
belong in the same stage
 Key: HIVE-17396
 URL: https://issues.apache.org/jira/browse/HIVE-17396
 Project: Hive
  Issue Type: Sub-task
Reporter: Janaki Lahorani






--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-17200) TestMiniSparkOnYarnCliDriver: [spark_vectorized_dynamic_partition_pruning]

2017-07-28 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-17200:
--

 Summary: TestMiniSparkOnYarnCliDriver: 
[spark_vectorized_dynamic_partition_pruning]
 Key: HIVE-17200
 URL: https://issues.apache.org/jira/browse/HIVE-17200
 Project: Hive
  Issue Type: Sub-task
Reporter: Janaki Lahorani


Error:
Client execution failed with error code = 3 running "
select count(*) from srcpart join (select ds as ds, ds as `date` from srcpart 
group by ds) s on (srcpart.ds = s.ds) where s.`date` = '2008-04-08'" 
fname=spark_vectorized_dynamic_partition_pruning.q 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-17199) TestHiveMetaStoreStatsMerge: testStatsMerge

2017-07-28 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-17199:
--

 Summary: TestHiveMetaStoreStatsMerge: testStatsMerge
 Key: HIVE-17199
 URL: https://issues.apache.org/jira/browse/HIVE-17199
 Project: Hive
  Issue Type: Sub-task
Reporter: Janaki Lahorani


Stack:
org.apache.thrift.transport.TTransportException
at 
org.apache.thrift.transport.TIOStreamTransport.read(TIOStreamTransport.java:132)
at org.apache.thrift.transport.TTransport.readAll(TTransport.java:86)
at 
org.apache.thrift.protocol.TBinaryProtocol.readAll(TBinaryProtocol.java:429)
at 
org.apache.thrift.protocol.TBinaryProtocol.readI32(TBinaryProtocol.java:318)
at 
org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol.java:219)
at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:77)
at 
org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.recv_set_aggr_stats_for(ThriftHiveMetastore.java:3724)
at 
org.apache.hadoop.hive.metastore.api.ThriftHiveMetastore$Client.set_aggr_stats_for(ThriftHiveMetastore.java:3711)
at 
org.apache.hadoop.hive.metastore.HiveMetaStoreClient.setPartitionColumnStatistics(HiveMetaStoreClient.java:1704)
at 
org.apache.hadoop.hive.metastore.TestHiveMetaStoreStatsMerge.testStatsMerge(TestHiveMetaStoreStatsMerge.java:187)



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-17198) Flaky test: TestBeeLineDriver [smb_mapjoin_7]

2017-07-28 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-17198:
--

 Summary: Flaky test: TestBeeLineDriver [smb_mapjoin_7]
 Key: HIVE-17198
 URL: https://issues.apache.org/jira/browse/HIVE-17198
 Project: Hive
  Issue Type: Sub-task
Reporter: Janaki Lahorani


Error:
Exception running or analyzing the results of the query file: 
org.apache.hive.beeline.QFile@4f7b68ad



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-17197) Flaky test: TestMiniSparkOnYarnCliDriver

2017-07-28 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-17197:
--

 Summary: Flaky test: TestMiniSparkOnYarnCliDriver
 Key: HIVE-17197
 URL: https://issues.apache.org/jira/browse/HIVE-17197
 Project: Hive
  Issue Type: Sub-task
Reporter: Janaki Lahorani


Error:
Failed during createSources processLine with code=3




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-17057) Flaky test: TestHCatClient.testTableSchemaPropagation,testPartitionRegistrationWithCustomSchema,testPartitionSpecRegistrationWithCustomSchema

2017-07-06 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-17057:
--

 Summary: Flaky test: 
TestHCatClient.testTableSchemaPropagation,testPartitionRegistrationWithCustomSchema,testPartitionSpecRegistrationWithCustomSchema
 Key: HIVE-17057
 URL: https://issues.apache.org/jira/browse/HIVE-17057
 Project: Hive
  Issue Type: Sub-task
Reporter: Janaki Lahorani






--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-17056) TestMiniLlapCliDriver.testCliDriver[orc_ppd_basic]

2017-07-06 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-17056:
--

 Summary: TestMiniLlapCliDriver.testCliDriver[orc_ppd_basic]
 Key: HIVE-17056
 URL: https://issues.apache.org/jira/browse/HIVE-17056
 Project: Hive
  Issue Type: Sub-task
Reporter: Janaki Lahorani


Client Execution succeeded but contained differences (error code = 1) after 
executing orc_ppd_basic.q 
1287a1288
> Stage-1 LLAP IO COUNTERS:
1317a1319
> Stage-1 LLAP IO COUNTERS:
1338a1341
> Stage-1 LLAP IO COUNTERS:
1342a1346
> Stage-1 LLAP IO COUNTERS:



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-17055) Flaky test: TestMiniLlapCliDriver.testCliDriver[llap_smb]

2017-07-06 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-17055:
--

 Summary: Flaky test: TestMiniLlapCliDriver.testCliDriver[llap_smb]
 Key: HIVE-17055
 URL: https://issues.apache.org/jira/browse/HIVE-17055
 Project: Hive
  Issue Type: Sub-task
Reporter: Janaki Lahorani


Client Execution succeeded but contained differences (error code = 1) after 
executing llap_smb.q 
324,325c324,325
< 2000 9 52
< 2001 0 139630
---
> 2001 4 139630
> 2001 6 52




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-17046) Flaky test: TestCliDriver[ppd_windowing2]

2017-07-05 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-17046:
--

 Summary: Flaky test: TestCliDriver[ppd_windowing2]
 Key: HIVE-17046
 URL: https://issues.apache.org/jira/browse/HIVE-17046
 Project: Hive
  Issue Type: Sub-task
Reporter: Janaki Lahorani






--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-16960) Hive throws an ugly error exception when HDFS sticky bit is set

2017-06-26 Thread Janaki Lahorani (JIRA)
Janaki Lahorani created HIVE-16960:
--

 Summary: Hive throws an ugly error exception when HDFS sticky bit 
is set
 Key: HIVE-16960
 URL: https://issues.apache.org/jira/browse/HIVE-16960
 Project: Hive
  Issue Type: Bug
Reporter: Janaki Lahorani
Priority: Critical


When calling LOAD DATA INPATH ... OVERWRITE INTO TABLE ... from a Hive user 
other than the HDFS file owner, and the HDFS sticky bit is set, then Hive will 
throw an error exception message that the file cannot be moved due to 
permission issues.
Caused by: org.apache.hadoop.security.AccessControlException: Permission denied 
by sticky bit setting: user=hive, 
inode=sasdata-2016-04-20-17-13-43-630-e-1.dlv.bk
The permission denied is expected, but the error message does not make sense to 
users + the stack trace displayed is huge. We should display a better error 
message to users, and maybe provide with help information about how to fix it.




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)