Re: Review Request 59808: Enhance HiveFilterSetOpTransposeRule to remove union branches

2017-06-12 Thread Ashutosh Chauhan

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/59808/#review177678
---




ql/src/java/org/apache/hadoop/hive/ql/optimizer/calcite/rules/HiveFilterSetOpTransposeRule.java
Line 28 (original), 45 (patched)


Comment:
This rule rewrites Fil
|
  Union
  /   \
 Op1   Op2
 
 to
  Union
/\
FIL
| |
  Op1  Op2
  
 
 It additionally can remove branch(es) of filter if its able to determine 
that they are going to generate empty result set.



ql/src/java/org/apache/hadoop/hive/ql/optimizer/calcite/rules/HiveFilterSetOpTransposeRule.java
Lines 73 (patched)


Good to add reason for why its overridden?
which is to do branch elimination



ql/src/java/org/apache/hadoop/hive/ql/optimizer/calcite/rules/HiveFilterSetOpTransposeRule.java
Lines 102 (patched)


It might be better to call simplify(RexNode) so as not to miss 
simplificaiton on operands other than And.



ql/src/java/org/apache/hadoop/hive/ql/optimizer/calcite/rules/HiveFilterSetOpTransposeRule.java
Lines 124 (patched)


Comment: We assume alwaysFalse filter will get pushed down to TS so this 
branch so it won't read any data.



ql/src/java/org/apache/hadoop/hive/ql/optimizer/calcite/rules/HiveUnionMergeRule.java
Lines 42 (patched)


Should it extend UnionMergeRule instead and pass on HiveRelBuilder? If 
UnionMergeRule doesnt accept RelBuilder, please create a calcite jira.



ql/src/test/queries/clientpositive/filter_union.q
Lines 1 (patched)


Instead of CliDriver, can you add this to LlapCliDriver?



ql/src/test/queries/clientpositive/filter_union.q
Lines 2 (patched)


Also, add hive.optimize.metadataonly=true; That should quick in some of 
these queries.



ql/src/test/results/clientpositive/perf/query4.q.out
Lines 240-275 (original)


Pretty cool !


- Ashutosh Chauhan


On June 10, 2017, 9:57 p.m., pengcheng xiong wrote:
> 
> ---
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/59808/
> ---
> 
> (Updated June 10, 2017, 9:57 p.m.)
> 
> 
> Review request for hive and Ashutosh Chauhan.
> 
> 
> Repository: hive-git
> 
> 
> Description
> ---
> 
> HIVE-16797
> 
> 
> Diffs
> -
> 
>   
> ql/src/java/org/apache/hadoop/hive/ql/optimizer/calcite/rules/HiveFilterSetOpTransposeRule.java
>  3ee29e0482 
>   
> ql/src/java/org/apache/hadoop/hive/ql/optimizer/calcite/rules/HiveUnionMergeRule.java
>  PRE-CREATION 
>   ql/src/java/org/apache/hadoop/hive/ql/parse/CalcitePlanner.java 348331e052 
>   ql/src/test/queries/clientpositive/filter_union.q PRE-CREATION 
>   ql/src/test/queries/clientpositive/perf/query11.q PRE-CREATION 
>   ql/src/test/results/clientpositive/filter_aggr.q.out db7dcaed3f 
>   ql/src/test/results/clientpositive/filter_union.q.out PRE-CREATION 
>   ql/src/test/results/clientpositive/llap/explainuser_1.q.out 8b04bc9261 
>   ql/src/test/results/clientpositive/llap/explainuser_2.q.out e3f70b097f 
>   ql/src/test/results/clientpositive/llap/orc_ppd_basic.q.out 5382c42412 
>   ql/src/test/results/clientpositive/llap/tez_union_multiinsert.q.out 
> 14e8e4389f 
>   ql/src/test/results/clientpositive/perf/query11.q.out PRE-CREATION 
>   ql/src/test/results/clientpositive/perf/query14.q.out 048a17f92f 
>   ql/src/test/results/clientpositive/perf/query23.q.out 1fd8cb4f25 
>   ql/src/test/results/clientpositive/perf/query33.q.out c1a5fa28ed 
>   ql/src/test/results/clientpositive/perf/query4.q.out 1b2048649a 
>   ql/src/test/results/clientpositive/perf/query5.q.out a3f2d58fec 
>   ql/src/test/results/clientpositive/perf/query56.q.out 4ec7201fa7 
>   ql/src/test/results/clientpositive/perf/query60.q.out 12d8cdd9b4 
>   ql/src/test/results/clientpositive/perf/query71.q.out 44658081b5 
>   ql/src/test/results/clientpositive/perf/query74.q.out bb4a71e6ce 
>   ql/src/test/results/clientpositive/perf/query76.q.out dcd5004166 
>   ql/src/test/results/clientpositive/perf/query77.q.out d46ba6b13c 
>   ql/src/test/results/clientpositive/perf/query80.q.out 3cf41f3fed 
>   ql/src/test/results/clientpositive/spark/union30.q.out 12eda1d3b6 
>   

[jira] [Created] (HIVE-16887) Parquet rawDataSize Under Reported

2017-06-12 Thread BELUGA BEHR (JIRA)
BELUGA BEHR created HIVE-16887:
--

 Summary: Parquet rawDataSize Under Reported
 Key: HIVE-16887
 URL: https://issues.apache.org/jira/browse/HIVE-16887
 Project: Hive
  Issue Type: Improvement
  Components: HiveServer2, Query Planning
Affects Versions: 2.1.1
Reporter: BELUGA BEHR
Priority: Minor


{code:sql}
CREATE TABLE `test_stats`(
  `a` int, 
  `b` int, 
  `c` string, 
  `d` bigint) stored as parquet;


INSERT INTO test_stats VALUES (1,31,"This is a test", 4567890);
ANALYZE TABLE test_stats COMPUTE STATISTICS FOR COLUMNS;
EXPLAIN SELECT * FROM test_stats WHERE c IS NULL;
{code}

{code}
Explain
STAGE DEPENDENCIES:
  Stage-1 is a root stage
  Stage-0 depends on stages: Stage-1

STAGE PLANS:
  Stage: Stage-1
Map Reduce
  Map Operator Tree:
  TableScan
alias: test_stats
filterExpr: c is null (type: boolean)
Statistics: Num rows: 1 Data size: 4 Basic stats: COMPLETE Column 
stats: COMPLETE
Filter Operator
  predicate: c is null (type: boolean)
  Statistics: Num rows: 1 Data size: 114 Basic stats: COMPLETE 
Column stats: COMPLETE
  Select Operator
expressions: a (type: int), b (type: int), null (type: string), 
d (type: bigint)
outputColumnNames: _col0, _col1, _col2, _col3
Statistics: Num rows: 1 Data size: 114 Basic stats: COMPLETE 
Column stats: COMPLETE
File Output Operator
  compressed: false
  Statistics: Num rows: 1 Data size: 114 Basic stats: COMPLETE 
Column stats: COMPLETE
  table:
  input format: org.apache.hadoop.mapred.TextInputFormat
  output format: 
org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
  serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe

  Stage: Stage-0
Fetch Operator
  limit: -1
  Processor Tree:
ListSink
{code}

For Parquet tables, when Hive looks at Table stats, it sees one size for the 
table, when it looks at column stats, it sees another:

Statistics: Num rows: 1 Data size: 4 Basic stats: COMPLETE Column stats: 
COMPLETE
Statistics: Num rows: 1 Data size: 114 Basic stats: COMPLETE Column stats: 
COMPLETE

The row stats are much more accurate though I would expect them to be the same.

Total Table Size = (Num Rows) * (Average Row Size)

The rawDataSize is reported as 4 bytes. This is way under reporting the size of 
the data. Perhaps it is 4 bytes in Parquet format, but when this data loads 
into a Spark HashTable Sink or a Cache, this one row is going to be at least 
(4+4+8+14) 30 bytes.

In cases where we set {{hive.auto.convert.join.noconditionaltask.size}} to be 
10mb and it is based off of the rawDataSize, in Spark when stats are enabled, 
we will require 114 bytes instead of the 4 bytes as reported in table stats 
(28.5x).  You can imagine a case where the rawDataSize is reported as 10MB but 
the real amount of required memory to cache is  285MB!  This may break an 
executor with limited memory or where multiple tables are being cached.

The Parquet SerDe should be reporting the total size of the data (uncompressed) 
so that we know exactly how much data we will be loading into memory, much like 
they do it here:

https://github.com/apache/hive/blob/6b6a00ffb0dae651ef407a99bab00d5e74f0d6aa/ql/src/java/org/apache/hadoop/hive/ql/stats/StatsUtils.java#L544



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (HIVE-16886) HMS log notifications may have duplicated event IDs if multiple HMS are running concurrently

2017-06-12 Thread JIRA
Sergio Peña created HIVE-16886:
--

 Summary: HMS log notifications may have duplicated event IDs if 
multiple HMS are running concurrently
 Key: HIVE-16886
 URL: https://issues.apache.org/jira/browse/HIVE-16886
 Project: Hive
  Issue Type: Bug
  Components: Hive, Metastore
Reporter: Sergio Peña


When running multiple Hive Metastore servers and DB notifications are enabled, 
I could see that notifications can be persisted with a duplicated event ID. 

This does not happen when running multiple threads in a single HMS node due to 
the locking acquired on the DbNotificationsLog class, but multiple HMS could 
cause conflicts.

The issue is in the ObjectStore#addNotificationEvent() method. The event ID 
fetched from the datastore is used for the new notification, incremented in the 
server itself, then persisted or updated back to the datastore. If 2 servers 
read the same ID, then these 2 servers write a new notification with the same 
ID.

The event ID is not unique nor a primary key.

Here's a test case using the TestObjectStore class that confirms this issue:

{noformat}
@Test
  public void testConcurrentAddNotifications() throws ExecutionException, 
InterruptedException {
final int NUM_THREADS = 2;
CountDownLatch countIn = new CountDownLatch(NUM_THREADS);
CountDownLatch countOut = new CountDownLatch(1);

HiveConf conf = new HiveConf();
conf.setVar(HiveConf.ConfVars.METASTORE_EXPRESSION_PROXY_CLASS, 
MockPartitionExpressionProxy.class.getName());

ExecutorService executorService = Executors.newFixedThreadPool(NUM_THREADS);
FutureTask tasks[] = new FutureTask[NUM_THREADS];
for (int i=0; i

[GitHub] hive pull request #194: HIVE-16785: Ensure replication actions are idempoten...

2017-06-12 Thread sankarh
GitHub user sankarh opened a pull request:

https://github.com/apache/hive/pull/194

HIVE-16785: Ensure replication actions are idempotent if any series of 
events are applied again.

Idempotent behaviour for all events during incremental load.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/sankarh/hive HIVE-16785

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/hive/pull/194.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #194


commit 741a824e253064854487983aaa9a8d31300384dc
Author: Sankar Hariappan 
Date:   2017-06-05T05:32:30Z

HIVE-16785: Ensure replication actions are idempotent if any series of 
events are applied again.




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Created] (HIVE-16885) Non-equi Joins: Filter clauses should be pushed into the ON clause

2017-06-12 Thread Gopal V (JIRA)
Gopal V created HIVE-16885:
--

 Summary: Non-equi Joins: Filter clauses should be pushed into the 
ON clause
 Key: HIVE-16885
 URL: https://issues.apache.org/jira/browse/HIVE-16885
 Project: Hive
  Issue Type: Bug
  Components: Logical Optimizer
Affects Versions: 3.0.0
Reporter: Gopal V
Assignee: Jesus Camacho Rodriguez


FIL_24 -> MAPJOIN_23

{code}
hive> explain  select * from part where p_size > (select max(p_size) from part 
group by p_type);
Warning: Map Join MAPJOIN[14][bigTable=?] in task 'Map 1' is a cross product
OK
Plan optimized by CBO.

Vertex dependency in root stage
Map 1 <- Reducer 3 (BROADCAST_EDGE)
Reducer 3 <- Map 2 (SIMPLE_EDGE)

Stage-0
  Fetch Operator
limit:-1
Stage-1
  Map 1 vectorized, llap
  File Output Operator [FS_26]
Select Operator [SEL_25] (rows=110 width=621)
  
Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7","_col8"]
  Filter Operator [FIL_24] (rows=110 width=625)
predicate:(_col5 > _col9)
Map Join Operator [MAPJOIN_23] (rows=330 width=625)
  
Conds:(Inner),Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7","_col8","_col9"]
<-Reducer 3 [BROADCAST_EDGE] vectorized, llap
  BROADCAST [RS_21]
Select Operator [SEL_20] (rows=165 width=4)
  Output:["_col0"]
  Group By Operator [GBY_19] (rows=165 width=109)

Output:["_col0","_col1"],aggregations:["max(VALUE._col0)"],keys:KEY._col0
  <-Map 2 [SIMPLE_EDGE] vectorized, llap
SHUFFLE [RS_18]
  PartitionCols:_col0
  Group By Operator [GBY_17] (rows=14190 width=109)

Output:["_col0","_col1"],aggregations:["max(p_size)"],keys:p_type
Select Operator [SEL_16] (rows=2 width=109)
  Output:["p_type","p_size"]
  TableScan [TS_2] (rows=2 width=109)

tpch_flat_orc_1000@part,part,Tbl:COMPLETE,Col:COMPLETE,Output:["p_type","p_size"]
<-Select Operator [SEL_22] (rows=2 width=621)

Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7","_col8"]
TableScan [TS_0] (rows=2 width=621)
  
tpch_flat_orc_1000@part,part,Tbl:COMPLETE,Col:COMPLETE,Output:["p_partkey","p_name","p_mfgr","p_brand","p_type","p_size","p_container","p_retailprice","p_comment"]
{code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


Review Request 60006: HIVE-14746: Remove branch and repositories from profiles by sending them from ptest-client

2017-06-12 Thread Barna Zsombor Klara

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/60006/
---

Review request for hive, Sergio Pena and Siddharth Seth.


Bugs: HIVE-14746
https://issues.apache.org/jira/browse/HIVE-14746


Repository: hive-git


Description
---

HIVE-14746: Remove branch and repositories from profiles by sending them from 
ptest-client


Diffs
-

  
testutils/ptest2/src/main/java/org/apache/hive/ptest/api/client/PTestClient.java
 8e2604d372ac29b94445b269f08423b058308efe 
  
testutils/ptest2/src/main/java/org/apache/hive/ptest/api/request/TestStartRequest.java
 8deed52ae0307d4fc075654a4d75e6cb09a5d9db 
  
testutils/ptest2/src/main/java/org/apache/hive/ptest/api/server/TestExecutor.java
 b2c61f03c5bf5f170894141848c89fc26129115a 
  testutils/ptest2/src/main/java/org/apache/hive/ptest/execution/PTest.java 
1cdfdb309acd8282e593abd7ed10c87721926c60 
  
testutils/ptest2/src/main/java/org/apache/hive/ptest/execution/context/CloudExecutionContextProvider.java
 8b82497bdaf43694e0e1552e125b5ffdce40f56c 
  
testutils/ptest2/src/main/java/org/apache/hive/ptest/execution/context/ExecutionContext.java
 b09de1d4d930cf2d4d26b500f3457cea3fffa9ce 
  
testutils/ptest2/src/main/java/org/apache/hive/ptest/execution/context/FixedExecutionContextProvider.java
 f7b50d6a61962d2727b5181215be9de2e64b05b7 
  
testutils/ptest2/src/main/java/org/apache/hive/ptest/execution/context/WorkingDirWrapper.java
 PRE-CREATION 
  
testutils/ptest2/src/test/java/org/apache/hive/ptest/api/server/TestTestExecutor.java
 a4a789b579305d9ed573d8c1fd0b6ce75787d50f 
  
testutils/ptest2/src/test/java/org/apache/hive/ptest/execution/conf/TestTestConfiguration.java
 848faf27af1ed8945d7013b6562bab544605e4bc 


Diff: https://reviews.apache.org/r/60006/diff/1/


Testing
---

Manually tested the PTestClient with and without the branch argument.
Updated and ran the unit tests.


Thanks,

Barna Zsombor Klara



[jira] [Created] (HIVE-16884) Replace the deprecated HBaseInterface with Table

2017-06-12 Thread Aihua Xu (JIRA)
Aihua Xu created HIVE-16884:
---

 Summary: Replace the deprecated HBaseInterface with Table  
 Key: HIVE-16884
 URL: https://issues.apache.org/jira/browse/HIVE-16884
 Project: Hive
  Issue Type: Improvement
  Components: HBase Handler
Affects Versions: 3.0.0
Reporter: Aihua Xu
Assignee: Aihua Xu


HBaseInterface has been deprecated and will get removed in HBase 2.0 by 
HBASE-13395. Replace it with the new one 
{{org.apache.hadoop.hbase.client.Table}}.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


Review Request 59998: HIVE-16867

2017-06-12 Thread Jesús Camacho Rodríguez

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/59998/
---

Review request for hive and Ashutosh Chauhan.


Bugs: HIVE-16867
https://issues.apache.org/jira/browse/HIVE-16867


Repository: hive-git


Description
---

HIVE-16867


Diffs
-

  ql/src/java/org/apache/hadoop/hive/ql/exec/OperatorUtils.java 
d5006bd52db42e3bb2b650e099d656746834b497 
  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/DynamicPartitionPruningOptimization.java
 a9099b868001f4a917b91540fb305db25ccac664 
  
ql/src/java/org/apache/hadoop/hive/ql/optimizer/OperatorComparatorFactory.java 
1da91641b989a43b4ce5f6a09e0eabe6487e9157 
  ql/src/java/org/apache/hadoop/hive/ql/optimizer/SharedScanOptimizer.java 
e31119fd081b5989e64e00c6d903c53040dfd8d3 
  ql/src/java/org/apache/hadoop/hive/ql/parse/GenTezUtils.java 
a9c1e61ba94574d786c3be912a0b4f9eab20db96 
  ql/src/java/org/apache/hadoop/hive/ql/plan/DynamicPruningEventDesc.java 
73bbebd888c508cf7add51287326ba133df9e4d3 
  ql/src/java/org/apache/hadoop/hive/ql/plan/TableScanDesc.java 
04686f7fb98b558d7c7d671341f6e121e7c282c6 
  ql/src/test/results/clientpositive/llap/auto_join0.q.out 
6d051ea3f5117c043f79cfc9c641f1a8ace3c87e 
  ql/src/test/results/clientpositive/llap/auto_join30.q.out 
90c4241f192cfe8f40acc8a1fc0e19fbc846bd97 
  ql/src/test/results/clientpositive/llap/auto_sortmerge_join_9.q.out 
bdb30d735bc481b11fa6b44563ba74861841071b 
  ql/src/test/results/clientpositive/llap/bucket_map_join_tez1.q.out 
042c60bf17f4d696e771dfb24d7f7a4fc146f309 
  ql/src/test/results/clientpositive/llap/correlationoptimizer2.q.out 
cdae4fbfbf107394ccd58e7d9d1d2aeaeaa285ec 
  ql/src/test/results/clientpositive/llap/correlationoptimizer3.q.out 
3e715463a48a18986a82c4e55d8c8576821b0a05 
  ql/src/test/results/clientpositive/llap/correlationoptimizer6.q.out 
82dae9afbb3ae101b7b7cf8214ecfdb08f9d5d34 
  ql/src/test/results/clientpositive/llap/dynamic_partition_pruning.q.out 
2875e13bbaccb4e5bfee3e682e5b2c88acaf31a5 
  ql/src/test/results/clientpositive/llap/except_distinct.q.out 
e4c2941f67858e0171d00032945e4e7d2b7500c9 
  ql/src/test/results/clientpositive/llap/explainuser_1.q.out 
8b04bc9261478fee17b82c780a161410a704f4ac 
  ql/src/test/results/clientpositive/llap/explainuser_2.q.out 
e3f70b097f11255276ffc80082c6c8c115a76a1e 
  ql/src/test/results/clientpositive/llap/intersect_merge.q.out 
a31296672023080cc91865fbb86cdbeb891b2167 
  ql/src/test/results/clientpositive/llap/limit_pushdown.q.out 
57594e016484ea9020b349339a8a9be63709a6cb 
  ql/src/test/results/clientpositive/llap/mrr.q.out 
726349c7f16823f1ff54a8575d8d8a27a50f1772 
  ql/src/test/results/clientpositive/llap/multiMapJoin2.q.out 
b4b0e93c82d73c60bcfe345cea1cb9c8e26ab145 
  ql/src/test/results/clientpositive/llap/offset_limit_ppd_optimizer.q.out 
c89ca6b5cd9fbb26cbf0b416e4ca71ee38d32a3f 
  ql/src/test/results/clientpositive/llap/partition_shared_scan.q.out 
34ba87cc9158b2d23ce8d3c6b3defd2b8a4d36f0 
  ql/src/test/results/clientpositive/llap/subquery_in.q.out 
1f9c9e447416c153f909ebbe960a8343454dde14 
  ql/src/test/results/clientpositive/llap/subquery_multi.q.out 
29516eff82a63a1c551a4173ea1e7ea640bdcef0 
  ql/src/test/results/clientpositive/llap/subquery_notin.q.out 
b4af91579bb44cd30503d48de72a2f8f36e98501 
  ql/src/test/results/clientpositive/llap/subquery_null_agg.q.out 
bff27810e9b0999a3794b1fd505bb602236d40ad 
  ql/src/test/results/clientpositive/llap/subquery_scalar.q.out 
e94edff262685a2e2e3c4ab4ba6382b5d2cc186d 
  ql/src/test/results/clientpositive/llap/subquery_select.q.out 
202980e975b4bdd988df856bc1e533990e5c664b 
  ql/src/test/results/clientpositive/llap/subquery_views.q.out 
1a21a02a309d7fc931513d2c89f0aee3bc8c0fc6 
  ql/src/test/results/clientpositive/llap/unionDistinct_1.q.out 
b4b601993b6eddbe3a62a917267e214a1c09ceef 
  ql/src/test/results/clientpositive/llap/union_top_level.q.out 
2fac8ccf0c8f0c1117874c502c2970d48060e476 
  ql/src/test/results/clientpositive/llap/vector_groupby_grouping_sets4.q.out 
e1ad06c7de5d4fe729ba56ea89be1a55c6b4d487 
  ql/src/test/results/clientpositive/llap/vector_join30.q.out 
ec767507003f09d93f8491f454abf2351bd4226b 
  
ql/src/test/results/clientpositive/llap/vectorized_dynamic_partition_pruning.q.out
 d9fc6b5f5834a041d1f19495234c30948aca2751 
  ql/src/test/results/clientpositive/perf/query1.q.out 
da4a65c86286555f63b25b8e63117d1df5430bc4 
  ql/src/test/results/clientpositive/perf/query10.q.out 
9b6621c1aa03728ea2736474d23bf0b89e42d8f6 
  ql/src/test/results/clientpositive/perf/query14.q.out 
048a17f92f67c4ca78ab1fcfcdbc4572d784e39a 
  ql/src/test/results/clientpositive/perf/query16.q.out 
a7f93f9ec28f38675429429c1668ae7a09b0e5c0 
  ql/src/test/results/clientpositive/perf/query2.q.out 
50d7f7bcfacc81cdc5e3d7485d9ea2c5b5bf1d52 
  ql/src/test/results/clientpositive/perf/query23.q.out 
1fd8cb4f259a12e4b805843e480575b055bb 
  

[jira] [Created] (HIVE-16883) HBaseStorageHandler Ignores Case for HBase Table Name

2017-06-12 Thread Shawn Weeks (JIRA)
Shawn Weeks created HIVE-16883:
--

 Summary: HBaseStorageHandler Ignores Case for HBase Table Name
 Key: HIVE-16883
 URL: https://issues.apache.org/jira/browse/HIVE-16883
 Project: Hive
  Issue Type: Bug
  Components: HBase Handler
Affects Versions: 1.2.1
 Environment: Hortonworks HDP 2.6.0.3, CentOS 7.0, VMWare ESXI
Reporter: Shawn Weeks
Priority: Minor


Currently the HBaseStorageHandler is lower casing the HBase Table name. This 
prevent use of the storage handler with existing HBase tables that are not all 
lower case.

{code}
create 'TestTable', 'd'

create external table `TestTable` (
id bigint,
hash String,
location String,
name String
)
stored by "org.apache.hadoop.hive.hbase.HBaseStorageHandler"
with serdeproperties (
"hbase.columns.mapping" = ":key,d:hash,d:location,d:name",
"hbase.table.name" = "TestTable"
);
{code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)