[jira] [Created] (HIVE-25078) [cachedstore]

2021-04-30 Thread Ashish Sharma (Jira)
Ashish Sharma created HIVE-25078:


 Summary: [cachedstore]
 Key: HIVE-25078
 URL: https://issues.apache.org/jira/browse/HIVE-25078
 Project: Hive
  Issue Type: Sub-task
  Components: Standalone Metastore
Affects Versions: 4.0.0
Reporter: Ashish Sharma
Assignee: Ashish Sharma


Description

Add a table id check when extracting (i.e. in get calls) the following cached 
objects from the cached store:
1. Table
2. Partitions
3. Constraints
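
A minimal sketch of the kind of check described above, not the actual 
CachedStore code: a lookup returns the cached entry only when its table id 
matches the id the caller expects. CachedTable, TableCacheSketch and 
getIfIdMatches are hypothetical names used for illustration only.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a cached table entry; not a Hive class.
final class CachedTable {
    final long id;
    final String name;

    CachedTable(long id, String name) {
        this.id = id;
        this.name = name;
    }
}

// Hypothetical cache keyed by table name.
final class TableCacheSketch {
    private final Map<String, CachedTable> byName = new HashMap<>();

    void put(CachedTable table) {
        byName.put(table.name, table);
    }

    // Returns the cached table only if its id equals expectedId.
    // Returns null on a cache miss or on an id mismatch, so the caller
    // can fall back to the backing store instead of using a stale entry.
    CachedTable getIfIdMatches(String name, long expectedId) {
        CachedTable cached = byName.get(name);
        if (cached == null || cached.id != expectedId) {
            return null;
        }
        return cached;
    }
}
```

The same comparison would presumably apply to the partition and constraint 
lookups, keyed by the id of the table they belong to.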



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-25080) Create metric about oldest entry in "ready for cleaning" state

2021-04-30 Thread Antal Sinkovits (Jira)
Antal Sinkovits created HIVE-25080:

 Summary: Create metric about oldest entry in "ready for cleaning" 
state
 Key: HIVE-25080
 URL: https://issues.apache.org/jira/browse/HIVE-25080
 Project: Hive
  Issue Type: Bug
Reporter: Antal Sinkovits
Assignee: Antal Sinkovits


When a compaction txn commits, COMPACTION_QUEUE.CQ_COMMIT_TIME is updated with 
the current time. Then the compaction state is set to "ready for cleaning". 
(... and then the Cleaner runs and the state is set to "succeeded" hopefully)

Based on this we know (roughly) how long a compaction has been in state "ready 
for cleaning".

We should create a metric, similar to compaction_oldest_enqueue_age_in_sec, 
that shows when the Cleaner is blocked by something, i.e. find the compaction 
in "ready for cleaning" that has the oldest commit time.
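
The computation behind such a metric could look roughly like the sketch 
below. It is an illustration under stated assumptions, not the proposed 
implementation: CompactionEntry and OldestReadyForCleaning are hypothetical 
names, the state is modelled as a plain string, and the commit time is 
assumed to be in epoch milliseconds.

```java
import java.util.List;
import java.util.OptionalLong;

// Hypothetical view of one COMPACTION_QUEUE row; only the fields needed here.
final class CompactionEntry {
    final String state;       // e.g. "ready for cleaning"
    final long commitTimeMs;  // CQ_COMMIT_TIME, assumed to be epoch milliseconds

    CompactionEntry(String state, long commitTimeMs) {
        this.state = state;
        this.commitTimeMs = commitTimeMs;
    }
}

final class OldestReadyForCleaning {
    // Age in seconds of the "ready for cleaning" entry with the oldest
    // commit time, or empty if no entry is in that state.
    static OptionalLong ageInSec(List<CompactionEntry> entries, long nowMs) {
        OptionalLong oldest = entries.stream()
            .filter(e -> "ready for cleaning".equals(e.state))
            .mapToLong(e -> e.commitTimeMs)
            .min();
        return oldest.isPresent()
            ? OptionalLong.of((nowMs - oldest.getAsLong()) / 1000)
            : OptionalLong.empty();
    }
}
```

A value that keeps growing between collection runs would suggest that the 
Cleaner is not making progress on that entry.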





[jira] [Created] (HIVE-25079) Create new metric about number of writes to tables with manually disabled compaction

2021-04-30 Thread Antal Sinkovits (Jira)
Antal Sinkovits created HIVE-25079:

 Summary: Create new metric about number of writes to tables with 
manually disabled compaction
 Key: HIVE-25079
 URL: https://issues.apache.org/jira/browse/HIVE-25079
 Project: Hive
  Issue Type: Bug
Reporter: Antal Sinkovits
Assignee: Antal Sinkovits


Create a new metric that measures the number of writes to tables that have 
compaction turned off manually. It does not matter whether the write is 
committed or aborted (both are bad...)
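
One way such a counter could be sketched, purely as an illustration: the 
class name DisabledCompactionWriteCounter and its methods are hypothetical, 
and the sketch assumes that "turned off manually" means the table property 
no_auto_compaction is set to true.

```java
import java.util.Map;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical counter; not part of Hive.
final class DisabledCompactionWriteCounter {
    // Assumption: compaction is disabled manually via this table property.
    static final String NO_AUTO_COMPACTION = "no_auto_compaction";

    private final AtomicLong writes = new AtomicLong();

    // Called once per write, regardless of whether the transaction
    // later commits or aborts.
    void onWrite(Map<String, String> tableParams) {
        // parseBoolean(null) is false, so a missing property is not counted.
        if (Boolean.parseBoolean(tableParams.get(NO_AUTO_COMPACTION))) {
            writes.incrementAndGet();
        }
    }

    long count() {
        return writes.get();
    }
}
```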





[jira] [Created] (HIVE-25081) Put metrics collection behind a feature flag

2021-04-30 Thread Antal Sinkovits (Jira)
Antal Sinkovits created HIVE-25081:

 Summary: Put metrics collection behind a feature flag
 Key: HIVE-25081
 URL: https://issues.apache.org/jira/browse/HIVE-25081
 Project: Hive
  Issue Type: Bug
Reporter: Antal Sinkovits
Assignee: Antal Sinkovits


Most metrics we're creating are collected in AcidMetricsService, which is 
behind a feature flag. However, some metrics are collected outside of the 
service. These should also be behind a feature flag, in addition to 
hive.metastore.metrics.enabled.





[jira] [Created] (HIVE-25082) Make SettableTreeReader updateTimezone a default method

2021-04-30 Thread Panagiotis Garefalakis (Jira)
Panagiotis Garefalakis created HIVE-25082:

 Summary: Make SettableTreeReader updateTimezone a default method
 Key: HIVE-25082
 URL: https://issues.apache.org/jira/browse/HIVE-25082
 Project: Hive
  Issue Type: Improvement
Reporter: Panagiotis Garefalakis
Assignee: Panagiotis Garefalakis


Avoid useless TimestampStreamReader instance checks by making updateTimezone() 
a default method in SettableTreeReader
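
The idea can be illustrated with the simplified sketch below. The type names 
ending in "Sketch" are hypothetical and the method signature is an 
assumption; the real interfaces have more members. With a no-op default in 
the interface, callers can invoke updateTimezone() on every reader without 
first checking its concrete type.

```java
// Hypothetical, simplified shapes; not the real Hive/ORC types.
interface SettableTreeReaderSketch {
    // Default no-op: readers without timezone-dependent state inherit this.
    default void updateTimezone(String writerTimezoneId) {
    }
}

// A reader that has nothing to update and simply inherits the default.
final class LongReaderSketch implements SettableTreeReaderSketch {
}

// A timestamp reader overrides the default to record the new timezone.
final class TimestampReaderSketch implements SettableTreeReaderSketch {
    String timezoneId = "UTC";

    @Override
    public void updateTimezone(String writerTimezoneId) {
        this.timezoneId = writerTimezoneId;
    }
}
```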





[jira] [Created] (HIVE-25083) Extra reviewer pattern

2021-04-30 Thread Panagiotis Garefalakis (Jira)
Panagiotis Garefalakis created HIVE-25083:

 Summary: Extra reviewer pattern
 Key: HIVE-25083
 URL: https://issues.apache.org/jira/browse/HIVE-25083
 Project: Hive
  Issue Type: Sub-task
Reporter: Panagiotis Garefalakis
Assignee: Panagiotis Garefalakis








[jira] [Created] (HIVE-25084) Incorrect aggregate results on bucketed table

2021-04-30 Thread Naresh P R (Jira)
Naresh P R created HIVE-25084:

 Summary: Incorrect aggregate results on bucketed table
 Key: HIVE-25084
 URL: https://issues.apache.org/jira/browse/HIVE-25084
 Project: Hive
  Issue Type: Bug
Reporter: Naresh P R


Steps to repro
{code:java}
CREATE TABLE test_table(
col1 int,
col2 char(32),
col3 varchar(3))
CLUSTERED BY (col2)
 SORTED BY (
   col2 ASC,
   col3 ASC,
   col1 ASC)
 INTO 32 BUCKETS stored as orc;

set hive.query.results.cache.enabled=false;
insert into test_table values(2, "123456", "15");
insert into test_table values(1, "123456", "15");

SELECT col2, col3, max(col1) AS max_sequence FROM test_table GROUP BY col2, col3;
-- ==> LocalFetch correct result <==
-- 123456 15 2

-- ==> Wrong result with Tez/Llap <==
set hive.fetch.task.conversion=none;
-- (same SELECT as above)
-- 123456 15 2
-- 123456 15 1

-- ==> Correct result with Tez/Llap disabling map aggregation <==
set hive.map.aggr=false;
-- (same SELECT as above)
-- 123456 15 2
{code}


