[jira] [Created] (HIVE-24293) Integer overflow in llap collision mask

2020-10-21 Thread Antal Sinkovits (Jira)
Antal Sinkovits created HIVE-24293:
--

 Summary: Integer overflow in llap collision mask
 Key: HIVE-24293
 URL: https://issues.apache.org/jira/browse/HIVE-24293
 Project: Hive
  Issue Type: Bug
Affects Versions: 4.0.0
Reporter: Antal Sinkovits
Assignee: Antal Sinkovits






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-23847) Extracting hive-parser module broke exec jar upload in tez

2020-07-14 Thread Antal Sinkovits (Jira)
Antal Sinkovits created HIVE-23847:
--

 Summary: Extracting hive-parser module broke exec jar upload in tez
 Key: HIVE-23847
 URL: https://issues.apache.org/jira/browse/HIVE-23847
 Project: Hive
  Issue Type: Bug
Reporter: Antal Sinkovits


2020-07-13 16:53:50,551 [INFO] [Dispatcher thread {Central}] 
|HistoryEventHandler.criticalEvents|: 
[HISTORY][DAG:dag_1594632473849_0001_1][Event:TASK_ATTEMPT_FINISHED]: 
vertexName=Map 1, taskAttemptId=attempt_1594632473849_0001_1_00_00_0, 
creationTime=1594652027059, allocationTime=1594652028460, 
startTime=1594652029356, finishTime=1594652030546, timeTaken=1190, 
status=FAILED, taskFailureType=NON_FATAL, errorEnum=FRAMEWORK_ERROR, 
diagnostics=Error: Error while running task ( failure ) : 
attempt_1594632473849_0001_1_00_00_0:java.lang.RuntimeException: 
java.lang.RuntimeException: Map operator initialization failed
at 
org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:296)
at 
org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:250)
at 
org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:381)
at 
org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:75)
at 
org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:62)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:422)
at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1729)
at 
org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:62)
at 
org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:38)
at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36)
at 
com.google.common.util.concurrent.TrustedListenableFutureTask$TrustedFutureInterruptibleTask.runInterruptibly(TrustedListenableFutureTask.java:125)
at 
com.google.common.util.concurrent.InterruptibleTask.run(InterruptibleTask.java:57)
at 
com.google.common.util.concurrent.TrustedListenableFutureTask.run(TrustedListenableFutureTask.java:78)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.RuntimeException: Map operator initialization failed
at 
org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.init(MapRecordProcessor.java:340)
at 
org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:266)
... 16 more
Caused by: java.lang.NoClassDefFoundError: 
org/apache/hadoop/hive/ql/parse/ParseException
at java.lang.Class.getDeclaredConstructors0(Native Method)
at java.lang.Class.privateGetDeclaredConstructors(Class.java:2671)
at java.lang.Class.getConstructor0(Class.java:3075)
at java.lang.Class.getDeclaredConstructor(Class.java:2178)
at 
org.apache.hive.common.util.ReflectionUtil.newInstance(ReflectionUtil.java:79)
at 
org.apache.hadoop.hive.ql.exec.Registry.registerGenericUDTF(Registry.java:225)
at 
org.apache.hadoop.hive.ql.exec.Registry.registerGenericUDTF(Registry.java:217)
at 
org.apache.hadoop.hive.ql.exec.FunctionRegistry.(FunctionRegistry.java:544)
at 
org.apache.hadoop.hive.ql.exec.ExprNodeGenericFuncEvaluator.isDeterministic(ExprNodeGenericFuncEvaluator.java:154)
at 
org.apache.hadoop.hive.ql.exec.ExprNodeEvaluator.isConsistentWithinQuery(ExprNodeEvaluator.java:117)
at 
org.apache.hadoop.hive.ql.exec.ExprNodeEvaluatorFactory.iterate(ExprNodeEvaluatorFactory.java:102)
at 
org.apache.hadoop.hive.ql.exec.ExprNodeEvaluatorFactory.toCachedEvals(ExprNodeEvaluatorFactory.java:76)
at 
org.apache.hadoop.hive.ql.exec.SelectOperator.initializeOp(SelectOperator.java:69)
at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:359)
at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:548)
at 
org.apache.hadoop.hive.ql.exec.Operator.initializeChildren(Operator.java:502)
at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:368)
at 
org.apache.hadoop.hive.ql.exec.MapOperator.initializeMapOperator(MapOperator.java:506)
at 
org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.init(MapRecordProcessor.java:303)
... 17 more
Caused by: java.lang.ClassNotFoundException: 
org.apache.hadoop.hive.ql.parse.ParseException
at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
at java.lang.ClassLoader.loadClass(ClassLoader.java:418)
at 

[jira] [Created] (HIVE-23741) Store CacheTags in the file cache level

2020-06-22 Thread Antal Sinkovits (Jira)
Antal Sinkovits created HIVE-23741:
--

 Summary: Store CacheTags in the file cache level
 Key: HIVE-23741
 URL: https://issues.apache.org/jira/browse/HIVE-23741
 Project: Hive
  Issue Type: Improvement
Reporter: Antal Sinkovits
Assignee: Antal Sinkovits


CacheTags are currently stored for every data buffer. The strings are 
internalized, but the number of cache tag objects can be reduced by moving them 
to the file cache level, and back referencing them.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-22992) ZkRegistryBase caching mechanism only caches the first instance

2020-03-06 Thread Antal Sinkovits (Jira)
Antal Sinkovits created HIVE-22992:
--

 Summary: ZkRegistryBase caching mechanism only caches the first 
instance
 Key: HIVE-22992
 URL: https://issues.apache.org/jira/browse/HIVE-22992
 Project: Hive
  Issue Type: Bug
  Components: llap
Affects Versions: 4.0.0
Reporter: Antal Sinkovits
Assignee: Antal Sinkovits


ZkRegistryBase caching mechanism only caches the first instance of the llap 
node running on the same host.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-22898) CharsetDecoder race condition in OrcRecordUpdater

2020-02-18 Thread Antal Sinkovits (Jira)
Antal Sinkovits created HIVE-22898:
--

 Summary: CharsetDecoder race condition in OrcRecordUpdater 
 Key: HIVE-22898
 URL: https://issues.apache.org/jira/browse/HIVE-22898
 Project: Hive
  Issue Type: Bug
Affects Versions: 4.0.0
Reporter: Antal Sinkovits
Assignee: Antal Sinkovits


Instances of CharsetDecoder are not thread safe, causing race condition in 
OrcRecordUpdater



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-21949) Revert HIVE-21232 LLAP: Add a cache-miss friendly split affinity provider

2019-07-03 Thread Antal Sinkovits (JIRA)
Antal Sinkovits created HIVE-21949:
--

 Summary: Revert HIVE-21232 LLAP: Add a cache-miss friendly split 
affinity provider
 Key: HIVE-21949
 URL: https://issues.apache.org/jira/browse/HIVE-21949
 Project: Hive
  Issue Type: Bug
Reporter: Antal Sinkovits
Assignee: Antal Sinkovits
 Attachments: HIVE-21949.01.patch





--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-21610) Union operator can flow in the wrong stage causing NPE

2019-04-12 Thread Antal Sinkovits (JIRA)
Antal Sinkovits created HIVE-21610:
--

 Summary: Union operator can flow in the wrong stage causing NPE
 Key: HIVE-21610
 URL: https://issues.apache.org/jira/browse/HIVE-21610
 Project: Hive
  Issue Type: Bug
Reporter: Antal Sinkovits
Assignee: Antal Sinkovits


Because of HIVE-16227 it can happen that a UnionOperator will partially go into 
the wrong stage, because the currTask is changed, and the UnionOperator is 
reinitialized in GenMRFileSink1 with the wrong task.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-21570) Convert llap iomem servlets output to json format

2019-04-03 Thread Antal Sinkovits (JIRA)
Antal Sinkovits created HIVE-21570:
--

 Summary: Convert llap iomem servlets output to json format
 Key: HIVE-21570
 URL: https://issues.apache.org/jira/browse/HIVE-21570
 Project: Hive
  Issue Type: Improvement
Affects Versions: 4.0.0
Reporter: Antal Sinkovits
Assignee: Antal Sinkovits






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-21315) Consolidate rawDataSize stat calculation

2019-02-25 Thread Antal Sinkovits (JIRA)
Antal Sinkovits created HIVE-21315:
--

 Summary: Consolidate rawDataSize stat calculation 
 Key: HIVE-21315
 URL: https://issues.apache.org/jira/browse/HIVE-21315
 Project: Hive
  Issue Type: Improvement
Affects Versions: 4.0.0
Reporter: Antal Sinkovits


RawDataSize statistics represents the table size, when loaded into memory. 
Sometimes this value is used to determine, whether a table should be used in a 
map join or not.
This value should probably be the same, regardless of the underlaying  file 
format used.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-21284) StatsWork should use footer scan for Parquet

2019-02-18 Thread Antal Sinkovits (JIRA)
Antal Sinkovits created HIVE-21284:
--

 Summary: StatsWork should use footer scan for Parquet
 Key: HIVE-21284
 URL: https://issues.apache.org/jira/browse/HIVE-21284
 Project: Hive
  Issue Type: Bug
Affects Versions: 4.0.0
Reporter: Antal Sinkovits
Assignee: Antal Sinkovits






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-21035) Race condition in SparkUtilities#getSparkSession

2018-12-12 Thread Antal Sinkovits (JIRA)
Antal Sinkovits created HIVE-21035:
--

 Summary: Race condition in SparkUtilities#getSparkSession
 Key: HIVE-21035
 URL: https://issues.apache.org/jira/browse/HIVE-21035
 Project: Hive
  Issue Type: Bug
  Components: Spark
Affects Versions: 4.0.0
Reporter: Antal Sinkovits
Assignee: Antal Sinkovits


It can happen, that when in one given session, multiple queries are executed, 
that due to a race condition, multiple spark application master gets kicked off.
In this case, the one that started earlier, will not be killed, when the hive 
session closes, consuming resources.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-20907) TestGetPartitionsUsingProjectionAndFilterSpecs is flaky

2018-11-12 Thread Antal Sinkovits (JIRA)
Antal Sinkovits created HIVE-20907:
--

 Summary: TestGetPartitionsUsingProjectionAndFilterSpecs is flaky
 Key: HIVE-20907
 URL: https://issues.apache.org/jira/browse/HIVE-20907
 Project: Hive
  Issue Type: Bug
Affects Versions: 4.0.0
Reporter: Antal Sinkovits


private void verifyLocations(List origPartitions, StorageDescriptor 
sharedSD,
  List partitionWithoutSDS)

method expects, that the order of the two list are the same.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-20904) Yetus fails to resolve module dependencies due to usage of exec plugin in metastore-server

2018-11-12 Thread Antal Sinkovits (JIRA)
Antal Sinkovits created HIVE-20904:
--

 Summary: Yetus fails to resolve module dependencies due to usage 
of exec plugin in metastore-server
 Key: HIVE-20904
 URL: https://issues.apache.org/jira/browse/HIVE-20904
 Project: Hive
  Issue Type: Bug
Reporter: Antal Sinkovits
Assignee: Antal Sinkovits


metastore-server uses exec-maven-plugin to generate metastore-site.xml.template 
with ConfTemplatePrinter.
It expects some arguments. 
Because yetus also uses the exec-maven-plugin to determine the order of the 
modules to be built, but with zero params, the execution fails.
https://github.com/apache/yetus/blob/6ebaa1119e611db14f219e289e33ab8ac5c254a7/precommit/src/main/shell/test-patch.d/maven.sh#L658

Steps to reproduce the issue:
mvn -q exec:exec -Dexec.executable=pwd -Dexec.args=''



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-20742) SparkSessionManagerImpl maintenance thread only cleans up session once

2018-10-13 Thread Antal Sinkovits (JIRA)
Antal Sinkovits created HIVE-20742:
--

 Summary: SparkSessionManagerImpl maintenance thread only cleans up 
session once
 Key: HIVE-20742
 URL: https://issues.apache.org/jira/browse/HIVE-20742
 Project: Hive
  Issue Type: Bug
Affects Versions: 4.0.0
Reporter: Antal Sinkovits
Assignee: Antal Sinkovits


If there is a reconnect at the client session, the SparkSessionManagerImpl 
doesn't puts it back in the created sessions, so it will not time out the 
second time.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-20440) Create better cache eviction policy for SmallTableCache

2018-08-22 Thread Antal Sinkovits (JIRA)
Antal Sinkovits created HIVE-20440:
--

 Summary: Create better cache eviction policy for SmallTableCache
 Key: HIVE-20440
 URL: https://issues.apache.org/jira/browse/HIVE-20440
 Project: Hive
  Issue Type: Improvement
  Components: Spark
Reporter: Antal Sinkovits
Assignee: Antal Sinkovits


Enhance the SmallTableCache, to use guava cache with soft references, so that 
we evict when there is memory pressure.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (HIVE-19486) Discrepancy between the config and the code in Hikari connectionPoolingType

2018-05-10 Thread Antal Sinkovits (JIRA)
Antal Sinkovits created HIVE-19486:
--

 Summary: Discrepancy between the config and the code in Hikari 
connectionPoolingType
 Key: HIVE-19486
 URL: https://issues.apache.org/jira/browse/HIVE-19486
 Project: Hive
  Issue Type: Bug
Reporter: Antal Sinkovits
Assignee: Antal Sinkovits


MetaStoreConf contains datanucleus.connectionPoolingType "HikariCP" not 
"Hikari".



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)