Michael Bieniosek created HIVE-18786:
----------------------------------------

             Summary: NPE in Hive windowing functions
                 Key: HIVE-18786
                 URL: https://issues.apache.org/jira/browse/HIVE-18786
             Project: Hive
          Issue Type: Bug
    Affects Versions: 2.3.2
            Reporter: Michael Bieniosek


When I run a Hive query with windowing functions, if there's enough data I get 
an NPE.

For example something like this query might break:

select id, created_date, max(created_date) over (partition by id) 
latest_created_any from ...

The only workaround I've found is to remove the windowing functions entirely.

The stacktrace looks suspiciously similar to HADOOP-2931, but I'm in hive-2.3.2 
which appears to have the bugfix applied.

 

Caused by: java.lang.NullPointerException
         at 
org.apache.hadoop.hive.ql.exec.persistence.PTFRowContainer.first(PTFRowContainer.java:115)
         at 
org.apache.hadoop.hive.ql.exec.PTFPartition.iterator(PTFPartition.java:114)
         at 
org.apache.hadoop.hive.ql.udf.ptf.BasePartitionEvaluator.getPartitionAgg(BasePartitionEvaluator.java:200)
         at 
org.apache.hadoop.hive.ql.udf.ptf.WindowingTableFunction.evaluateFunctionOnPartition(WindowingTableFunction.java:155)
         at 
org.apache.hadoop.hive.ql.udf.ptf.WindowingTableFunction.iterator(WindowingTableFunction.java:538)
         at 
org.apache.hadoop.hive.ql.exec.PTFOperator$PTFInvocation.finishPartition(PTFOperator.java:349)
         at 
org.apache.hadoop.hive.ql.exec.PTFOperator.process(PTFOperator.java:123)
         at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:897)
         at 
org.apache.hadoop.hive.ql.exec.SelectOperator.process(SelectOperator.java:95)
         at 
org.apache.hadoop.hive.ql.exec.tez.ReduceRecordSource$GroupIterator.next(ReduceRecordSource.java:356)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to