Kedar Sankar Behera created DRILL-6530:
------------------------------------------
Summary: JVM crash with a Lateral Unnest query involving multiple
json files with one file having a schema change of one column from string to
list
Key: DRILL-6530
URL: https://issues.apache.org/jira/browse/DRILL-6530
Project: Apache Drill
Issue Type: Bug
Affects Versions: 1.14.0
Reporter: Kedar Sankar Behera
Fix For: 1.14.0
Attachments: 0_0_92.json, 0_0_93.json, drillbit.log, drillbit.out,
hs_err_pid32076.log
JVM crash with a Lateral Unnest query involving multiple json files with one
file having a schema change of one column from string to list .
Query :-
{code}
SELECT customer.c_custkey,customer.c_acctbal,orders.o_orderkey,
orders.o_totalprice,orders.o_orderdate,orders.o_shippriority,customer.c_address,orders.o_orderpriority,customer.c_comment
FROM customer, LATERAL
(SELECT O.ord.o_orderkey as o_orderkey, O.ord.o_totalprice as
o_totalprice,O.ord.o_orderdate as o_orderdate ,O.ord.o_shippriority as
o_shippriority,O.ord.o_orderpriority
as o_orderpriority FROM UNNEST(customer.c_orders) O(ord))orders;
{code}
The error got was
{code}
o.a.d.e.p.impl.join.LateralJoinBatch - Output batch still has some space left,
getting new batches from left and right
2018-06-21 15:25:16,303 [24d3da36-bdb8-cb5b-594c-82135bfb84aa:frag:0:0] DEBUG
o.a.d.exec.physical.impl.ScanBatch - set record count 0 for vv c_custkey
2018-06-21 15:25:16,303 [24d3da36-bdb8-cb5b-594c-82135bfb84aa:frag:0:0] DEBUG
o.a.d.exec.physical.impl.ScanBatch - set record count 0 for vv c_phone
2018-06-21 15:25:16,303 [24d3da36-bdb8-cb5b-594c-82135bfb84aa:frag:0:0] DEBUG
o.a.d.exec.physical.impl.ScanBatch - set record count 0 for vv c_acctbal
2018-06-21 15:25:16,303 [24d3da36-bdb8-cb5b-594c-82135bfb84aa:frag:0:0] DEBUG
o.a.d.exec.physical.impl.ScanBatch - set record count 0 for vv c_orders
2018-06-21 15:25:16,303 [24d3da36-bdb8-cb5b-594c-82135bfb84aa:frag:0:0] DEBUG
o.a.d.exec.physical.impl.ScanBatch - set record count 0 for vv c_mktsegment
2018-06-21 15:25:16,303 [24d3da36-bdb8-cb5b-594c-82135bfb84aa:frag:0:0] DEBUG
o.a.d.exec.physical.impl.ScanBatch - set record count 0 for vv c_address
2018-06-21 15:25:16,303 [24d3da36-bdb8-cb5b-594c-82135bfb84aa:frag:0:0] DEBUG
o.a.d.exec.physical.impl.ScanBatch - set record count 0 for vv c_nationkey
2018-06-21 15:25:16,303 [24d3da36-bdb8-cb5b-594c-82135bfb84aa:frag:0:0] DEBUG
o.a.d.exec.physical.impl.ScanBatch - set record count 0 for vv c_name
2018-06-21 15:25:16,303 [24d3da36-bdb8-cb5b-594c-82135bfb84aa:frag:0:0] DEBUG
o.a.d.exec.physical.impl.ScanBatch - set record count 0 for vv c_comment
2018-06-21 15:25:16,316 [24d3da36-bdb8-cb5b-594c-82135bfb84aa:frag:0:0] DEBUG
o.a.d.e.v.c.AbstractContainerVector - Field [o_comment] mutated from
[NullableVarCharVector] to [RepeatedVarCharVector]
2018-06-21 15:25:16,318 [24d3da36-bdb8-cb5b-594c-82135bfb84aa:frag:0:0] DEBUG
o.a.drill.exec.vector.UInt4Vector - Reallocating vector [[`$offsets$`
(UINT4:REQUIRED)]]. # of bytes: [16384] -> [32768]
{code}
On Further investigating with [~shamirwasia] it's found that the crash only
happens when [o_comment] mutates from [NullableVarCharVector] to
[RepeatedVarCharVector],not the other way around
Please find the logs stack trace and the data file
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)