[
https://issues.apache.org/jira/browse/FLINK-8230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16285250#comment-16285250
]
Sebastian Klemke commented on FLINK-8230:
-----------------------------------------
If possible, I'd like this change to also be merged to release-1.4 branch.
flink-orc is new, so it can't create a regression.
> NPE in OrcRowInputFormat on nested structs
> ------------------------------------------
>
> Key: FLINK-8230
> URL: https://issues.apache.org/jira/browse/FLINK-8230
> Project: Flink
> Issue Type: Bug
> Components: Batch Connectors and Input/Output Formats
> Affects Versions: 1.4.0
> Reporter: Sebastian Klemke
> Priority: Minor
>
> OrcRowInputFormat ignores isNull and isRepeating on nested struct columns. If
> a struct column contains nulls, it tries to read struct fields, leading to
> NPE in case of string fields:
> {code}
> java.lang.NullPointerException
> at java.lang.String.checkBounds(String.java:384)
> at java.lang.String.<init>(String.java:462)
> at
> org.apache.flink.orc.OrcUtils.readNonNullBytesColumnAsString(OrcUtils.java:392)
> at org.apache.flink.orc.OrcUtils.readField(OrcUtils.java:215)
> at org.apache.flink.orc.OrcUtils.readStructColumn(OrcUtils.java:1203)
> at org.apache.flink.orc.OrcUtils.readField(OrcUtils.java:252)
> at
> org.apache.flink.orc.OrcUtils.readNonNullStructColumn(OrcUtils.java:677)
> at org.apache.flink.orc.OrcUtils.readField(OrcUtils.java:250)
> at org.apache.flink.orc.OrcUtils.fillRows(OrcUtils.java:142)
> at
> org.apache.flink.orc.OrcRowInputFormat.ensureBatch(OrcRowInputFormat.java:334)
> at
> org.apache.flink.orc.OrcRowInputFormat.reachedEnd(OrcRowInputFormat.java:314)
> at
> org.apache.flink.runtime.operators.DataSourceTask.invoke(DataSourceTask.java:165)
> at org.apache.flink.runtime.taskmanager.Task.run(Task.java:718)
> at java.lang.Thread.run(Thread.java:748)
> {code}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)