[ https://issues.apache.org/jira/browse/ORC-1873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17942356#comment-17942356 ]
dzcxzl commented on ORC-1873:
-----------------------------

See org.apache.orc.impl.TreeReaderFactory.BytesColumnVectorUtil#readOrcByteArrays
[https://github.com/apache/orc/blob/main/java/core/src/java/org/apache/orc/impl/TreeReaderFactory.java#L2086-L2089]

Or use the flatten API:
{code:java}
BytesColumnVector value = (BytesColumnVector) batch.cols[0];
value.flatten(false, batch.selected, batch.size);
{code}

> I'm writing repeated string values to a string column in an ORC file using Java and while reading the ORC file back, encounter a NullPointerException
> -----------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: ORC-1873
>                 URL: https://issues.apache.org/jira/browse/ORC-1873
>             Project: ORC
>          Issue Type: Bug
>            Reporter: Shiva
>            Priority: Major
>         Attachments: ORCReader.java, ORCWriter.java
>
>
> When the same value is written to a specific column for every row within a
> stripe, only the first row returns the expected value. All subsequent rows
> in that stripe return {{null}} for the same column.
> However, if two or more distinct values are written to the column within the
> same stripe, the issue does not occur: all values are returned as expected.
>
> The reader and writer code files are attached.
>
> When I try to read all the rows, I get the following exception:
> {code:java}
> Row 1 value: value
> Exception in thread "main" java.lang.NullPointerException: Cannot read the array length because "bytes" is null
>     at java.base/java.lang.String.<init>(String.java:1455)
>     at com.ge.ORCReader.main(ORCReader.java:38){code}

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
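
For readers of this thread, here is a minimal sketch of the alternative to flatten(): when a vector is marked as repeating, read every row from index 0. To stay self-contained it does not use the real ORC classes. BytesColumn is a hypothetical stand-in that only mirrors the public fields vector, start, length and isRepeating of BytesColumnVector, and readRow is an illustrative helper, not part of any ORC API. It also assumes, for simplicity, that a null byte array means a null value; the real class tracks nulls through noNulls and isNull.

```java
import java.nio.charset.StandardCharsets;

public class RepeatingVectorSketch {

    // Hypothetical stand-in for the public fields of BytesColumnVector.
    static final class BytesColumn {
        byte[][] vector;
        int[] start;
        int[] length;
        boolean isRepeating;

        BytesColumn(int size) {
            vector = new byte[size][];
            start = new int[size];
            length = new int[size];
        }
    }

    // Illustrative helper: when the column is repeating, only entry 0 is
    // populated, so every row must be read from index 0.
    static String readRow(BytesColumn col, int row) {
        int idx = col.isRepeating ? 0 : row;
        byte[] bytes = col.vector[idx];
        if (bytes == null) {
            return null; // simplification: treat a missing array as null
        }
        return new String(bytes, col.start[idx], col.length[idx],
                StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Simulate a batch of three rows that all hold the same value:
        // only slot 0 is filled and isRepeating is set.
        BytesColumn col = new BytesColumn(3);
        byte[] data = "value".getBytes(StandardCharsets.UTF_8);
        col.vector[0] = data;
        col.start[0] = 0;
        col.length[0] = data.length;
        col.isRepeating = true;

        for (int r = 0; r < 3; r++) {
            System.out.println("Row " + (r + 1) + " value: " + readRow(col, r));
        }
    }
}
```

Indexing vector[r] directly for r > 0 in this simulated batch would hand a null array to the String constructor, which matches the exception in the report. Calling flatten() as suggested in the comment avoids the index check by filling in every slot up front.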