Maksym Rymar created DRILL-8486:
-----------------------------------

             Summary: ParquetDecodingException: could not read bytes at offset
                 Key: DRILL-8486
                 URL: https://issues.apache.org/jira/browse/DRILL-8486
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Parquet
    Affects Versions: 1.21.1
            Reporter: Maksym Rymar
            Assignee: Maksym Rymar
         Attachments: test.parquet

Drill fails to read a Parquet file with the following exception:

{code:java}
Caused by: org.apache.parquet.io.ParquetDecodingException: could not read bytes at offset 591804
    at org.apache.parquet.column.values.plain.BinaryPlainValuesReader.readBytes(BinaryPlainValuesReader.java:42)
    at org.apache.drill.exec.store.parquet.columnreaders.VarLenColumnBulkInput$ValuesReaderWrapper.getNextEntry(VarLenColumnBulkInput.java:754)
    ... 43 common frames omitted
Caused by: java.io.EOFException: null
    at org.apache.parquet.bytes.SingleBufferInputStream.read(SingleBufferInputStream.java:52)
    at org.apache.parquet.bytes.BytesUtils.readIntLittleEndian(BytesUtils.java:83)
    at org.apache.parquet.column.values.plain.BinaryPlainValuesReader.readBytes(BinaryPlainValuesReader.java:39)
    ... 44 common frames omitted
{code}

This issue only affects queries that run with {{store.parquet.flat.reader.bulk}} set to {{true}} (the default).

The Parquet file needed to reproduce the issue is attached: [^test.parquet].

Query: {{select log, app_name from dfs.tmp.`test.parquet`}}

--
This message was sent by Atlassian Jira (v8.20.10#820010)