Peeyush Gupta created ASTERIXDB-3620:
----------------------------------------

             Summary: Exception on querying on columnar data
                 Key: ASTERIXDB-3620
                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-3620
             Project: Apache AsterixDB
          Issue Type: Bug
          Components: STO - Storage
            Reporter: Peeyush Gupta


Steps to reproduce:

Create collection

 

{{CREATE COLLECTION Events PRIMARY KEY (id:string)}}

Get Github Events data using

 

{{wget https://data.gharchive.org/2015-01-01-15.json.gz}}

Load this jsonl data into the Events collection

Run following query

 

{{SELECT * FROM Events}}

Causes the following exception

 

{{2025-06-03T16:30:01.510+00:00 WARN CBAS.apache.asterix 
[Executor-29:ClusterController] org.apache.parquet.io.ParquetDecodingException: 
no more value to read, total value count is 1
org.apache.hyracks.api.exceptions.HyracksDataException: 
org.apache.parquet.io.ParquetDecodingException: no more value to read, total 
value count is 1
        at 
org.apache.hyracks.api.exceptions.HyracksDataException.create(HyracksDataException.java:70)
 ~[hyracks-api-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.hyracks.api.util.ExceptionUtils.setNodeIds(ExceptionUtils.java:74) 
~[hyracks-api-1.1.1-1302.jar:1.1.1-1302]
        at org.apache.hyracks.control.nc.Task.run(Task.java:399) 
~[hyracks-control-nc-1.1.1-1302.jar:1.1.1-1302]
        at 
java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1144)
 ~[?:?]
        at 
java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:642)
 ~[?:?]
        at java.base/java.lang.Thread.run(Thread.java:1583) [?:?]
Caused by: org.apache.parquet.io.ParquetDecodingException: no more value to 
read, total value count is 1
        at 
org.apache.asterix.column.bytes.decoder.ParquetDeltaBinaryPackingValuesReader.checkRead(ParquetDeltaBinaryPackingValuesReader.java:127)
 ~[asterix-column-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.asterix.column.bytes.decoder.ParquetDeltaBinaryPackingValuesReader.readLong(ParquetDeltaBinaryPackingValuesReader.java:120)
 ~[asterix-column-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.asterix.column.bytes.decoder.ParquetDeltaBinaryPackingValuesReader.readInteger(ParquetDeltaBinaryPackingValuesReader.java:115)
 ~[asterix-column-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.asterix.column.bytes.decoder.ParquetDeltaByteArrayReader.readBytes(ParquetDeltaByteArrayReader.java:69)
 ~[asterix-column-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.asterix.column.values.reader.value.StringValueReader.nextValue(StringValueReader.java:44)
 ~[asterix-column-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.asterix.column.values.reader.PrimitiveColumnValuesReader.next(PrimitiveColumnValuesReader.java:60)
 ~[asterix-column-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.asterix.column.assembler.PrimitiveValueAssembler.next(PrimitiveValueAssembler.java:49)
 ~[asterix-column-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.asterix.column.operation.query.ColumnAssembler.nextValue(ColumnAssembler.java:87)
 ~[asterix-column-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.asterix.column.tuple.QueryColumnTupleReference.getAssembledValue(QueryColumnTupleReference.java:165)
 ~[asterix-column-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.asterix.column.operation.query.QueryColumnTupleProjector.getAssembledValue(QueryColumnTupleProjector.java:115)
 ~[asterix-column-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.asterix.column.operation.query.QueryColumnTupleProjector.project(QueryColumnTupleProjector.java:93)
 ~[asterix-column-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.hyracks.storage.am.common.dataflow.IndexSearchOperatorNodePushable.writeTupleToOutput(IndexSearchOperatorNodePushable.java:401)
 ~[hyracks-storage-am-common-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.hyracks.storage.am.common.dataflow.IndexSearchOperatorNodePushable.writeSearchResults(IndexSearchOperatorNodePushable.java:274)
 ~[hyracks-storage-am-common-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.hyracks.storage.am.common.dataflow.IndexSearchOperatorNodePushable.searchAllPartitions(IndexSearchOperatorNodePushable.java:470)
 ~[hyracks-storage-am-common-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.hyracks.storage.am.common.dataflow.IndexSearchOperatorNodePushable.nextFrame(IndexSearchOperatorNodePushable.java:316)
 ~[hyracks-storage-am-common-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.hyracks.dataflow.common.comm.io.AbstractFrameAppender.write(AbstractFrameAppender.java:94)
 ~[hyracks-dataflow-common-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.hyracks.algebricks.runtime.operators.std.EmptyTupleSourceRuntimeFactory$1.open(EmptyTupleSourceRuntimeFactory.java:55)
 ~[algebricks-runtime-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.hyracks.algebricks.runtime.operators.meta.AlgebricksMetaOperatorDescriptor$SourcePushRuntime.initialize(AlgebricksMetaOperatorDescriptor.java:176)
 ~[algebricks-runtime-1.1.1-1302.jar:1.1.1-1302]
        at 
org.apache.hyracks.api.rewriter.runtime.SuperActivityOperatorNodePushable.lambda$runInParallel$0(SuperActivityOperatorNodePushable.java:245)
 ~[hyracks-api-1.1.1-1302.jar:1.1.1-1302]
        at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:317) 
~[?:?]
        ... 3 more}}

 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to