fujianhua168 commented on issue #8754:
URL: https://github.com/apache/hudi/issues/8754#issuecomment-1562375375
>
When I performed data validation on the branch, I found a strange problem:
after the flink yarn-session process was suddenly interrupted, I executed the
sql statement again through the trino branch( but spark3 read this hudi table
is normal) , and an error would be generated: Query 20230525_ 063757_ 00006_
bdgnm failed: Index 0 out of bounds for length 0.
error sql statement: select count(1) as cnt from
hudi_demo.ods_pub_log_qt_collect_mor_d;
Here is then error Stack Trace:
java.lang.IndexOutOfBoundsException: Index 0 out of bounds for length 0
at
java.base/jdk.internal.util.Preconditions.outOfBounds(Preconditions.java:64)
at
java.base/jdk.internal.util.Preconditions.outOfBoundsCheckIndex(Preconditions.java:70)
at
java.base/jdk.internal.util.Preconditions.checkIndex(Preconditions.java:266)
at java.base/java.util.Objects.checkIndex(Objects.java:361)
at java.base/java.util.ArrayList.get(ArrayList.java:427)
at io.trino.plugin.hudi.HudiUtil.getHudiBaseFile(HudiUtil.java:176)
at
io.trino.plugin.hudi.HudiRecordCursor.createRealtimeRecordCursor(HudiRecordCursor.java:71)
at
io.trino.plugin.hudi.HudiPageSourceProvider.createPageSource(HudiPageSourceProvider.java:193)
at
io.trino.plugin.base.classloader.ClassLoaderSafeConnectorPageSourceProvider.createPageSource(ClassLoaderSafeConnectorPageSourceProvider.java:49)
at
io.trino.split.PageSourceManager.createPageSource(PageSourceManager.java:62)
at
io.trino.operator.TableScanOperator.getOutput(TableScanOperator.java:298)
at io.trino.operator.Driver.processInternal(Driver.java:402)
at io.trino.operator.Driver.lambda$process$8(Driver.java:305)
at io.trino.operator.Driver.tryWithLock(Driver.java:701)
at io.trino.operator.Driver.process(Driver.java:297)
at io.trino.operator.Driver.processForDuration(Driver.java:268)
at
io.trino.execution.SqlTaskExecution$DriverSplitRunner.processFor(SqlTaskExecution.java:888)
at
io.trino.execution.executor.PrioritizedSplitRunner.process(PrioritizedSplitRunner.java:187)
at
io.trino.execution.executor.TaskExecutor$TaskRunner.run(TaskExecutor.java:561)
at
io.trino.$gen.Trino_359_8548_g9e61519_dirty____20230525_055552_2.run(Unknown
Source)
at
java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
at
java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
at java.base/java.lang.Thread.run(Thread.java:833)


--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]