sivabalan narayanan created HUDI-8644:
-----------------------------------------
Summary: Clustering based on new FG reader hits Concurrent
modification exception
Key: HUDI-8644
URL: https://issues.apache.org/jira/browse/HUDI-8644
Project: Apache Hudi
Issue Type: Bug
Components: clustering
Reporter: sivabalan narayanan
{code:java}
24/12/04 01:53:19 WARN TaskSetManager: Lost task 3.0 in stage 175.0 (TID 7640)
(ip-10-0-106-23.us-west-2.compute.internal executor 7):
com.esotericsoftware.kryo.KryoException: java.lang.UnsupportedOperationException
Serialization trace:
reserved (org.apache.avro.Schema$Field)
fieldMap (org.apache.avro.Schema$RecordSchema)
right (org.apache.hudi.common.util.collection.ImmutablePair)
at
com.esotericsoftware.kryo.serializers.ObjectField.read(ObjectField.java:144)
at
com.esotericsoftware.kryo.serializers.FieldSerializer.read(FieldSerializer.java:543)
at com.esotericsoftware.kryo.Kryo.readClassAndObject(Kryo.java:813)
at
com.esotericsoftware.kryo.serializers.MapSerializer.read(MapSerializer.java:161)
at
com.esotericsoftware.kryo.serializers.MapSerializer.read(MapSerializer.java:39)
at com.esotericsoftware.kryo.Kryo.readObject(Kryo.java:731)
at
com.esotericsoftware.kryo.serializers.ObjectField.read(ObjectField.java:125)
at
com.esotericsoftware.kryo.serializers.FieldSerializer.read(FieldSerializer.java:543)
at com.esotericsoftware.kryo.Kryo.readClassAndObject(Kryo.java:813)
at
com.esotericsoftware.kryo.serializers.MapSerializer.read(MapSerializer.java:161)
at
com.esotericsoftware.kryo.serializers.MapSerializer.read(MapSerializer.java:39)
at com.esotericsoftware.kryo.Kryo.readObject(Kryo.java:731)
at
com.esotericsoftware.kryo.serializers.ObjectField.read(ObjectField.java:125)
at
com.esotericsoftware.kryo.serializers.FieldSerializer.read(FieldSerializer.java:543)
at com.esotericsoftware.kryo.Kryo.readClassAndObject(Kryo.java:813)
at
org.apache.hudi.common.util.SerializationUtils$KryoSerializerInstance.deserialize(SerializationUtils.java:103){code}
{code:java}
at
org.apache.hudi.common.util.SerializationUtils.deserialize(SerializationUtils.java:77)
at
org.apache.hudi.common.util.collection.BitCaskDiskMap.get(BitCaskDiskMap.java:209)
at
org.apache.hudi.common.util.collection.BitCaskDiskMap.get(BitCaskDiskMap.java:202)
at
org.apache.hudi.common.util.collection.BitCaskDiskMap.get(BitCaskDiskMap.java:198)
at
org.apache.hudi.common.util.collection.BitCaskDiskMap.remove(BitCaskDiskMap.java:245)
at
org.apache.hudi.common.util.collection.BitCaskDiskMap.remove(BitCaskDiskMap.java:67)
at
org.apache.hudi.common.util.collection.ExternalSpillableMap.remove(ExternalSpillableMap.java:240)
at
org.apache.hudi.common.table.read.HoodiePositionBasedFileGroupRecordBuffer.hasNextBaseRecord(HoodiePositionBasedFileGroupRecordBuffer.java:221)
at
org.apache.hudi.common.table.read.HoodieKeyBasedFileGroupRecordBuffer.doHasNext(HoodieKeyBasedFileGroupRecordBuffer.java:135)
at
org.apache.hudi.common.table.read.HoodieBaseFileGroupRecordBuffer.hasNext(HoodieBaseFileGroupRecordBuffer.java:156)
at
org.apache.hudi.common.table.read.HoodieFileGroupReader.hasNext(HoodieFileGroupReader.java:236)
at
org.apache.hudi.common.table.read.HoodieFileGroupReader$HoodieFileGroupReaderIterator.hasNext(HoodieFileGroupReader.java:296)
at
org.apache.hudi.io.HoodieSparkMergeHandleV2.write(HoodieSparkMergeHandleV2.java:230)
at
org.apache.hudi.table.HoodieSparkCopyOnWriteTable.compactUsingFileGroupReader(HoodieSparkCopyOnWriteTable.java:285)
at
org.apache.hudi.table.action.compact.HoodieCompactor.compact(HoodieCompactor.java:295)
at
org.apache.hudi.table.action.compact.HoodieCompactor.lambda$compact$41887b12$1(HoodieCompactor.java:150)
at
org.apache.spark.api.java.JavaPairRDD$.$anonfun$toScalaFunction$1(JavaPairRDD.scala:1070)
at scala.collection.Iterator$$anon$10.next(Iterator.scala:461)
at scala.collection.Iterator$$anon$11.nextCur(Iterator.scala:486)
at scala.collection.Iterator$$anon$11.hasNext(Iterator.scala:492)
at
org.apache.spark.storage.memory.MemoryStore.putIterator(MemoryStore.scala:223){code}
{code:java}
at
org.apache.spark.storage.memory.MemoryStore.putIteratorAsBytes(MemoryStore.scala:352)
at
org.apache.spark.storage.BlockManager.$anonfun$doPutIterator$1(BlockManager.scala:1614)
at
org.apache.spark.storage.BlockManager.org$apache$spark$storage$BlockManager$$doPut(BlockManager.scala:1524)
at
org.apache.spark.storage.BlockManager.doPutIterator(BlockManager.scala:1588)
at
org.apache.spark.storage.BlockManager.getOrElseUpdate(BlockManager.scala:1389)
at
org.apache.spark.storage.BlockManager.getOrElseUpdateRDDBlock(BlockManager.scala:1343)
at org.apache.spark.rdd.RDD.getOrCompute(RDD.scala:379)
at org.apache.spark.rdd.RDD.iterator(RDD.scala:329)
at
org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:52)
at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:367)
at org.apache.spark.rdd.RDD.iterator(RDD.scala:331)
at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:93)
at
org.apache.spark.TaskContext.runTaskWithListeners(TaskContext.scala:166)
at org.apache.spark.scheduler.Task.run(Task.scala:141)
at
org.apache.spark.executor.Executor$TaskRunner.$anonfun$run$4(Executor.scala:620)
at
org.apache.spark.util.SparkErrorUtils.tryWithSafeFinally(SparkErrorUtils.scala:64)
at
org.apache.spark.util.SparkErrorUtils.tryWithSafeFinally$(SparkErrorUtils.scala:61)
at org.apache.spark.util.Utils$.tryWithSafeFinally(Utils.scala:94)
at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:623)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:750)
Caused by: java.lang.UnsupportedOperationException
at
java.util.Collections$UnmodifiableCollection.add(Collections.java:1057)
at
com.esotericsoftware.kryo.serializers.CollectionSerializer.read(CollectionSerializer.java:134)
at
com.esotericsoftware.kryo.serializers.CollectionSerializer.read(CollectionSerializer.java:40)
at com.esotericsoftware.kryo.Kryo.readObject(Kryo.java:731){code}
{code:java}
at com.esotericsoftware.kryo.Kryo.readObject(Kryo.java:731)
at
com.esotericsoftware.kryo.serializers.ObjectField.read(ObjectField.java:125)
... 58 more
24/12/04 01:53:19 INFO TaskSetManager: Starting task 3.1 in stage 175.0 (TID
9109) (ip-10-0-99-46.us-west-2.compute.internal, executor 6, partition 3,
PROCESS_LOCAL, 9798 bytes) {code}
{code:java}
24/12/04 01:53:20 INFO TaskSetManager: Lost task 4.0 in stage 175.0 (TID 7641)
on ip-10-0-99-46.us-west-2.compute.internal, executor 6:
com.esotericsoftware.kryo.KryoException (java.lang.UnsupportedOperationException
Serialization trace:
reserved (org.apache.avro.Schema$Field)
fieldMap (org.apache.avro.Schema$RecordSchema)
right (org.apache.hudi.common.util.collection.ImmutablePair)) [duplicate 1]
24/12/04 01:53:20 INFO TaskSetManager: Starting task 4.1 in stage 175.0 (TID
9110) (ip-10-0-90-216.us-west-2.compute.internal, executor 4, partition 4,
PROCESS_LOCAL, 9798 bytes)
24/12/04 01:53:21 INFO BlockManagerInfo: Added rdd_359_37 in memory on
ip-10-0-66-62.us-west-2.compute.internal:40389 (size: 363.0 B, free: 29.4 GiB)
24/12/04 01:53:21 INFO TaskSetManager: Finished task 37.0 in stage 175.0 (TID
7674) in 43143 ms on ip-10-0-66-62.us-west-2.compute.internal (executor 10)
(43/79)
24/12/04 01:53:21 INFO TaskSetManager: Lost task 9.0 in stage 175.0 (TID 7646)
on ip-10-0-118-204.us-west-2.compute.internal, executor 9:
com.esotericsoftware.kryo.KryoException (java.lang.UnsupportedOperationException
Serialization trace:
reserved (org.apache.avro.Schema$Field)
fieldMap (org.apache.avro.Schema$RecordSchema)
right (org.apache.hudi.common.util.collection.ImmutablePair)) [duplicate 2]
24/12/04 01:53:21 INFO TaskSetManager: Starting task 9.1 in stage 175.0 (TID
9111) (ip-10-0-88-53.us-west-2.compute.internal, executor 5, partition 9,
PROCESS_LOCAL, 9798 bytes)
24/12/04 01:53:21 INFO BlockManagerInfo: Added rdd_359_23 in memory on
ip-10-0-106-23.us-west-2.compute.internal:39533 (size: 362.0 B, free: 29.6 GiB)
24/12/04 01:53:21 INFO TaskSetManager: Finished task 23.0 in stage 175.0 (TID
7660) in 43215 ms on ip-10-0-106-23.us-west-2.compute.internal (executor 7)
(44/79)
24/12/04 01:53:21 INFO TaskSetManager: Lost task 12.0 in stage 175.0 (TID 7649)
on ip-10-0-88-53.us-west-2.compute.internal, executor 5:
com.esotericsoftware.kryo.KryoException
(java.lang.UnsupportedOperationException {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)