tamama commented on PR #37206:
URL: https://github.com/apache/spark/pull/37206#issuecomment-1486861885

   > The exceptions reported in 
[SPARK-39696](https://issues.apache.org/jira/browse/SPARK-39696) are as follows:
   > 
   > ```
   > 2022-06-21 18:17:49.289Z ERROR [executor-heartbeater] 
org.apache.spark.util.Utils - Uncaught exception in thread executor-heartbeater
   > java.util.ConcurrentModificationException: mutation occurred during 
iteration
   >         at 
scala.collection.mutable.MutationTracker$.checkMutations(MutationTracker.scala:43)
 ~[scala-library-2.13.8.jar:?]
   >         at 
scala.collection.mutable.CheckedIndexedSeqView$CheckedIterator.hasNext(CheckedIndexedSeqView.scala:47)
 ~[scala-library-2.13.8.jar:?]
   >         at 
scala.collection.IterableOnceOps.copyToArray(IterableOnce.scala:873) 
~[scala-library-2.13.8.jar:?]
   >         at 
scala.collection.IterableOnceOps.copyToArray$(IterableOnce.scala:869) 
~[scala-library-2.13.8.jar:?]
   >         at 
scala.collection.AbstractIterator.copyToArray(Iterator.scala:1293) 
~[scala-library-2.13.8.jar:?]
   >         at 
scala.collection.IterableOnceOps.copyToArray(IterableOnce.scala:852) 
~[scala-library-2.13.8.jar:?]
   >         at 
scala.collection.IterableOnceOps.copyToArray$(IterableOnce.scala:852) 
~[scala-library-2.13.8.jar:?]
   >         at 
scala.collection.AbstractIterator.copyToArray(Iterator.scala:1293) 
~[scala-library-2.13.8.jar:?]
   >         at 
scala.collection.immutable.VectorStatics$.append1IfSpace(Vector.scala:1959) 
~[scala-library-2.13.8.jar:?]
   >         at 
scala.collection.immutable.Vector1.appendedAll0(Vector.scala:425) 
~[scala-library-2.13.8.jar:?]
   >         at scala.collection.immutable.Vector.appendedAll(Vector.scala:203) 
~[scala-library-2.13.8.jar:?]
   >         at scala.collection.immutable.Vector.appendedAll(Vector.scala:113) 
~[scala-library-2.13.8.jar:?]
   >         at scala.collection.SeqOps.concat(Seq.scala:187) 
~[scala-library-2.13.8.jar:?]
   >         at scala.collection.SeqOps.concat$(Seq.scala:187) 
~[scala-library-2.13.8.jar:?]
   >         at scala.collection.AbstractSeq.concat(Seq.scala:1161) 
~[scala-library-2.13.8.jar:?]
   >         at scala.collection.IterableOps.$plus$plus(Iterable.scala:726) 
~[scala-library-2.13.8.jar:?]
   >         at scala.collection.IterableOps.$plus$plus$(Iterable.scala:726) 
~[scala-library-2.13.8.jar:?]
   >         at 
scala.collection.AbstractIterable.$plus$plus(Iterable.scala:926) 
~[scala-library-2.13.8.jar:?]
   >         at 
org.apache.spark.executor.TaskMetrics.accumulators(TaskMetrics.scala:261) 
~[spark-core_2.13-3.3.0.jar:3.3.0]
   >         at 
org.apache.spark.executor.Executor.$anonfun$reportHeartBeat$1(Executor.scala:1042)
 ~[spark-core_2.13-3.3.0.jar:3.3.0]
   >         at 
scala.collection.IterableOnceOps.foreach(IterableOnce.scala:563) 
~[scala-library-2.13.8.jar:?]
   >         at 
scala.collection.IterableOnceOps.foreach$(IterableOnce.scala:561) 
~[scala-library-2.13.8.jar:?]
   >         at scala.collection.AbstractIterable.foreach(Iterable.scala:926) 
~[scala-library-2.13.8.jar:?]
   >         at 
org.apache.spark.executor.Executor.reportHeartBeat(Executor.scala:1036) 
~[spark-core_2.13-3.3.0.jar:3.3.0]
   >         at 
org.apache.spark.executor.Executor.$anonfun$heartbeater$1(Executor.scala:238) 
~[spark-core_2.13-3.3.0.jar:3.3.0]
   >         at 
scala.runtime.java8.JFunction0$mcV$sp.apply(JFunction0$mcV$sp.scala:18) 
~[scala-library-2.13.8.jar:?]
   >         at 
org.apache.spark.util.Utils$.logUncaughtExceptions(Utils.scala:2066) 
~[spark-core_2.13-3.3.0.jar:3.3.0]
   >         at org.apache.spark.Heartbeater$$anon$1.run(Heartbeater.scala:46) 
~[spark-core_2.13-3.3.0.jar:3.3.0]
   >         at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:539) ~[?:?]
   >         at 
java.util.concurrent.FutureTask.runAndReset(FutureTask.java:305) ~[?:?]
   >         at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:305)
 ~[?:?]
   >         at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136) 
~[?:?]
   >         at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635) 
~[?:?]
   >         at java.lang.Thread.run(Thread.java:833) ~[?:?] 
   > ```
   > 
   > It seems to be a small probability event
   
   Hi - our executors crashed almost every 30 min while using HDFS as 
checkpoint (Kafka source). FYI - using SSHFS as checkpoint would crash the 
executors randomly say once/twice per day...
   
   From our opinion, this is too dangerous in production, as this would cause 
streaming jitters - thus causing unncessary backpressure on our data pipeline. 
...
   
   We intend to fallback to Spark-3.3.1 Scala-2.12 (instead of Scala 2.13)
   
   Is there a Spark LTS release that is stable for production use?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to