Spark on k8s : spark 3.0.1 spark.kubernetes.executor.deleteontermination issue
Hi spark team Have cluster wide property spark.kubernetis.executor.deleteontermination to true. During the long running job, some of the executor got deleted which have shuffle data. Because of this, in the subsequent stage , we get lot of spark shuffle fetch fail exceptions. Please let me know , is there a way to fix it. Note if setting above property to false , I face no shuffle fetch exception. Regards Pralabh
question of shorten syntax for rdd
Hello May I know from what version of spark, the RDD syntax can be shorten as this? rdd.groupByKey().mapValues(lambda x:len(x)).collect() [('b', 2), ('d', 1), ('a', 2)] rdd.groupByKey().mapValues(len).collect() [('b', 2), ('d', 1), ('a', 2)] I know in scala the syntax: xxx(x => x.len) can be written as: xxx(_.len). But I never know in pyspark the "_" placeholder can even be ignored. Thank you. - To unsubscribe e-mail: user-unsubscr...@spark.apache.org