Spark on k8s : spark 3.0.1 spark.kubernetes.executor.deleteontermination issue

2022-01-17 Thread Pralabh Kumar
Hi spark team

Have cluster wide property spark.kubernetis.executor.deleteontermination to
true.
During the long running job, some of the executor got deleted which have
shuffle data. Because of this,  in the subsequent stage , we get lot of
spark shuffle fetch fail exceptions.


Please let me know , is there a way to fix it. Note if setting above
property to false , I face no shuffle fetch exception.


Regards
Pralabh


question of shorten syntax for rdd

2022-01-17 Thread capitnfrakass

Hello

May I know from what version of spark, the RDD syntax can be shorten as 
this?



rdd.groupByKey().mapValues(lambda x:len(x)).collect()

[('b', 2), ('d', 1), ('a', 2)]

rdd.groupByKey().mapValues(len).collect()

[('b', 2), ('d', 1), ('a', 2)]

I know in scala the syntax: xxx(x => x.len) can be written as: 
xxx(_.len).

But I never know in pyspark the "_" placeholder can even be ignored.

Thank you.

-
To unsubscribe e-mail: user-unsubscr...@spark.apache.org