The executors of my Spark Streaming application are being killed due to memory issues. Memory consumption is quite high on startup because it is the first run and there are quite a few events on the Kafka queues, which are being consumed at a rate of 100K events per second.
I wonder whether it is recommended to use spark.cleaner.ttl and spark.streaming.unpersist together to mitigate that problem. I also wonder whether new RDDs keep being batched while an RDD is still being processed. Something like the sketch below is what I had in mind.
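In case it helps to make the question concrete, this is roughly how I was thinking of setting those two properties (the app name, batch interval, and TTL value are just placeholders, not my real configuration):

import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}

// Placeholder configuration, only to illustrate the two properties in question.
val conf = new SparkConf()
  .setAppName("my-streaming-app")
  // Periodically clean up old metadata and persisted RDDs (value in seconds).
  .set("spark.cleaner.ttl", "3600")
  // Unpersist RDDs generated by the streaming job once they are no longer needed.
  .set("spark.streaming.unpersist", "true")

val ssc = new StreamingContext(conf, Seconds(10))
// ... the Kafka DStream and the rest of the pipeline would be created here ...
ssc.start()
ssc.awaitTermination()

Regards, Luis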