Is there a way to decide what RDDs get cached in the Spark Runner?

augusto . mcc Tue, 14 May 2019 05:11:39 -0700

Hi,

I guess the title says it all, right now it seems like BEAM caches all the 
intermediate RDD results for my pipeline when using the Spark runner, this 
leads to a very inefficient usage of memory. Any way to control this?


Best regards,
Augusto

Is there a way to decide what RDDs get cached in the Spark Runner?

Reply via email to