You can remove cached RDDs by calling unpersist() on them.

You can also use SparkContext.getRDDStorageInfo to get info on cache usage, 
though this is a developer API so it may change in future versions. We will add 
a standard API eventually but this is just very closely tied to framework 
internals.

Matei

On May 28, 2014, at 5:32 PM, Sung Hwan Chung <coded...@cs.stanford.edu> wrote:

> Hi,
> 
> Is there a programmatic way of checking whether RDD has been 100% cached or 
> not? I'd like to do this to have two different path ways.
> 
> Additionally, how do you clear cache (e.g. if you want to cache different 
> RDDs, and you'd like to clear an existing cached RDD).
> 
> Thanks!

Reply via email to