Re: RDD storage in spark steaming

Jeffrey Jedele Mon, 23 Mar 2015 07:43:47 -0700

Hey Abhi,
many of StreamingContext's methods to create input streams take a
StorageLevel parameter to configure this behavior. RDD partitions are
generally stored in the in-memory cache of worker nodes I think. You can
also configure replication and spilling to disk if needed.


Regards,
Jeff

2015-03-23 15:26 GMT+01:00 abhi <abhishek...@gmail.com>:

> HI,
> i have a simple question about creating RDD . Whenever RDD is created in
> spark streaming for the particular time window .When does the RDD gets
> stored .
>
> 1. Does it get stored at the Driver machine ? or it gets stored on all the
> machines in the cluster ?
> 2. Does the data gets stored in memory by default ? Can it store at the
> memory and disk ? How can it configured ?
>
>
> Thanks,
> Abhi
>
>

Re: RDD storage in spark steaming

Reply via email to