Please see the doc at the beginning of the RDD class:

 * A Resilient Distributed Dataset (RDD), the basic abstraction in Spark. Represents an immutable,
 * partitioned collection of elements that can be operated on in parallel. This class contains the
 * basic operations available on all RDDs, such as `map`, `filter`, and `persist`. In addition,
On Fri, May 6, 2016 at 5:25 AM, HARSH TAKKAR <takkarha...@gmail.com> wrote:

> Hi
>
> Is there a way I can modify an RDD in a for-each loop?
>
> Basically, I have a use case in which I need to perform multiple iterations
> over the data and modify a few values in each iteration.
>
> Please help.
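
Since the RDD class doc describes an RDD as immutable, the usual way to handle a use case like the one asked about is probably not to modify the RDD inside the loop. Instead, each iteration derives a new RDD with a transformation such as `map` and reassigns the variable. Below is a minimal sketch of that idea, not a definitive implementation. The sample data, the key "b", the object name `IterativeUpdate`, the app name, and the `local[*]` master are all made up for illustration.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.rdd.RDD

object IterativeUpdate {
  def main(args: Array[String]): Unit = {
    // App name and local master are placeholders for this sketch.
    val conf = new SparkConf().setAppName("iterative-update").setMaster("local[*]")
    val sc = new SparkContext(conf)
    try {
      // Hypothetical input: a few (key, value) pairs.
      var current: RDD[(String, Int)] =
        sc.parallelize(Seq(("a", 1), ("b", 2), ("c", 3)))

      for (i <- 1 to 3) {
        // map returns a new RDD; the previous one is never changed in place.
        // Here only the value for the (hypothetical) key "b" is updated.
        current = current.map { case (k, v) =>
          if (k == "b") (k, v + i) else (k, v)
        }
      }

      // Transformations are lazy; collect() triggers the actual computation.
      current.collect().foreach(println)
    } finally {
      sc.stop()
    }
  }
}
```

Each pass only extends the lineage of `current`, so with many iterations it may be worth calling `persist` on the intermediate RDD, or checkpointing it, to avoid recomputing the whole chain.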