Lets say I have an RDD of Strings and there are 26 machines in the cluster. How can I repartition the RDD in such a way that all strings starting with A gets collected on machine1, B on machine2 and so on.
- RDD and Partition David Thomas
- Re: RDD and Partition Nick Pentreath
- Re: RDD and Partition Mark Hamstra
- Re: RDD and Partition David Thomas
- Re: RDD and Partition Mark Hamstra
- Re: RDD and Partition Mark Hamstra
- Re: RDD and Partition Christopher Nguyen
- Re: RDD and Partition Mark Hamstra
- Re: RDD and Partition Christopher Nguyen
- Re: RDD and Partition David Thomas
- Re: RDD and Partition Christopher Nguyen
