Hi, Please use proper subject when sending email to user@ In your example below, what do the values inside curly braces represent ? I assume not the keys since values for same key should go to the same partition.
Cheers On Mon, Jan 11, 2016 at 10:51 AM, Daniel Imberman <daniel.imber...@gmail.com > wrote: > Hi all, > > I'm looking for a way to efficiently partition an RDD, but allow the same > data to exists on multiple partitions. > > > Lets say I have a key-value RDD with keys {1,2,3,4} > > I want to be able to to repartition the RDD so that so the partitions look > like > > p1 = {1,2} > p2 = {2,3} > p3 = {3,4} > > Locality is important in this situation as I would be doing internal > comparison values. > > Does anyone have any thoughts as to how I could go about doing this? > > Thank you >