Hi,
In the Scala API, you extend Partitioner and override the numPartitions and
getPartition methods, as shown below:
import org.apache.spark.Partitioner

class MyPartitioner extends Partitioner {
  // Return the number of partitions
  override def numPartitions: Int = ???
  // Return the partition for a given key, in the range [0, numPartitions)
  override def getPartition(key: Any): Int = ???
}
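
Since the question was about PySpark: there the usual route is
RDD.partitionBy(numPartitions, partitionFunc), which takes a plain function
from key to int and reduces its result modulo numPartitions. Below is a
minimal sketch, not a definitive implementation. The names NUM_PARTITIONS,
my_partitioner and partition_index are hypothetical, and the CRC32 routing
rule is just one stable choice (the built-in hash() on strings can differ
between Python processes). The Spark calls are left as comments because they
assume an existing SparkContext named sc.

```python
import zlib

NUM_PARTITIONS = 4  # hypothetical partition count, pick what suits your data


def my_partitioner(key):
    """Return a non-negative int for a key.

    partitionBy takes this value modulo the number of partitions,
    so the function itself does not need to know the partition count.
    """
    # CRC32 of the key's string form is stable across Python processes.
    return zlib.crc32(str(key).encode("utf-8"))


def partition_index(key, num_partitions=NUM_PARTITIONS):
    """Local helper mirroring the modulo step, handy for checking the routing."""
    return my_partitioner(key) % num_partitions


# With a live SparkContext `sc` (assumed, not created here):
#   pairs = sc.parallelize([("a", 1), ("b", 2), ("a", 3)])
#   partitioned = pairs.partitionBy(NUM_PARTITIONS, my_partitioner)
#   partitioned.glom().collect()  # inspect which keys landed in which partition
```

Note that partitionBy only works on RDDs of (key, value) pairs, and equal keys
always end up in the same partition.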
On Tue, Sep 1, 2015 at 10:15 AM shahid qadri <[email protected]>
wrote:
> Hi Sparkians
>
> How can we create a custom partitioner in pyspark?