Hi all,

I'm trying to run a simple Spark job with spark-shell. All I want to do is count the number of lines in a file. I start the spark-shell with the default arguments, i.e. just ./bin/spark-shell.
I load the text file with sc.textFile("path") and then call count on my data. When I do this, my data is always split into 52 partitions. I don't understand why, since I'm running on a local machine with 8 cores and sc.defaultParallelism gives me 8. Even if I load the file with sc.textFile("path", 8), I still get data.partitions.size = 52. I'm using Spark 1.1.1.

Any ideas?

Cheers,
Jao
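
P.S. In case it helps, here is roughly the shell session I'm describing (the file path is just a placeholder, not my real one):

  // spark-shell started with no extra options: ./bin/spark-shell
  val data = sc.textFile("path/to/file.txt")        // placeholder path
  data.count()                                      // the count itself works
  data.partitions.size                              // returns 52, I expected 8

  sc.defaultParallelism                             // returns 8

  // passing a minimum number of partitions doesn't change anything
  val data8 = sc.textFile("path/to/file.txt", 8)
  data8.partitions.size                             // still 52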