Hi

I am new to Spark. I am confused between correlation in threads and
physical cores.

As per my understanding, according to number of partitions in data set,
number of tasks is created. For example I have a machine which has 10
physical cores and I have data set which has 100 partitions then in
Executor JVM 100 tasks (one per each partitioner will be created)

Query 1) But how will it be decided how many threads in Executor are
created to execute these 100 tasks and who creates these threads.

Query 2) Does parameter "total-executor-cores" define how many threads will
be launched in executor JVM to process tasks. If not than what is meaning
of "total-executor-cores" in context of both threads inside Executor JVM
and physical cores.

Thanks and Regards
Aniruddh

Reply via email to