Github user suyanNone commented on the pull request:
https://github.com/apache/spark/pull/3582#issuecomment-66031430
@JoshRosen
I guess:
1. Two threads in the same executor
1.1 The executor has 4 cores, and CPUs per task is 1.
RDDC <-- other: TaskSet1
RDDA.cache <--
RDDB <-- other: TaskSet2
Thread A is given to TaskSet1.
Thread B is given to TaskSet2, because A cannot take any more tasks,
either due to locality or because A's tasks are all already scheduled.
It can then happen that threads A and B both process the same partition
at the same time.
2. Two threads in different executors
2.2 blockA is replicated to ExecutorB just as ExecutorB is about to cache
blockA itself, after calling cacheManager.putInBlockManager.
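
To illustrate case 1, here is a rough sketch of the race, not Spark code. It is a toy simulation in Python; ToyBlockCache, get_or_compute_unsafe, run_two_tasks and the block name "rdd_A_partition_0" are all made up for this example. A barrier forces the interleaving where both threads check the cache before either one stores the block, so both end up computing the same partition.

```python
import threading


class ToyBlockCache:
    """Hypothetical stand-in for an executor-local block cache (not Spark's BlockManager)."""

    def __init__(self):
        self._blocks = {}
        self._lock = threading.Lock()
        self.compute_count = 0  # how many times a block was actually computed

    def get_or_compute_unsafe(self, block_id, compute, barrier):
        # Check-then-act with nothing tying the check to the later put.
        cached = self._blocks.get(block_id)
        # Force the bad interleaving: both threads finish the check
        # before either of them stores the block.
        barrier.wait()
        if cached is not None:
            return cached
        value = compute()
        with self._lock:
            self.compute_count += 1
            self._blocks[block_id] = value
        return value


def run_two_tasks():
    cache = ToyBlockCache()
    barrier = threading.Barrier(2)

    def task():
        # Both tasks read the same cached partition of RDDA (assumed name).
        cache.get_or_compute_unsafe("rdd_A_partition_0", lambda: [1, 2, 3], barrier)

    threads = [threading.Thread(target=task, name=n) for n in ("Thread A", "Thread B")]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return cache.compute_count


if __name__ == "__main__":
    print("partition computed", run_two_tasks(), "times")
```

In this toy version the partition is computed twice, once by each thread, because neither thread sees the other's in-progress work.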