Github user suyanNone commented on the pull request:

    https://github.com/apache/spark/pull/3582#issuecomment-66031430
  
    @JoshRosen 
    
    My guess is that there are two cases:
    1. Two threads in the same executor
    1.1 The executor has 4 cores and CPUs per task is 1.
                         RDDC <-- other:  TaskSet1
    RDDA.cache <--
                         RDDB <-- other:  TaskSet2
           Thread A is given to TaskSet1.
           Thread B is given to TaskSet2, because A can't get any more tasks, either because of locality or because all of A's tasks are already scheduled.
           It can then happen that threads A and B both process the same partition at the same time.
     
    
    2. Two threads in different executors
    2.1 blockA is replicated to ExecutorB while ExecutorB is itself just about to cache blockA after calling cacheManager.putInBlockManager.
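
Case 1 can be illustrated with a toy simulation in plain Python. This is only a sketch, not Spark code: `run_two_tasks`, the block id string, and the dictionary used as a cache are all hypothetical stand-ins, and the `threading.Barrier` exists only to force the unlucky interleaving so that the outcome is repeatable. It shows two tasks that both miss the cache for the same partition and both compute it, and how a per-block lock removes the duplicate work.

```python
import threading


def run_two_tasks(coordinated):
    """Run two tasks that need the same cached partition.

    Returns how many times the partition was actually computed.
    All names here are hypothetical; this does not mirror Spark's internals.
    """
    cache = {}                          # block id -> cached partition data
    computes = []                       # one entry per real computation
    guard = threading.Lock()            # protects `cache` and `computes`
    block_lock = threading.Lock()       # per-block lock, used only when coordinated
    both_missed = threading.Barrier(2)  # forces both threads past the miss check

    def compute_partition():
        with guard:
            computes.append(threading.current_thread().name)
        return [x * 2 for x in range(4)]

    def task():
        block_id = "rdd_A_partition_0"  # hypothetical block id
        if coordinated:
            # Check and compute under one lock: the second thread waits,
            # then finds the block already cached.
            with block_lock:
                if block_id not in cache:
                    cache[block_id] = compute_partition()
            return
        # Uncoordinated check-then-act: both threads can see a miss.
        with guard:
            missing = block_id not in cache
        if missing:
            both_missed.wait(timeout=5)  # simulate the unlucky timing
            data = compute_partition()
            with guard:
                cache[block_id] = data

    names = ("task-from-TaskSet1", "task-from-TaskSet2")
    threads = [threading.Thread(target=task, name=n) for n in names]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return len(computes)


if __name__ == "__main__":
    print(run_two_tasks(coordinated=False))  # 2: both threads computed the partition
    print(run_two_tasks(coordinated=True))   # 1: the second thread reused the cached block
```

Whether a per-block lock is the right fix for the pull request is a separate question. The sketch only makes the suspected interleaving concrete.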


