Hi Reading the docs it seems that the parallel methods (on the same node) all use process to process communication over TCP rather than threads. Correct me if I'm wrong here. It seems to be the same sort of approach as Python multiprocessing (due to its single threaded runtime). This approach requires a data copy ( 2 actually) and theoretically puts a high overhead on pipeline type processing because of the data copy, transmission time and consequent memory churn. Am I reading this correctly?
Thanks -- bob
