Re: Q: About scenarios where driver execution flow may block...

2014-09-07 Thread Mayur Rustagi
Statements are executed only when you try to cause some effect on the server (produce data, collect data on driver). At the time of execution, Spark does all the dependency resolution, truncates paths that don't go anywhere, and optimizes execution pipelines. So you really don't have to worry about
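
To make the deferred-execution point above concrete, here is a toy model in plain Python. It is not Spark and does not use Spark's API: the LazyDataset class, its map/take methods, and the traced_double helper are all hypothetical names made up for this sketch. It only illustrates the idea that a transformation records work while an action performs it.

```python
# Toy model of lazy evaluation in plain Python. This is NOT Spark's API;
# LazyDataset and traced_double are hypothetical names for illustration.

class LazyDataset:
    def __init__(self, source, steps=()):
        self._source = source        # any iterable of input items
        self._steps = tuple(steps)   # recorded functions, not yet applied

    def map(self, fn):
        # "Transformation": record the step and return immediately.
        return LazyDataset(self._source, self._steps + (fn,))

    def take(self, n):
        # "Action": only now are the recorded steps applied, and only
        # to as many items as are needed to produce n results.
        out = []
        for item in self._source:
            if len(out) >= n:
                break
            for fn in self._steps:
                item = fn(item)
            out.append(item)
        return out


calls = []  # records every input the mapped function actually sees

def traced_double(v):
    calls.append(v)
    return v * 2


ds = LazyDataset(range(100)).map(traced_double)
calls_before_action = len(calls)   # nothing has run yet

first_three = ds.take(3)           # the action triggers the work
calls_after_action = len(calls)    # only the needed items were processed
```

In this sketch the map call returns right away and the driver-side flow only does real work inside take, which is the same shape as the question being asked: the call that forces a result is the one that can block.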

Q: About scenarios where driver execution flow may block...

2014-09-06 Thread didata
Hello friends: I have a theory question about call blocking in a Spark driver. Consider this (admittedly contrived =:)) snippet to illustrate this question...

    x = rdd01.reduceByKey()  # or maybe some other 'shuffle-requiring action'.
    b = sc.broadcast(x.take(20))  # Or any statement that