The number of tasks is decided by the input partition numbers. If you want only one map or flatMap at once, just call coalesce() or repartition() to associate data into one partition. However, this is not recommend because it was not executed parallel efficiently.
2014-10-28 17:27 GMT+08:00 <[email protected]>: > Hi, > > I am currently struggling with how to properly set Spark to perform only > one map, flatMap, etc at once. In other words my map uses multi core > algorithm so I would like to have only one map running to be able to use > all the machine cores. > > Thank you in advance for advices and replies. > > Jan > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [email protected] > For additional commands, e-mail: [email protected] > >
