Hi! I load data from list( sc.parallelize() ) with length about 1400000 items. After that I run data.filter(func1).map(func2). This operation runs less, then a second. But after that function count() (or collect() ) takes about 30 seconds. Please, help me to reduce this time! Best Regards, Valentin
- RDD.count() take a lot of time Valentin Michajlenko
- Re: RDD.count() take a lot of time Meisam Fathi
- Re: RDD.count() take a lot of time Valentin Michajlenko
- Re: RDD.count() take a lot of time Meisam Fathi
- Re: RDD.count() take a lot of time Valentin Michajlenko
