Thanks for your reply:
What is the size of RDD two?
RDD two is a paried rdd, during iterating, its size may differ from
40000 to 4500000.
You want to map à line from RDD one to multiple values from RDD two and
get the sum of all of them?
Yes
So as result you would have an rdd of size RDD1 and containing a number
per line?
Yes
Thank you again. This problem has puzzled us for several days...
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/How-to-efficiently-join-this-two-complicated-rdds-tp1665p1674.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.