Hi,
I am finding it difficult to understand the following problem :
I count the number of records before and after applying the mapValues
transformation for a JavaPairRDD. As expected the number of records were
same before and after.

Now, I counted number of distinct keys before and after applying the
mapValues transformation for the same JavaPairRDD. However, I get less
count after applying the transformation. I expected mapValues will not
change the keys. Then why am I getting lesser distinct keys? Note that -
the total records are the same only distinct keys have dropped.

using spark-1.4.1.

Thanks in advance.

Regards,

Sushrut Ikhar
[image: https://]about.me/sushrutikhar
<https://about.me/sushrutikhar?promo=email_sig>

Reply via email to