Hi, I am having trouble understanding the following problem: I counted the number of records before and after applying the mapValues transformation to a JavaPairRDD. As expected, the number of records was the same before and after.
Next, I counted the number of distinct keys before and after applying the mapValues transformation to the same JavaPairRDD. However, the count is lower after the transformation. I expected mapValues not to change the keys, so why am I getting fewer distinct keys? Note that the total number of records is the same; only the number of distinct keys has dropped. I am using spark-1.4.1. Thanks in advance. Regards, Sushrut Ikhar <https://about.me/sushrutikhar>
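For illustration, the expectation described above can be modeled in plain Java without Spark. This is only a sketch, not the code from the actual job: the class `MapValuesCheck` and the helpers `sample`, `mapValues`, and `countDistinctKeys` are hypothetical names, and the list of entries stands in for a JavaPairRDD. It shows the property being assumed: a transformation that touches only the values should leave both the record count and the distinct-key count unchanged.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.function.Function;

// Hypothetical plain-Java model of the check; a List of entries stands in for a JavaPairRDD.
class MapValuesCheck {

    // Small sample with three records and two distinct keys ("a" and "b").
    public static List<Map.Entry<String, Integer>> sample() {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<Map.Entry<String, Integer>>();
        pairs.add(new SimpleEntry<String, Integer>("a", 1));
        pairs.add(new SimpleEntry<String, Integer>("a", 2));
        pairs.add(new SimpleEntry<String, Integer>("b", 3));
        return pairs;
    }

    // Stand-in for mapValues: apply f to each value and copy each key unchanged.
    public static <K, V, U> List<Map.Entry<K, U>> mapValues(
            List<Map.Entry<K, V>> pairs, Function<V, U> f) {
        List<Map.Entry<K, U>> out = new ArrayList<Map.Entry<K, U>>();
        for (Map.Entry<K, V> p : pairs) {
            out.add(new SimpleEntry<K, U>(p.getKey(), f.apply(p.getValue())));
        }
        return out;
    }

    // Count distinct keys using the keys' own equals() and hashCode().
    public static <K, V> long countDistinctKeys(List<Map.Entry<K, V>> pairs) {
        Set<K> keys = new HashSet<K>();
        for (Map.Entry<K, V> p : pairs) {
            keys.add(p.getKey());
        }
        return keys.size();
    }

    public static void main(String[] args) {
        List<Map.Entry<String, Integer>> before = sample();
        List<Map.Entry<String, Integer>> after = mapValues(before, (Integer v) -> v * 10);

        System.out.println("records before/after: " + before.size() + " / " + after.size());
        System.out.println("distinct keys before/after: "
                + countDistinctKeys(before) + " / " + countDistinctKeys(after));
    }
}
```

In this model the two distinct-key counts always agree, because the keys are copied as-is and compared with stable `equals()` and `hashCode()` implementations. The Spark job reports something different, which is the behaviour being asked about.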
