Fwd: Re: reading a specific key-value

Yadid Ayzenberg Fri, 13 Dec 2013 12:09:06 -0800

oops, ,meant to send to the entire list...


-------- Original Message --------
Subject:        Re: reading a specific key-value
Date:   Fri, 13 Dec 2013 14:56:22 -0500
From:   Yadid Ayzenberg <ya...@media.mit.edu>
To:     K. Shankari <shank...@eecs.berkeley.edu>

Its says more efficient if the RDD has a "known" partitioner. What doesthat mean?


Yadid


On 12/13/13 2:11 PM, K. Shankari wrote:

I think that you want the lookup() method in PairRDDFunctions?
http://spark.incubator.apache.org/docs/latest/api/core/index.html#org.apache.spark.rdd.PairRDDFunctions

It is supposed to be more efficient than filter...

Shankari

On Thu, Dec 12, 2013 at 7:30 PM, Yadid <ya...@media.mit.edu<mailto:ya...@media.mit.edu>> wrote:


    I have a pairRDD and I would like to access a specific key-value.
    The first thing that comes to mind is filtering using the
    specified key, but that seems very inefficient as that would
    iterate over the entire RDD. And even more so if I need to access
    several keys.

    Is there any other way to perform this ? this seems like a really
    useful feature. Im guessing that in order to implement this, I
    would need a mapping of keys to partitions, and a method to access
    data from a specific partition.

    Yadid

Fwd: Re: reading a specific key-value

Reply via email to