Antonin Delpeuch created SPARK-31518:
----------------------------------------

             Summary: Expose filterByRange in JavaPairRDD
                 Key: SPARK-31518
                 URL: https://issues.apache.org/jira/browse/SPARK-31518
             Project: Spark
          Issue Type: Improvement
          Components: Spark Core
    Affects Versions: 2.4.5
            Reporter: Antonin Delpeuch


The `filterByRange` method efficiently filters a sorted RDD using bounds on its 
keys. If the RDD is partitioned by a RangePartitioner, it skips the partitions 
whose key range cannot overlap the bounds instead of scanning them.
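
To illustrate why this pruning matters, here is a minimal, Spark-free sketch of the idea in plain Java. It is not Spark's implementation: `RangePruningSketch`, `Partition`, and `Result` are hypothetical names made up for this example, keys are plain integers, every partition is assumed to be non-empty and sorted, and the bounds are assumed to be inclusive on both ends.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class RangePruningSketch {

    /** One partition of a key-sorted dataset, with the key bounds it covers. */
    static final class Partition {
        final int minKey;
        final int maxKey;
        final List<Integer> keys;

        // Assumption: sortedKeys is non-empty and in ascending order.
        Partition(Integer... sortedKeys) {
            this.keys = Arrays.asList(sortedKeys);
            this.minKey = sortedKeys[0];
            this.maxKey = sortedKeys[sortedKeys.length - 1];
        }
    }

    /** Keys within [lower, upper], plus how many partitions were actually scanned. */
    static final class Result {
        final List<Integer> keys = new ArrayList<>();
        int partitionsScanned = 0;
    }

    static Result filterByRange(List<Partition> partitions, int lower, int upper) {
        Result result = new Result();
        for (Partition p : partitions) {
            // Prune: this partition's key range cannot overlap [lower, upper].
            if (p.maxKey < lower || p.minKey > upper) {
                continue;
            }
            result.partitionsScanned++;
            // Only overlapping partitions are scanned key by key.
            for (int key : p.keys) {
                if (key >= lower && key <= upper) {
                    result.keys.add(key);
                }
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<Partition> partitions = Arrays.asList(
            new Partition(1, 3, 5),
            new Partition(8, 10, 12),
            new Partition(15, 20, 25));

        Result r = filterByRange(partitions, 9, 16);
        // Prints: [10, 12, 15] from 2 of 3 partitions
        System.out.println(r.keys + " from " + r.partitionsScanned
            + " of " + partitions.size() + " partitions");
    }
}
```

A plain `filter` would have to visit all three partitions; knowing each partition's key bounds lets the first one be skipped without reading its contents.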

This method is part of the Scala API, defined in OrderedRDDFunctions, but is 
not exposed in the Java API as far as I can tell. All other methods defined in 
OrderedRDDFunctions are exposed in JavaPairRDD, so it seems natural to expose 
`filterByRange` there too.



