Github user JoshRosen commented on a diff in the pull request:
https://github.com/apache/spark/pull/1381#discussion_r16908892
--- Diff:
core/src/main/scala/org/apache/spark/rdd/OrderedRDDFunctions.scala ---
@@ -67,4 +67,39 @@ class OrderedRDDFunctions[K : Ordering : ClassTag,
}
}, preservesPartitioning = true)
}
+
+ /**
+ * Returns an RDD in which only elements in the range `lower` to
`upper`. If the RDD has been
+ * partitioned using the `RangePartitioner` this is an operation that
can be done efficiently.
+ * If not a standard `filter` is used.
+ */
+ def filterByRange(lower: K, upper: K) : RDD[P] = {
+ val partitionIndicies =
+ self.partitioner match {
+ case Some(p) => {
+ p match {
+ case rp: RangePartitioner[K, V] => {
+ (rp.getPartition(lower),rp.getPartition(upper)) match {
--- End diff --
Style nit: there should be extra space after the comma. The same goes for
a few places in the code below, too.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]