Re: pre-filtered hadoop RDD use case

2014-07-29 Thread Reynold Xin
on the issue. If you think there is PR in this I’d be happy to code it up and submit it. Thank you -- Eugene Cheipesh -- View this message in context: http://apache-spark-developers-list.1001551.n3.nabble.com/pre-filtered-hadoop-RDD-use-case-tp7484.html Sent from the Apache Spark Developers

RE: pre-filtered hadoop RDD use case

2014-07-29 Thread Yan Zhou.sc
12:55 AM To: dev@spark.apache.org Subject: Re: pre-filtered hadoop RDD use case Would something like this help? https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/rdd/PartitionPruningRDD.scala On Thu, Jul 24, 2014 at 8:40 AM, Eugene Cheipesh echeip...@gmail.com

Re: pre-filtered hadoop RDD use case

2014-07-29 Thread Reynold Xin
Message- From: Reynold Xin [mailto:r...@databricks.com] Sent: Tuesday, July 29, 2014 12:55 AM To: dev@spark.apache.org Subject: Re: pre-filtered hadoop RDD use case Would something like this help? https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/rdd

RE: pre-filtered hadoop RDD use case

2014-07-29 Thread Yan Zhou.sc
...@databricks.com] Sent: Tuesday, July 29, 2014 11:44 AM To: dev@spark.apache.org Subject: Re: pre-filtered hadoop RDD use case I am not sure if I agree that it lacks the mechanism to do pushdowns. Hadoop InputFormat itself provides some basic mechanism to push down predicates already. The HBase