[jira] [Commented] (HBASE-15516) Add flatMap to hbaseRDD

Ted Yu (JIRA) Wed, 23 Mar 2016 13:23:43 -0700

    [ 
https://issues.apache.org/jira/browse/HBASE-15516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15209092#comment-15209092
 ]


Ted Yu commented on HBASE-15516:
--------------------------------

Don't know why there were two hunks on the same source file in the patch.
{code}
[ERROR] 
/Users/tyu/trunk/hbase-spark/src/main/scala/org/apache/hadoop/hbase/spark/HBaseContext.scala:452:
 error: double definition:
[ERROR] method hbaseRDD:[U](tableName: org.apache.hadoop.hbase.TableName, scan: 
org.apache.hadoop.hbase.client.Scan, f: 
((org.apache.hadoop.hbase.io.ImmutableBytesWritable, 
org.apache.hadoop.hbase.client.Result)) => Seq[U])(implicit evidence$6: 
scala.reflect.ClassTag[U])org.apache.spark.rdd.RDD[U] and
[ERROR] method hbaseRDD:[U](tableName: org.apache.hadoop.hbase.TableName, scan: 
org.apache.hadoop.hbase.client.Scan, f: 
((org.apache.hadoop.hbase.io.ImmutableBytesWritable, 
org.apache.hadoop.hbase.client.Result)) => U)(implicit evidence$5: 
scala.reflect.ClassTag[U])org.apache.spark.rdd.RDD[U] at line 436
[ERROR] have same type after erasure: (tableName: 
org.apache.hadoop.hbase.TableName, scan: org.apache.hadoop.hbase.client.Scan, 
f: Function1, implicit evidence$6: reflect.ClassTag)org.apache.spark.rdd.RDD
[ERROR]   def hbaseRDD[U: ClassTag](tableName: TableName, scan: Scan,
[ERROR]       ^
{code}
Please fix the compilation error.

> Add flatMap to hbaseRDD
> -----------------------
>
>                 Key: HBASE-15516
>                 URL: https://issues.apache.org/jira/browse/HBASE-15516
>             Project: HBase
>          Issue Type: Improvement
>          Components: spark
>            Reporter: MahmoudHanafy
>            Priority: Minor
>         Attachments: HBASE-15516.patch
>
>
> HBaseContext supports reading RDDs using hbaseRDD method with map function, I 
> think it will be good also to add flatMap to hbaseRDD.
> Another improvement:
> Currently reading RDDs is done by using method hbaseRDD with map function as 
> the default method for loading RDDs. When trying to just load the RDD without 
> mapping, you are also loading the RDD and map all elements to itself. So I 
> think using hbaseRDD method without mapping function for loading RDDs would 
> be better.
> Also, I can contribute to hbase-spark module. If there any issues related to 
> this module, Please tell me about them



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-15516) Add flatMap to hbaseRDD

Reply via email to