[ 
https://issues.apache.org/jira/browse/HBASE-17933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15975161#comment-15975161
 ] 

Yi Liang commented on HBASE-17933:
----------------------------------

Hi Sean, 
  Thanks for review, for your second comment, when I start designing the API, I 
also want to use the approach you mentioned above. But since it is a java api, 
we have some limitation in Java language. From its scala API
{code}
  def bulkLoad[T](rdd:RDD[T],
                  tableName: TableName,
                  flatMap: (T) => Iterator[(KeyFamilyQualifier, Array[Byte])],
                  stagingDir:String,
                  familyHFileWriteOptionsMap:
                  util.Map[Array[Byte], FamilyHFileWriteOptions] =
                  new util.HashMap[Array[Byte], FamilyHFileWriteOptions],
                  compactionExclude: Boolean = false,
                  maxSize:Long = HConstants.DEFAULT_MAX_FILE_SIZE)
{code}

As you can see,  the scala API allow callers to provide a method can transform 
any type [T] to a key-value pair (KeyFamilyQualifier, Array[Byte]).  But in 
Java, a method(actually a class that implement mapFunction class) can not 
directly transfer T to a kv pair like scala, since Java does not support (k,v) 
pair I need to transfer it to a  defined type, and in scala transfer this type 
to the kv pair.

> [hbase-spark]  Support Java api for bulkload
> --------------------------------------------
>
>                 Key: HBASE-17933
>                 URL: https://issues.apache.org/jira/browse/HBASE-17933
>             Project: HBase
>          Issue Type: New Feature
>          Components: spark
>    Affects Versions: 2.0.0
>            Reporter: Yi Liang
>            Assignee: Yi Liang
>             Fix For: 2.0.0
>
>         Attachments: HBase-17933-V1.patch
>
>
> In JavaHBaseContext, there are java api for bulkPut, bulkDelete ...., but no 
> Java api for bulkload. And this jira will add bulkload java api to hbase-spark



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to