[ 
https://issues.apache.org/jira/browse/HBASE-13992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14625515#comment-14625515
 ] 

Ted Malaska commented on HBASE-13992:
-------------------------------------

To help with the reviews: all the magic is in HBaseContext.

Everything else is one of the following:
1. Examples
2. Tests
3. Implicit Scala Functions
4. Java port

Also, this patch doesn't include the following, which will come in later patches:
1. Validation that the code will be able to accept new HBase Kerberos tickets 
given through Spark-Submit in Yarn-Cluster mode.
2. Integration with DataFrames.  This is easy to do; I just wanted to separate 
it out into a different JIRA.
3. Better unit testing.  I'm testing every function with the HBase test 
cluster, but I'm not the best at unit testing, so in a following patch I will 
work with others to add more tests.
4. More examples.  I would like to build on common Spark Streaming use cases 
with HBase.
5. Documentation.

> Integrate SparkOnHBase into HBase
> ---------------------------------
>
>                 Key: HBASE-13992
>                 URL: https://issues.apache.org/jira/browse/HBASE-13992
>             Project: HBase
>          Issue Type: Bug
>          Components: spark
>            Reporter: Ted Malaska
>            Assignee: Ted Malaska
>             Fix For: 2.0.0
>
>         Attachments: HBASE-13992.patch
>
>
> This Jira is to ask if SparkOnHBase can find a home inside HBase core.
> Here is the github: 
> https://github.com/cloudera-labs/SparkOnHBase
> I am the core author of this project and the license is Apache 2.0
> A blog explaining this project is here
> http://blog.cloudera.com/blog/2014/12/new-in-cloudera-labs-sparkonhbase/
> A Spark Streaming example is here
> http://blog.cloudera.com/blog/2014/11/how-to-do-near-real-time-sessionization-with-spark-streaming-and-apache-hadoop/
> A real customer using this in production is blogged here
> http://blog.cloudera.com/blog/2015/03/how-edmunds-com-used-spark-streaming-to-build-a-near-real-time-dashboard/
> Please debate and let me know what I can do to make this happen.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
