[
https://issues.apache.org/jira/browse/HBASE-13992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14625515#comment-14625515
]
Ted Malaska commented on HBASE-13992:
-------------------------------------
Oh, to help with the reviews: all the magic is in HBaseContext.
Everything else is one of the following:
1. Examples
2. Tests
3. Implicit Scala Functions
4. Java port
Also, this patch doesn't include the following, which will come in later patches:
1. Validation that the code will be able to accept new HBase Kerberos tickets
given through Spark-Submit in Yarn-Cluster mode.
2. Integration with DataFrames. This is easy to do; I just wanted to separate
it out into a different JIRA.
3. Better unit testing. I'm testing every function with the HBase test
cluster, but I'm not the best at unit testing, so in a following patch I will work
with others to add more tests.
4. More Examples. I would like to build on common Spark Streaming use cases with
HBase.
5. Documentation.
> Integrate SparkOnHBase into HBase
> ---------------------------------
>
> Key: HBASE-13992
> URL: https://issues.apache.org/jira/browse/HBASE-13992
> Project: HBase
> Issue Type: Bug
> Components: spark
> Reporter: Ted Malaska
> Assignee: Ted Malaska
> Fix For: 2.0.0
>
> Attachments: HBASE-13992.patch
>
>
> This JIRA is to ask if SparkOnHBase can find a home inside HBase core.
> Here is the github:
> https://github.com/cloudera-labs/SparkOnHBase
> I am the core author of this project and the license is Apache 2.0
> A blog explaining this project is here
> http://blog.cloudera.com/blog/2014/12/new-in-cloudera-labs-sparkonhbase/
> A Spark Streaming example is here
> http://blog.cloudera.com/blog/2014/11/how-to-do-near-real-time-sessionization-with-spark-streaming-and-apache-hadoop/
> A real customer using this in production is blogged here
> http://blog.cloudera.com/blog/2015/03/how-edmunds-com-used-spark-streaming-to-build-a-near-real-time-dashboard/
> Please debate and let me know what I can do to make this happen.