[
https://issues.apache.org/jira/browse/SPARK-2447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14200554#comment-14200554
]
Ted Malaska commented on SPARK-2447:
------------------------------------
Hey Norman,
Totally agree. TD and I talked about SparkOnHBase at Hadoop World. Times
where crazy leading up to Hadoop World.
So I'm doing the following things:
1. I'm writing up a Blog for SparkOnHBase
2. TD is working on directions for how this code should be integrated with Spark
3. I have been working out little bugs with Java integration
4. I want to build a couple more examples
5. I'm having a problem with Maven where the Java JUnits are not executing
6. I adding support for Kerberos
But yes the facade is coming. :)
Let me know if you want to help. Just do a pull request on
https://github.com/tmalaska/SparkOnHBase
> Add common solution for sending upsert actions to HBase (put, deletes, and
> increment)
> -------------------------------------------------------------------------------------
>
> Key: SPARK-2447
> URL: https://issues.apache.org/jira/browse/SPARK-2447
> Project: Spark
> Issue Type: New Feature
> Components: Spark Core, Streaming
> Reporter: Ted Malaska
> Assignee: Ted Malaska
>
> Going to review the design with Tdas today.
> But first thoughts is to have an extension of VoidFunction that handles the
> connection to HBase and allows for options such as turning auto flush off for
> higher through put.
> Need to answer the following questions first.
> - Can it be written in Java or should it be written in Scala?
> - What is the best way to add the HBase dependency? (will review how Flume
> does this as the first option)
> - What is the best way to do testing? (will review how Flume does this as the
> first option)
> - How to support python? (python may be a different Jira it is unknown at
> this time)
> Goals:
> - Simple to use
> - Stable
> - Supports high load
> - Documented (May be in a separate Jira need to ask Tdas)
> - Supports Java, Scala, and hopefully Python
> - Supports Streaming and normal Spark
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]