[ 
https://issues.apache.org/jira/browse/GORA-485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16791658#comment-16791658
 ] 

Kevin Ratnasekera commented on GORA-485:
----------------------------------------

Hi John, 

Thanks for your interest in project. Apache Gora basically provides key value 
based datastore abstraction to persist, query over real databases/backends like 
Eg:- HBase, such persisted data can be analysed using Hadoop, Spark. Official 
website [2] and wiki [1] has very good material to get things started. 

Try Go through Apache Gora source code and understand how datastore abstraction 
is implemented other datastores like Eg:- HBase. Then try to understand  Kudu 
data model [3] and see how you can apply key value based abstraction here. 

Kudu is written from C++, however you don't to deal native libraries. There 
java client [4] which you can use to talking to Kudu server to perform all the 
datastore functions. However when you write test cases you will need to start 
embedded Kudu server, at there you need to come with a solution which you can 
propose. 

I suggest you to go through source code, build it and start using it. Once you 
done fix some issues before you make a proposal as warm up. Subscribe to our 
dev list which you can start a discussion over there, you can discuss these 
things in more wider audience.

[1] https://cwiki.apache.org/confluence/display/GORA/
[2] http://gora.apache.org/index.html
[3] https://kudu.apache.org/overview.html
[4] https://github.com/apache/kudu/tree/master/examples/java

Regards
Kevin

> Apache Kudu datastore for Gora
> ------------------------------
>
>                 Key: GORA-485
>                 URL: https://issues.apache.org/jira/browse/GORA-485
>             Project: Apache Gora
>          Issue Type: New Feature
>    Affects Versions: 0.8
>            Reporter: Kevin Ratnasekera
>            Assignee: Cihad Guzel
>            Priority: Major
>              Labels: gora-kudu, gsoc2017, gsoc2019
>             Fix For: 0.9
>
>
> Apache Kudu is getting lot of attention. Kudu is a key/value store optimized 
> for analytic workloads. [1] 
> [1] 
> https://blog.cloudera.com/blog/2015/09/kudu-new-apache-hadoop-storage-for-fast-analytics-on-fast-data/



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to