Re: Apache Gora Benchmark

2019-03-26 Thread Sheriffo Ceesay
I have updated the Benchmark Module after some suggestions from Renato. So
basically, the suggestion was to consider extending YCSB to include Gora
since YCSB already have an implementation of other KV stores.

So it will be great if a potential mentor could have look at this and give
me some feedback. We are currently in the proposal submission period of
GSoC timeline, so any comment on the document will really help.

Please find below link to the shared Google doc.

https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit



**Sheriffo Ceesay**


On Mon, Mar 25, 2019 at 12:30 PM Sheriffo Ceesay 
wrote:

> Hi Renato,
>
> Thanks for the reply and the comments on the Google doc.
>
> I think, adding Gora to YCBS framework will be the best approach. Like, I
> mentioned in the shared doc, I will dig more into this and update the
> proposal accordingly.
>
> Thank you.
>
>
> **Sheriffo Ceesay**
>
>
> On Mon, Mar 25, 2019 at 12:05 PM Renato Marroquín Mogrovejo <
> renatoj.marroq...@gmail.com> wrote:
>
>> Hey Sheriffo,
>>
>> Thanks for sharing this. I went quickly over it, and it looks good
>> overall.
>> One question I have is the one I left on the proposal as well. The
>> proposal
>> is about implementing a benckmarking module but why aren't we
>> using/integrating with something like YCSB?
>>
>> I am asking this because it has a few benefits:
>> - Most of the operations one would be interested in kv-stores are already
>> modeled by YCSB (as you know)
>> - With this we would already get support for most key-value stores and we
>> wouldn't have to implement it(or support it) later on.
>> - We get a benchmark module that is already accepted and understood by
>> people using key-value stores.
>>
>> The resulting deliverables could be the integration (adding Gora to YCSB,
>> the module could live in Gora and also could live in YCSB if they want to
>> take it), and the scripts to run it.
>> What do you guys think?
>>
>>
>> Best,
>>
>> Renato M.
>>
>> El dom., 24 mar. 2019 a las 13:05, Sheriffo Ceesay (<
>> sneceesa...@gmail.com>)
>> escribió:
>>
>> > Hi Renato,
>> >
>> > Thanks for the reply. As far as I am concerned all options are on the
>> > table. I have shared my draft project proposal with the dev email list
>> for
>> > comments. I will visit it again and see how best your ideas can be
>> added to
>> > the implementation.
>> >
>> > Below is the Google doc file, please feel free to add comments.
>> >
>> >
>> >
>> https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit?usp=sharing
>> >
>> > Thank you.
>> >
>> > **Sheriffo Ceesay**
>> >
>> >
>> > On Sun, Mar 24, 2019 at 11:08 AM Renato Marroquín Mogrovejo <
>> > renatoj.marroq...@gmail.com> wrote:
>> >
>> > > Hi Sheriffo,
>> > >
>> > > Thanks for your interest in Gora and in this project.
>> > > We have discussed this a bit already and what the important bit is to
>> > > figure out Gora's overhead compared to using just the kv stores.
>> > > Obviously, we incurr in overheads, but it'd be interesting to know
>> where
>> > > exactly (most likely serialization) and not just say how slow Gora is.
>> > > Ideally, one could fix the easy performance bugs but this might be
>> out of
>> > > the scope, but anyway, that would be nice.
>> > > Another idea would be to actually get the final benchmark run as part
>> of
>> > > CI? So we know how every change impacts performance.
>> > >
>> > >
>> > > Best,
>> > >
>> > > Renato M.
>> > > El mié., 20 mar. 2019 a las 17:15, sneceesa...@gmail.com (<
>> > > sneceesa...@gmail.com>) escribió:
>> > > >
>> > > >
>> > > >
>> > > > On 2017/12/23 20:17:12, Furkan KAMACI 
>> wrote:
>> > > > > Hi Fellows,
>> > > > >
>> > > > > As you know that our project is defined as:
>> > > > >
>> > > > > "*The Apache Gora™ open source framework provides an in-memory
>> data
>> > > model
>> > > > > and persistence for big data.*[1]"
>> > > > >
>> > > > > I believe that Apache Gora is a special project and it touches
>> many
>> > > > > projects. I always wonder the performance of NoSQL DBs as
>> individual
>> > > and
>> > > > > accessed via Apache Gora.
>> > > > >
>> > > > > I think that we should make a benchmark and publish it, and
>> Yahoo!’s
>> > > Cloud
>> > > > > Serving Benchmark (YCSB) [2] is the most suitable tool for such a
>> > > purpose.
>> > > > > I found a recent research about Object-NoSQL Database Mapper
>> (ONDM)
>> > > > > benchmark [3] which includes Apache Gora and they have produced
>> the
>> > > > > benchmark source code as ASF 2.0 licensed [4].
>> > > > >
>> > > > > Here is an example from Apache Accumulo which is based on YCSB too
>> > [5].
>> > > > >
>> > > > > What do you think about it? Who wants to join that work apart from
>> > me?
>> > > > >
>> > > > > Kind Regards,
>> > > > > Furkan KAMACI
>> > > > >
>> > > > >
>> > > > > [1] https://gora.apache.org
>> > > > > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R.
>> > > Benchmarking
>> > > > > cloud serving 

[jira] [Commented] (GORA-481) Using Docker For Unit Testing

2019-03-26 Thread Cihad Guzel (JIRA)


[ 
https://issues.apache.org/jira/browse/GORA-481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16801618#comment-16801618
 ] 

Cihad Guzel commented on GORA-481:
--

Yes, we can close this issue. [~djkevincr]

> Using Docker For Unit Testing
> -
>
> Key: GORA-481
> URL: https://issues.apache.org/jira/browse/GORA-481
> Project: Apache Gora
>  Issue Type: New Feature
>Reporter: Cihad Guzel
>Priority: Major
> Fix For: 0.9
>
>
> I have implemented [couchdb 
> datastore|https://issues.apache.org/jira/browse/GORA-437] on [my 
> repo|https://github.com/cguzel/gora-couchdb] . I want to add test class. So, 
> I must start couchdb server programmatically. But couchdb doesn't support 
> that to start couchdb server programmatically using java. Situations like it 
> may occur in the other datastore implementations. So If docker is used, the 
> datastore starts for testing.
> [TestContainers|https://github.com/testcontainers/testcontainers-java] is a 
> Java library that supports JUnit tests, providing lightweight, throwaway 
> instances of common databases that can run in a Docker container.
> So, GORA should support [Test 
> Containers|https://github.com/testcontainers/testcontainers-java] 
> There is a docker container of couchdb 
> [here|https://hub.docker.com/r/klaemo/couchdb/]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (GORA-532) Benchmarking Module

2019-03-26 Thread Sheriffo Ceesay (JIRA)


[ 
https://issues.apache.org/jira/browse/GORA-532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16801797#comment-16801797
 ] 

Sheriffo Ceesay commented on GORA-532:
--

[~kamaci] and [~djkevincr], I have made some progress on my proposal and I 
would like to know if you have any comments? 

I am planning to submit my proposal before end of the week so it will be 
helpful if a potential mentor can have a look at the document and provide 
comments. 

Link to doc : 
https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit
 

> Benchmarking Module
> ---
>
> Key: GORA-532
> URL: https://issues.apache.org/jira/browse/GORA-532
> Project: Apache Gora
>  Issue Type: New Feature
>Reporter: Furkan KAMACI
>Assignee: Furkan KAMACI
>Priority: Major
>  Labels: gsoc2018, gsoc2019
> Fix For: 0.9
>
>
> We should make a benchmark and publish it, and improved version of the 
> Yahoo!’s Cloud Serving Benchmark (YCSB++) [1] is the most suitable tool for 
> such a purpose. Here is a recent research about Object-NoSQL Database Mapper 
> (ONDM) benchmark [2] which includes Apache Gora and they have produced the 
> benchmark source code as ASF 2.0 licensed [3].
>  
> Also, here is an example from Apache Accumulo which is based on YCSB [4].
>  
> [1] [http://www.cs.cmu.edu/~wtantisi/files/tablebenchmark-pdl11-talk.pdf]
> [2] [https://doi.org/10.1186/s13174-016-0052-x]
> [3] [https://github.com/vreniers/ONDM-Benchmarker]
> [4] [https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (GORA-481) Using Docker For Unit Testing

2019-03-26 Thread Kevin Ratnasekera (JIRA)


 [ 
https://issues.apache.org/jira/browse/GORA-481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Ratnasekera updated GORA-481:
---
Component/s: build process

> Using Docker For Unit Testing
> -
>
> Key: GORA-481
> URL: https://issues.apache.org/jira/browse/GORA-481
> Project: Apache Gora
>  Issue Type: New Feature
>  Components: build process
>Affects Versions: 0.8
>Reporter: Cihad Guzel
>Assignee: Cihad Guzel
>Priority: Major
> Fix For: 0.9
>
>
> I have implemented [couchdb 
> datastore|https://issues.apache.org/jira/browse/GORA-437] on [my 
> repo|https://github.com/cguzel/gora-couchdb] . I want to add test class. So, 
> I must start couchdb server programmatically. But couchdb doesn't support 
> that to start couchdb server programmatically using java. Situations like it 
> may occur in the other datastore implementations. So If docker is used, the 
> datastore starts for testing.
> [TestContainers|https://github.com/testcontainers/testcontainers-java] is a 
> Java library that supports JUnit tests, providing lightweight, throwaway 
> instances of common databases that can run in a Docker container.
> So, GORA should support [Test 
> Containers|https://github.com/testcontainers/testcontainers-java] 
> There is a docker container of couchdb 
> [here|https://hub.docker.com/r/klaemo/couchdb/]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (GORA-481) Using Docker For Unit Testing

2019-03-26 Thread Kevin Ratnasekera (JIRA)


 [ 
https://issues.apache.org/jira/browse/GORA-481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Ratnasekera updated GORA-481:
---
Affects Version/s: 0.8

> Using Docker For Unit Testing
> -
>
> Key: GORA-481
> URL: https://issues.apache.org/jira/browse/GORA-481
> Project: Apache Gora
>  Issue Type: New Feature
>Affects Versions: 0.8
>Reporter: Cihad Guzel
>Assignee: Cihad Guzel
>Priority: Major
> Fix For: 0.9
>
>
> I have implemented [couchdb 
> datastore|https://issues.apache.org/jira/browse/GORA-437] on [my 
> repo|https://github.com/cguzel/gora-couchdb] . I want to add test class. So, 
> I must start couchdb server programmatically. But couchdb doesn't support 
> that to start couchdb server programmatically using java. Situations like it 
> may occur in the other datastore implementations. So If docker is used, the 
> datastore starts for testing.
> [TestContainers|https://github.com/testcontainers/testcontainers-java] is a 
> Java library that supports JUnit tests, providing lightweight, throwaway 
> instances of common databases that can run in a Docker container.
> So, GORA should support [Test 
> Containers|https://github.com/testcontainers/testcontainers-java] 
> There is a docker container of couchdb 
> [here|https://hub.docker.com/r/klaemo/couchdb/]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (GORA-481) Using Docker For Unit Testing

2019-03-26 Thread Kevin Ratnasekera (JIRA)


 [ 
https://issues.apache.org/jira/browse/GORA-481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Ratnasekera resolved GORA-481.

Resolution: Fixed

> Using Docker For Unit Testing
> -
>
> Key: GORA-481
> URL: https://issues.apache.org/jira/browse/GORA-481
> Project: Apache Gora
>  Issue Type: New Feature
>  Components: build process
>Affects Versions: 0.8
>Reporter: Cihad Guzel
>Assignee: Cihad Guzel
>Priority: Major
> Fix For: 0.9
>
>
> I have implemented [couchdb 
> datastore|https://issues.apache.org/jira/browse/GORA-437] on [my 
> repo|https://github.com/cguzel/gora-couchdb] . I want to add test class. So, 
> I must start couchdb server programmatically. But couchdb doesn't support 
> that to start couchdb server programmatically using java. Situations like it 
> may occur in the other datastore implementations. So If docker is used, the 
> datastore starts for testing.
> [TestContainers|https://github.com/testcontainers/testcontainers-java] is a 
> Java library that supports JUnit tests, providing lightweight, throwaway 
> instances of common databases that can run in a Docker container.
> So, GORA should support [Test 
> Containers|https://github.com/testcontainers/testcontainers-java] 
> There is a docker container of couchdb 
> [here|https://hub.docker.com/r/klaemo/couchdb/]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (GORA-481) Using Docker For Unit Testing

2019-03-26 Thread Kevin Ratnasekera (JIRA)


 [ 
https://issues.apache.org/jira/browse/GORA-481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Ratnasekera reassigned GORA-481:
--

Assignee: Cihad Guzel

> Using Docker For Unit Testing
> -
>
> Key: GORA-481
> URL: https://issues.apache.org/jira/browse/GORA-481
> Project: Apache Gora
>  Issue Type: New Feature
>Reporter: Cihad Guzel
>Assignee: Cihad Guzel
>Priority: Major
> Fix For: 0.9
>
>
> I have implemented [couchdb 
> datastore|https://issues.apache.org/jira/browse/GORA-437] on [my 
> repo|https://github.com/cguzel/gora-couchdb] . I want to add test class. So, 
> I must start couchdb server programmatically. But couchdb doesn't support 
> that to start couchdb server programmatically using java. Situations like it 
> may occur in the other datastore implementations. So If docker is used, the 
> datastore starts for testing.
> [TestContainers|https://github.com/testcontainers/testcontainers-java] is a 
> Java library that supports JUnit tests, providing lightweight, throwaway 
> instances of common databases that can run in a Docker container.
> So, GORA should support [Test 
> Containers|https://github.com/testcontainers/testcontainers-java] 
> There is a docker container of couchdb 
> [here|https://hub.docker.com/r/klaemo/couchdb/]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[GitHub] [gora] carlosrmng commented on issue #156: GORA 411 - Add exists(key) to DataStore interface

2019-03-26 Thread GitBox
carlosrmng commented on issue #156: GORA 411 - Add exists(key) to DataStore 
interface
URL: https://github.com/apache/gora/pull/156#issuecomment-476932498
 
 
   Nice work @jhnmora000 .  
   The implementation for the Ignite Datastore looks good in my opinion. But, I 
would suggest you to use  two-space tab indentation and format your code.
   Thanks for your contribution. 


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services