Re: Apache Gora Benchmark Module Draft Proposal

2019-04-08 Thread Sheriffo Ceesay
Thank you. I will do the final submission later today.

Thank you.


**Sheriffo Ceesay**


On Mon, Apr 8, 2019 at 5:40 PM Kevin Ratnasekera 
wrote:

> Hi Sheriffo,
>
> The proposal looks very good. I have added one small comment. Please make
> sure you do the final submission before the deadline.
>
> Good luck with your submission :)
>
> Regards
> Kevin
>
> On Sun, Mar 24, 2019 at 2:14 AM Sheriffo Ceesay 
> wrote:
>
> > Hi All,
> >
> > As advised by Kevin Ratnasekera and Furkan Kamaci, please find below the
> > link to the draft proposal for Gora benchmark module.
> >
> > Please let me if you have any comments to improve the document. (Don't
> > worry, I have another copy.)
> >
> >
> >
> https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit?usp=sharing
> >
> >
> > Please add your comments using the sidebar comment functionality if
> > possible.
> >
> > Thank you.
> >
> >
> > **Sheriffo Ceesay**
> >
>


Re: Apache Gora Benchmark Module Draft Proposal

2019-04-08 Thread Kevin Ratnasekera
Hi Sheriffo,

The proposal looks very good. I have added one small comment. Please make
sure you do the final submission before the deadline.

Good luck with your submission :)

Regards
Kevin

On Sun, Mar 24, 2019 at 2:14 AM Sheriffo Ceesay 
wrote:

> Hi All,
>
> As advised by Kevin Ratnasekera and Furkan Kamaci, please find below the
> link to the draft proposal for Gora benchmark module.
>
> Please let me if you have any comments to improve the document. (Don't
> worry, I have another copy.)
>
>
> https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit?usp=sharing
>
>
> Please add your comments using the sidebar comment functionality if
> possible.
>
> Thank you.
>
>
> **Sheriffo Ceesay**
>


Re: Apache Gora Benchmark

2019-03-26 Thread Sheriffo Ceesay
I have updated the Benchmark Module after some suggestions from Renato. So
basically, the suggestion was to consider extending YCSB to include Gora
since YCSB already have an implementation of other KV stores.

So it will be great if a potential mentor could have look at this and give
me some feedback. We are currently in the proposal submission period of
GSoC timeline, so any comment on the document will really help.

Please find below link to the shared Google doc.

https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit



**Sheriffo Ceesay**


On Mon, Mar 25, 2019 at 12:30 PM Sheriffo Ceesay 
wrote:

> Hi Renato,
>
> Thanks for the reply and the comments on the Google doc.
>
> I think, adding Gora to YCBS framework will be the best approach. Like, I
> mentioned in the shared doc, I will dig more into this and update the
> proposal accordingly.
>
> Thank you.
>
>
> **Sheriffo Ceesay**
>
>
> On Mon, Mar 25, 2019 at 12:05 PM Renato Marroquín Mogrovejo <
> renatoj.marroq...@gmail.com> wrote:
>
>> Hey Sheriffo,
>>
>> Thanks for sharing this. I went quickly over it, and it looks good
>> overall.
>> One question I have is the one I left on the proposal as well. The
>> proposal
>> is about implementing a benckmarking module but why aren't we
>> using/integrating with something like YCSB?
>>
>> I am asking this because it has a few benefits:
>> - Most of the operations one would be interested in kv-stores are already
>> modeled by YCSB (as you know)
>> - With this we would already get support for most key-value stores and we
>> wouldn't have to implement it(or support it) later on.
>> - We get a benchmark module that is already accepted and understood by
>> people using key-value stores.
>>
>> The resulting deliverables could be the integration (adding Gora to YCSB,
>> the module could live in Gora and also could live in YCSB if they want to
>> take it), and the scripts to run it.
>> What do you guys think?
>>
>>
>> Best,
>>
>> Renato M.
>>
>> El dom., 24 mar. 2019 a las 13:05, Sheriffo Ceesay (<
>> sneceesa...@gmail.com>)
>> escribió:
>>
>> > Hi Renato,
>> >
>> > Thanks for the reply. As far as I am concerned all options are on the
>> > table. I have shared my draft project proposal with the dev email list
>> for
>> > comments. I will visit it again and see how best your ideas can be
>> added to
>> > the implementation.
>> >
>> > Below is the Google doc file, please feel free to add comments.
>> >
>> >
>> >
>> https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit?usp=sharing
>> >
>> > Thank you.
>> >
>> > **Sheriffo Ceesay**
>> >
>> >
>> > On Sun, Mar 24, 2019 at 11:08 AM Renato Marroquín Mogrovejo <
>> > renatoj.marroq...@gmail.com> wrote:
>> >
>> > > Hi Sheriffo,
>> > >
>> > > Thanks for your interest in Gora and in this project.
>> > > We have discussed this a bit already and what the important bit is to
>> > > figure out Gora's overhead compared to using just the kv stores.
>> > > Obviously, we incurr in overheads, but it'd be interesting to know
>> where
>> > > exactly (most likely serialization) and not just say how slow Gora is.
>> > > Ideally, one could fix the easy performance bugs but this might be
>> out of
>> > > the scope, but anyway, that would be nice.
>> > > Another idea would be to actually get the final benchmark run as part
>> of
>> > > CI? So we know how every change impacts performance.
>> > >
>> > >
>> > > Best,
>> > >
>> > > Renato M.
>> > > El mié., 20 mar. 2019 a las 17:15, sneceesa...@gmail.com (<
>> > > sneceesa...@gmail.com>) escribió:
>> > > >
>> > > >
>> > > >
>> > > > On 2017/12/23 20:17:12, Furkan KAMACI 
>> wrote:
>> > > > > Hi Fellows,
>> > > > >
>> > > > > As you know that our project is defined as:
>> > > > >
>> > > > > "*The Apache Gora™ open source framework provides an in-memory
>> data
>> > > model
>> > > > > and persistence for big data.*[1]"
>> > > > >
>> > > > > I believe that Apache Gora is a special project and it touches
>> many
>> > > > > projects. I always wonder the performance of NoSQL DBs as
>> individual
>> > > and
>> > > > > accessed via Apache Gora.
>> > > > >
>> > > > > I think that we should make a benchmark and publish it, and
>> Yahoo!’s
>> > > Cloud
>> > > > > Serving Benchmark (YCSB) [2] is the most suitable tool for such a
>> > > purpose.
>> > > > > I found a recent research about Object-NoSQL Database Mapper
>> (ONDM)
>> > > > > benchmark [3] which includes Apache Gora and they have produced
>> the
>> > > > > benchmark source code as ASF 2.0 licensed [4].
>> > > > >
>> > > > > Here is an example from Apache Accumulo which is based on YCSB too
>> > [5].
>> > > > >
>> > > > > What do you think about it? Who wants to join that work apart from
>> > me?
>> > > > >
>> > > > > Kind Regards,
>> > > > > Furkan KAMACI
>> > > > >
>> > > > >
>> > > > > [1] https://gora.apache.org
>> > > > > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R.
>> > > Benchmarking
>> > > > > cloud serving system

Re: Apache Gora Benchmark

2019-03-25 Thread Sheriffo Ceesay
Hi Renato,

Thanks for the reply and the comments on the Google doc.

I think, adding Gora to YCBS framework will be the best approach. Like, I
mentioned in the shared doc, I will dig more into this and update the
proposal accordingly.

Thank you.


**Sheriffo Ceesay**


On Mon, Mar 25, 2019 at 12:05 PM Renato Marroquín Mogrovejo <
renatoj.marroq...@gmail.com> wrote:

> Hey Sheriffo,
>
> Thanks for sharing this. I went quickly over it, and it looks good overall.
> One question I have is the one I left on the proposal as well. The proposal
> is about implementing a benckmarking module but why aren't we
> using/integrating with something like YCSB?
>
> I am asking this because it has a few benefits:
> - Most of the operations one would be interested in kv-stores are already
> modeled by YCSB (as you know)
> - With this we would already get support for most key-value stores and we
> wouldn't have to implement it(or support it) later on.
> - We get a benchmark module that is already accepted and understood by
> people using key-value stores.
>
> The resulting deliverables could be the integration (adding Gora to YCSB,
> the module could live in Gora and also could live in YCSB if they want to
> take it), and the scripts to run it.
> What do you guys think?
>
>
> Best,
>
> Renato M.
>
> El dom., 24 mar. 2019 a las 13:05, Sheriffo Ceesay ( >)
> escribió:
>
> > Hi Renato,
> >
> > Thanks for the reply. As far as I am concerned all options are on the
> > table. I have shared my draft project proposal with the dev email list
> for
> > comments. I will visit it again and see how best your ideas can be added
> to
> > the implementation.
> >
> > Below is the Google doc file, please feel free to add comments.
> >
> >
> >
> https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit?usp=sharing
> >
> > Thank you.
> >
> > **Sheriffo Ceesay**
> >
> >
> > On Sun, Mar 24, 2019 at 11:08 AM Renato Marroquín Mogrovejo <
> > renatoj.marroq...@gmail.com> wrote:
> >
> > > Hi Sheriffo,
> > >
> > > Thanks for your interest in Gora and in this project.
> > > We have discussed this a bit already and what the important bit is to
> > > figure out Gora's overhead compared to using just the kv stores.
> > > Obviously, we incurr in overheads, but it'd be interesting to know
> where
> > > exactly (most likely serialization) and not just say how slow Gora is.
> > > Ideally, one could fix the easy performance bugs but this might be out
> of
> > > the scope, but anyway, that would be nice.
> > > Another idea would be to actually get the final benchmark run as part
> of
> > > CI? So we know how every change impacts performance.
> > >
> > >
> > > Best,
> > >
> > > Renato M.
> > > El mié., 20 mar. 2019 a las 17:15, sneceesa...@gmail.com (<
> > > sneceesa...@gmail.com>) escribió:
> > > >
> > > >
> > > >
> > > > On 2017/12/23 20:17:12, Furkan KAMACI 
> wrote:
> > > > > Hi Fellows,
> > > > >
> > > > > As you know that our project is defined as:
> > > > >
> > > > > "*The Apache Gora™ open source framework provides an in-memory data
> > > model
> > > > > and persistence for big data.*[1]"
> > > > >
> > > > > I believe that Apache Gora is a special project and it touches many
> > > > > projects. I always wonder the performance of NoSQL DBs as
> individual
> > > and
> > > > > accessed via Apache Gora.
> > > > >
> > > > > I think that we should make a benchmark and publish it, and
> Yahoo!’s
> > > Cloud
> > > > > Serving Benchmark (YCSB) [2] is the most suitable tool for such a
> > > purpose.
> > > > > I found a recent research about Object-NoSQL Database Mapper (ONDM)
> > > > > benchmark [3] which includes Apache Gora and they have produced the
> > > > > benchmark source code as ASF 2.0 licensed [4].
> > > > >
> > > > > Here is an example from Apache Accumulo which is based on YCSB too
> > [5].
> > > > >
> > > > > What do you think about it? Who wants to join that work apart from
> > me?
> > > > >
> > > > > Kind Regards,
> > > > > Furkan KAMACI
> > > > >
> > > > >
> > > > > [1] https://gora.apache.org
> > > > > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R.
> > > Benchmarking
> > > > > cloud serving systems with YCSB. In: Proceedings of the 1st ACM
> > > symposium
> > > > > on Cloud computing - SoCC ’10. Association for Computing Machinery
> > > (ACM):
> > > > > 2010. p. 143–154, doi:10.1145/1807128.1807152.
> > > > > http://dx.doi.org/10.1145/1807128.1807152.
> > > > > [3] https://doi.org/10.1186/s13174-016-0052-x
> > > > > [4] https://github.com/vreniers/ONDM-Benchmarker
> > > > > [5]
> https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf
> > > > >
> > > >
> > > > Hi All, I was advised by Kevin Ratnasekera to start or reignite this
> > > discussion. I am currently going over the documentation, installation
> and
> > > familiarising myself with the code base. Any good pointers here will be
> > > helpful.
> > >
> >
>


Re: Apache Gora Benchmark

2019-03-25 Thread Renato Marroquín Mogrovejo
Hey Sheriffo,

Thanks for sharing this. I went quickly over it, and it looks good overall.
One question I have is the one I left on the proposal as well. The proposal
is about implementing a benckmarking module but why aren't we
using/integrating with something like YCSB?

I am asking this because it has a few benefits:
- Most of the operations one would be interested in kv-stores are already
modeled by YCSB (as you know)
- With this we would already get support for most key-value stores and we
wouldn't have to implement it(or support it) later on.
- We get a benchmark module that is already accepted and understood by
people using key-value stores.

The resulting deliverables could be the integration (adding Gora to YCSB,
the module could live in Gora and also could live in YCSB if they want to
take it), and the scripts to run it.
What do you guys think?


Best,

Renato M.

El dom., 24 mar. 2019 a las 13:05, Sheriffo Ceesay ()
escribió:

> Hi Renato,
>
> Thanks for the reply. As far as I am concerned all options are on the
> table. I have shared my draft project proposal with the dev email list for
> comments. I will visit it again and see how best your ideas can be added to
> the implementation.
>
> Below is the Google doc file, please feel free to add comments.
>
>
> https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit?usp=sharing
>
> Thank you.
>
> **Sheriffo Ceesay**
>
>
> On Sun, Mar 24, 2019 at 11:08 AM Renato Marroquín Mogrovejo <
> renatoj.marroq...@gmail.com> wrote:
>
> > Hi Sheriffo,
> >
> > Thanks for your interest in Gora and in this project.
> > We have discussed this a bit already and what the important bit is to
> > figure out Gora's overhead compared to using just the kv stores.
> > Obviously, we incurr in overheads, but it'd be interesting to know where
> > exactly (most likely serialization) and not just say how slow Gora is.
> > Ideally, one could fix the easy performance bugs but this might be out of
> > the scope, but anyway, that would be nice.
> > Another idea would be to actually get the final benchmark run as part of
> > CI? So we know how every change impacts performance.
> >
> >
> > Best,
> >
> > Renato M.
> > El mié., 20 mar. 2019 a las 17:15, sneceesa...@gmail.com (<
> > sneceesa...@gmail.com>) escribió:
> > >
> > >
> > >
> > > On 2017/12/23 20:17:12, Furkan KAMACI  wrote:
> > > > Hi Fellows,
> > > >
> > > > As you know that our project is defined as:
> > > >
> > > > "*The Apache Gora™ open source framework provides an in-memory data
> > model
> > > > and persistence for big data.*[1]"
> > > >
> > > > I believe that Apache Gora is a special project and it touches many
> > > > projects. I always wonder the performance of NoSQL DBs as individual
> > and
> > > > accessed via Apache Gora.
> > > >
> > > > I think that we should make a benchmark and publish it, and Yahoo!’s
> > Cloud
> > > > Serving Benchmark (YCSB) [2] is the most suitable tool for such a
> > purpose.
> > > > I found a recent research about Object-NoSQL Database Mapper (ONDM)
> > > > benchmark [3] which includes Apache Gora and they have produced the
> > > > benchmark source code as ASF 2.0 licensed [4].
> > > >
> > > > Here is an example from Apache Accumulo which is based on YCSB too
> [5].
> > > >
> > > > What do you think about it? Who wants to join that work apart from
> me?
> > > >
> > > > Kind Regards,
> > > > Furkan KAMACI
> > > >
> > > >
> > > > [1] https://gora.apache.org
> > > > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R.
> > Benchmarking
> > > > cloud serving systems with YCSB. In: Proceedings of the 1st ACM
> > symposium
> > > > on Cloud computing - SoCC ’10. Association for Computing Machinery
> > (ACM):
> > > > 2010. p. 143–154, doi:10.1145/1807128.1807152.
> > > > http://dx.doi.org/10.1145/1807128.1807152.
> > > > [3] https://doi.org/10.1186/s13174-016-0052-x
> > > > [4] https://github.com/vreniers/ONDM-Benchmarker
> > > > [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf
> > > >
> > >
> > > Hi All, I was advised by Kevin Ratnasekera to start or reignite this
> > discussion. I am currently going over the documentation, installation and
> > familiarising myself with the code base. Any good pointers here will be
> > helpful.
> >
>


Re: Apache Gora Benchmark

2019-03-24 Thread Sheriffo Ceesay
Hi Renato,

Thanks for the reply. As far as I am concerned all options are on the
table. I have shared my draft project proposal with the dev email list for
comments. I will visit it again and see how best your ideas can be added to
the implementation.

Below is the Google doc file, please feel free to add comments.

https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit?usp=sharing

Thank you.

**Sheriffo Ceesay**


On Sun, Mar 24, 2019 at 11:08 AM Renato Marroquín Mogrovejo <
renatoj.marroq...@gmail.com> wrote:

> Hi Sheriffo,
>
> Thanks for your interest in Gora and in this project.
> We have discussed this a bit already and what the important bit is to
> figure out Gora's overhead compared to using just the kv stores.
> Obviously, we incurr in overheads, but it'd be interesting to know where
> exactly (most likely serialization) and not just say how slow Gora is.
> Ideally, one could fix the easy performance bugs but this might be out of
> the scope, but anyway, that would be nice.
> Another idea would be to actually get the final benchmark run as part of
> CI? So we know how every change impacts performance.
>
>
> Best,
>
> Renato M.
> El mié., 20 mar. 2019 a las 17:15, sneceesa...@gmail.com (<
> sneceesa...@gmail.com>) escribió:
> >
> >
> >
> > On 2017/12/23 20:17:12, Furkan KAMACI  wrote:
> > > Hi Fellows,
> > >
> > > As you know that our project is defined as:
> > >
> > > "*The Apache Gora™ open source framework provides an in-memory data
> model
> > > and persistence for big data.*[1]"
> > >
> > > I believe that Apache Gora is a special project and it touches many
> > > projects. I always wonder the performance of NoSQL DBs as individual
> and
> > > accessed via Apache Gora.
> > >
> > > I think that we should make a benchmark and publish it, and Yahoo!’s
> Cloud
> > > Serving Benchmark (YCSB) [2] is the most suitable tool for such a
> purpose.
> > > I found a recent research about Object-NoSQL Database Mapper (ONDM)
> > > benchmark [3] which includes Apache Gora and they have produced the
> > > benchmark source code as ASF 2.0 licensed [4].
> > >
> > > Here is an example from Apache Accumulo which is based on YCSB too [5].
> > >
> > > What do you think about it? Who wants to join that work apart from me?
> > >
> > > Kind Regards,
> > > Furkan KAMACI
> > >
> > >
> > > [1] https://gora.apache.org
> > > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R.
> Benchmarking
> > > cloud serving systems with YCSB. In: Proceedings of the 1st ACM
> symposium
> > > on Cloud computing - SoCC ’10. Association for Computing Machinery
> (ACM):
> > > 2010. p. 143–154, doi:10.1145/1807128.1807152.
> > > http://dx.doi.org/10.1145/1807128.1807152.
> > > [3] https://doi.org/10.1186/s13174-016-0052-x
> > > [4] https://github.com/vreniers/ONDM-Benchmarker
> > > [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf
> > >
> >
> > Hi All, I was advised by Kevin Ratnasekera to start or reignite this
> discussion. I am currently going over the documentation, installation and
> familiarising myself with the code base. Any good pointers here will be
> helpful.
>


Re: Apache Gora Benchmark

2019-03-24 Thread Renato Marroquín Mogrovejo
Hi Sheriffo,

Thanks for your interest in Gora and in this project.
We have discussed this a bit already and what the important bit is to
figure out Gora's overhead compared to using just the kv stores.
Obviously, we incurr in overheads, but it'd be interesting to know where
exactly (most likely serialization) and not just say how slow Gora is.
Ideally, one could fix the easy performance bugs but this might be out of
the scope, but anyway, that would be nice.
Another idea would be to actually get the final benchmark run as part of
CI? So we know how every change impacts performance.


Best,

Renato M.
El mié., 20 mar. 2019 a las 17:15, sneceesa...@gmail.com (<
sneceesa...@gmail.com>) escribió:
>
>
>
> On 2017/12/23 20:17:12, Furkan KAMACI  wrote:
> > Hi Fellows,
> >
> > As you know that our project is defined as:
> >
> > "*The Apache Gora™ open source framework provides an in-memory data
model
> > and persistence for big data.*[1]"
> >
> > I believe that Apache Gora is a special project and it touches many
> > projects. I always wonder the performance of NoSQL DBs as individual and
> > accessed via Apache Gora.
> >
> > I think that we should make a benchmark and publish it, and Yahoo!’s
Cloud
> > Serving Benchmark (YCSB) [2] is the most suitable tool for such a
purpose.
> > I found a recent research about Object-NoSQL Database Mapper (ONDM)
> > benchmark [3] which includes Apache Gora and they have produced the
> > benchmark source code as ASF 2.0 licensed [4].
> >
> > Here is an example from Apache Accumulo which is based on YCSB too [5].
> >
> > What do you think about it? Who wants to join that work apart from me?
> >
> > Kind Regards,
> > Furkan KAMACI
> >
> >
> > [1] https://gora.apache.org
> > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R.
Benchmarking
> > cloud serving systems with YCSB. In: Proceedings of the 1st ACM
symposium
> > on Cloud computing - SoCC ’10. Association for Computing Machinery
(ACM):
> > 2010. p. 143–154, doi:10.1145/1807128.1807152.
> > http://dx.doi.org/10.1145/1807128.1807152.
> > [3] https://doi.org/10.1186/s13174-016-0052-x
> > [4] https://github.com/vreniers/ONDM-Benchmarker
> > [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf
> >
>
> Hi All, I was advised by Kevin Ratnasekera to start or reignite this
discussion. I am currently going over the documentation, installation and
familiarising myself with the code base. Any good pointers here will be
helpful.


Re: Apache Gora Benchmark

2019-03-20 Thread sneceesay77



On 2017/12/23 20:17:12, Furkan KAMACI  wrote: 
> Hi Fellows,
> 
> As you know that our project is defined as:
> 
> "*The Apache Gora™ open source framework provides an in-memory data model
> and persistence for big data.*[1]"
> 
> I believe that Apache Gora is a special project and it touches many
> projects. I always wonder the performance of NoSQL DBs as individual and
> accessed via Apache Gora.
> 
> I think that we should make a benchmark and publish it, and Yahoo!’s Cloud
> Serving Benchmark (YCSB) [2] is the most suitable tool for such a purpose.
> I found a recent research about Object-NoSQL Database Mapper (ONDM)
> benchmark [3] which includes Apache Gora and they have produced the
> benchmark source code as ASF 2.0 licensed [4].
> 
> Here is an example from Apache Accumulo which is based on YCSB too [5].
> 
> What do you think about it? Who wants to join that work apart from me?
> 
> Kind Regards,
> Furkan KAMACI
> 
> 
> [1] https://gora.apache.org
> [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R. Benchmarking
> cloud serving systems with YCSB. In: Proceedings of the 1st ACM symposium
> on Cloud computing - SoCC ’10. Association for Computing Machinery (ACM):
> 2010. p. 143–154, doi:10.1145/1807128.1807152.
> http://dx.doi.org/10.1145/1807128.1807152.
> [3] https://doi.org/10.1186/s13174-016-0052-x
> [4] https://github.com/vreniers/ONDM-Benchmarker
> [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf
> 

Hi All, I was advised by Kevin Ratnasekera to start or reignite this 
discussion. I am currently going over the documentation, installation and 
familiarising myself with the code base. Any good pointers here will be 
helpful.  


Re: Apache Gora Benchmark

2018-02-20 Thread Furkan KAMACI
Hi,

I've created https://issues.apache.org/jira/browse/GORA-532 for creating a
benchmarking module.

Kind Regards,
Furkan KAMACI

On Thu, Feb 15, 2018 at 8:00 AM, Furkan KAMACI 
wrote:

> Hi Fellows,
>
> Let’s decide what do we need for the new release?
>
> Kind Regards,
> Furkan KAMACI
>
>
> 26 Ara 2017 Sal, saat 22:53 tarihinde Furkan KAMACI <
> furkankam...@gmail.com> şunu yazdı:
>
>> Hi Lewis,
>>
>> That's cool! We can make a release and than work on that.
>>
>> BTW, here is the improved version of YCSB: YCSB++ [1]. We can use it for
>> our purpose.
>>
>> [1] http://www.cs.cmu.edu/~wtantisi/files/tablebenchmark-pdl11-talk.pdf
>>
>>
>> Kind Regards,
>> Furkan KAMACI
>>
>> On Tue, Dec 26, 2017 at 4:38 PM, lewis john mcgibbney > > wrote:
>>
>>> Hi Furkan,
>>> I think it's an excellent idea. I don't have much time to do it however I
>>> do have a comment.
>>> One issue I see is that we should really be benchmarking recent Gora
>>> modules. This requires quite a few updates across the board so that is
>>> where I suggest we start.
>>> If we can begin working our way through this, make a release of Gora,
>>> then
>>> we can come back and seriously approach a paper for a workshop or
>>> journal.
>>> What do you think?
>>> lewis
>>>
>>> On Sun, Dec 24, 2017 at 12:59 PM, 
>>> wrote:
>>>
>>> >
>>> > From: Furkan KAMACI 
>>> > To: "" 
>>> > Cc:
>>> > Bcc:
>>> > Date: Sat, 23 Dec 2017 22:17:12 +0200
>>> > Subject: Apache Gora Benchmark
>>> > Hi Fellows,
>>> >
>>> > As you know that our project is defined as:
>>> >
>>> > "*The Apache Gora™ open source framework provides an in-memory data
>>> model
>>> > and persistence for big data.*[1]"
>>> >
>>> > I believe that Apache Gora is a special project and it touches many
>>> > projects. I always wonder the performance of NoSQL DBs as individual
>>> and
>>> > accessed via Apache Gora.
>>> >
>>> > I think that we should make a benchmark and publish it, and Yahoo!’s
>>> Cloud
>>> > Serving Benchmark (YCSB) [2] is the most suitable tool for such a
>>> purpose.
>>> > I found a recent research about Object-NoSQL Database Mapper (ONDM)
>>> > benchmark [3] which includes Apache Gora and they have produced the
>>> > benchmark source code as ASF 2.0 licensed [4].
>>> >
>>> > Here is an example from Apache Accumulo which is based on YCSB too [5].
>>> >
>>> > What do you think about it? Who wants to join that work apart from me?
>>> >
>>> > Kind Regards,
>>> > Furkan KAMACI
>>> >
>>> >
>>> > [1] https://gora.apache.org
>>> > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R.
>>> Benchmarking
>>> > cloud serving systems with YCSB. In: Proceedings of the 1st ACM
>>> symposium
>>> > on Cloud computing - SoCC ’10. Association for Computing Machinery
>>> (ACM):
>>> > 2010. p. 143–154, doi:10.1145/1807128.1807152.
>>> > http://dx.doi.org/10.1145/1807128.1807152.
>>> > [3] https://doi.org/10.1186/s13174-016-0052-x
>>> > [4] https://github.com/vreniers/ONDM-Benchmarker
>>> > [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf
>>> >
>>> >
>>>
>>
>>


Re: Apache Gora Benchmark

2018-02-14 Thread Furkan KAMACI
Hi Fellows,

Let’s decide what do we need for the new release?

Kind Regards,
Furkan KAMACI


26 Ara 2017 Sal, saat 22:53 tarihinde Furkan KAMACI 
şunu yazdı:

> Hi Lewis,
>
> That's cool! We can make a release and than work on that.
>
> BTW, here is the improved version of YCSB: YCSB++ [1]. We can use it for
> our purpose.
>
> [1] http://www.cs.cmu.edu/~wtantisi/files/tablebenchmark-pdl11-talk.pdf
>
>
> Kind Regards,
> Furkan KAMACI
>
> On Tue, Dec 26, 2017 at 4:38 PM, lewis john mcgibbney 
> wrote:
>
>> Hi Furkan,
>> I think it's an excellent idea. I don't have much time to do it however I
>> do have a comment.
>> One issue I see is that we should really be benchmarking recent Gora
>> modules. This requires quite a few updates across the board so that is
>> where I suggest we start.
>> If we can begin working our way through this, make a release of Gora, then
>> we can come back and seriously approach a paper for a workshop or journal.
>> What do you think?
>> lewis
>>
>> On Sun, Dec 24, 2017 at 12:59 PM, 
>> wrote:
>>
>> >
>> > From: Furkan KAMACI 
>> > To: "" 
>> > Cc:
>> > Bcc:
>> > Date: Sat, 23 Dec 2017 22:17:12 +0200
>> > Subject: Apache Gora Benchmark
>> > Hi Fellows,
>> >
>> > As you know that our project is defined as:
>> >
>> > "*The Apache Gora™ open source framework provides an in-memory data
>> model
>> > and persistence for big data.*[1]"
>> >
>> > I believe that Apache Gora is a special project and it touches many
>> > projects. I always wonder the performance of NoSQL DBs as individual and
>> > accessed via Apache Gora.
>> >
>> > I think that we should make a benchmark and publish it, and Yahoo!’s
>> Cloud
>> > Serving Benchmark (YCSB) [2] is the most suitable tool for such a
>> purpose.
>> > I found a recent research about Object-NoSQL Database Mapper (ONDM)
>> > benchmark [3] which includes Apache Gora and they have produced the
>> > benchmark source code as ASF 2.0 licensed [4].
>> >
>> > Here is an example from Apache Accumulo which is based on YCSB too [5].
>> >
>> > What do you think about it? Who wants to join that work apart from me?
>> >
>> > Kind Regards,
>> > Furkan KAMACI
>> >
>> >
>> > [1] https://gora.apache.org
>> > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R.
>> Benchmarking
>> > cloud serving systems with YCSB. In: Proceedings of the 1st ACM
>> symposium
>> > on Cloud computing - SoCC ’10. Association for Computing Machinery
>> (ACM):
>> > 2010. p. 143–154, doi:10.1145/1807128.1807152.
>> > http://dx.doi.org/10.1145/1807128.1807152.
>> > [3] https://doi.org/10.1186/s13174-016-0052-x
>> > [4] https://github.com/vreniers/ONDM-Benchmarker
>> > [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf
>> >
>> >
>>
>
>


Re: Apache Gora Benchmark

2017-12-26 Thread Furkan KAMACI
Hi Lewis,

That's cool! We can make a release and than work on that.

BTW, here is the improved version of YCSB: YCSB++ [1]. We can use it for
our purpose.

[1] http://www.cs.cmu.edu/~wtantisi/files/tablebenchmark-pdl11-talk.pdf


Kind Regards,
Furkan KAMACI

On Tue, Dec 26, 2017 at 4:38 PM, lewis john mcgibbney 
wrote:

> Hi Furkan,
> I think it's an excellent idea. I don't have much time to do it however I
> do have a comment.
> One issue I see is that we should really be benchmarking recent Gora
> modules. This requires quite a few updates across the board so that is
> where I suggest we start.
> If we can begin working our way through this, make a release of Gora, then
> we can come back and seriously approach a paper for a workshop or journal.
> What do you think?
> lewis
>
> On Sun, Dec 24, 2017 at 12:59 PM,  wrote:
>
> >
> > From: Furkan KAMACI 
> > To: "" 
> > Cc:
> > Bcc:
> > Date: Sat, 23 Dec 2017 22:17:12 +0200
> > Subject: Apache Gora Benchmark
> > Hi Fellows,
> >
> > As you know that our project is defined as:
> >
> > "*The Apache Gora™ open source framework provides an in-memory data model
> > and persistence for big data.*[1]"
> >
> > I believe that Apache Gora is a special project and it touches many
> > projects. I always wonder the performance of NoSQL DBs as individual and
> > accessed via Apache Gora.
> >
> > I think that we should make a benchmark and publish it, and Yahoo!’s
> Cloud
> > Serving Benchmark (YCSB) [2] is the most suitable tool for such a
> purpose.
> > I found a recent research about Object-NoSQL Database Mapper (ONDM)
> > benchmark [3] which includes Apache Gora and they have produced the
> > benchmark source code as ASF 2.0 licensed [4].
> >
> > Here is an example from Apache Accumulo which is based on YCSB too [5].
> >
> > What do you think about it? Who wants to join that work apart from me?
> >
> > Kind Regards,
> > Furkan KAMACI
> >
> >
> > [1] https://gora.apache.org
> > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R.
> Benchmarking
> > cloud serving systems with YCSB. In: Proceedings of the 1st ACM symposium
> > on Cloud computing - SoCC ’10. Association for Computing Machinery (ACM):
> > 2010. p. 143–154, doi:10.1145/1807128.1807152.
> > http://dx.doi.org/10.1145/1807128.1807152.
> > [3] https://doi.org/10.1186/s13174-016-0052-x
> > [4] https://github.com/vreniers/ONDM-Benchmarker
> > [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf
> >
> >
>


Re: Apache Gora Benchmark

2017-12-26 Thread lewis john mcgibbney
Hi Furkan,
I think it's an excellent idea. I don't have much time to do it however I
do have a comment.
One issue I see is that we should really be benchmarking recent Gora
modules. This requires quite a few updates across the board so that is
where I suggest we start.
If we can begin working our way through this, make a release of Gora, then
we can come back and seriously approach a paper for a workshop or journal.
What do you think?
lewis

On Sun, Dec 24, 2017 at 12:59 PM,  wrote:

>
> From: Furkan KAMACI 
> To: "" 
> Cc:
> Bcc:
> Date: Sat, 23 Dec 2017 22:17:12 +0200
> Subject: Apache Gora Benchmark
> Hi Fellows,
>
> As you know that our project is defined as:
>
> "*The Apache Gora™ open source framework provides an in-memory data model
> and persistence for big data.*[1]"
>
> I believe that Apache Gora is a special project and it touches many
> projects. I always wonder the performance of NoSQL DBs as individual and
> accessed via Apache Gora.
>
> I think that we should make a benchmark and publish it, and Yahoo!’s Cloud
> Serving Benchmark (YCSB) [2] is the most suitable tool for such a purpose.
> I found a recent research about Object-NoSQL Database Mapper (ONDM)
> benchmark [3] which includes Apache Gora and they have produced the
> benchmark source code as ASF 2.0 licensed [4].
>
> Here is an example from Apache Accumulo which is based on YCSB too [5].
>
> What do you think about it? Who wants to join that work apart from me?
>
> Kind Regards,
> Furkan KAMACI
>
>
> [1] https://gora.apache.org
> [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R. Benchmarking
> cloud serving systems with YCSB. In: Proceedings of the 1st ACM symposium
> on Cloud computing - SoCC ’10. Association for Computing Machinery (ACM):
> 2010. p. 143–154, doi:10.1145/1807128.1807152.
> http://dx.doi.org/10.1145/1807128.1807152.
> [3] https://doi.org/10.1186/s13174-016-0052-x
> [4] https://github.com/vreniers/ONDM-Benchmarker
> [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf
>
>


Re: Apache Gora Benchmark

2017-12-23 Thread Alfonso Nishikawa
Hi, Furkan.

Thank you very much for the links and the information. I find them VERY
interesting :)
I have my hands full, but I would like to stay updated =) Maybe I will have
some time in the mid term.

I really appreciate your links!

Regards,

Alfonso

2017-12-23 19:17 GMT-01:00 Furkan KAMACI :

> Hi Fellows,
>
> As you know that our project is defined as:
>
> "*The Apache Gora™ open source framework provides an in-memory data model
> and persistence for big data.*[1]"
>
> I believe that Apache Gora is a special project and it touches many
> projects. I always wonder the performance of NoSQL DBs as individual and
> accessed via Apache Gora.
>
> I think that we should make a benchmark and publish it, and Yahoo!’s Cloud
> Serving Benchmark (YCSB) [2] is the most suitable tool for such a purpose.
> I found a recent research about Object-NoSQL Database Mapper (ONDM)
> benchmark [3] which includes Apache Gora and they have produced the
> benchmark source code as ASF 2.0 licensed [4].
>
> Here is an example from Apache Accumulo which is based on YCSB too [5].
>
> What do you think about it? Who wants to join that work apart from me?
>
> Kind Regards,
> Furkan KAMACI
>
>
> [1] https://gora.apache.org
> [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R. Benchmarking
> cloud serving systems with YCSB. In: Proceedings of the 1st ACM symposium
> on Cloud computing - SoCC ’10. Association for Computing Machinery (ACM):
> 2010. p. 143–154, doi:10.1145/1807128.1807152.
> http://dx.doi.org/10.1145/1807128.1807152.
> [3] https://doi.org/10.1186/s13174-016-0052-x
> [4] https://github.com/vreniers/ONDM-Benchmarker
> [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf
>