Re: Apache Gora Benchmark Module Draft Proposal
Thank you. I will do the final submission later today. Thank you. **Sheriffo Ceesay** On Mon, Apr 8, 2019 at 5:40 PM Kevin Ratnasekera wrote: > Hi Sheriffo, > > The proposal looks very good. I have added one small comment. Please make > sure you do the final submission before the deadline. > > Good luck with your submission :) > > Regards > Kevin > > On Sun, Mar 24, 2019 at 2:14 AM Sheriffo Ceesay > wrote: > > > Hi All, > > > > As advised by Kevin Ratnasekera and Furkan Kamaci, please find below the > > link to the draft proposal for Gora benchmark module. > > > > Please let me if you have any comments to improve the document. (Don't > > worry, I have another copy.) > > > > > > > https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit?usp=sharing > > > > > > Please add your comments using the sidebar comment functionality if > > possible. > > > > Thank you. > > > > > > **Sheriffo Ceesay** > > >
Re: Apache Gora Benchmark Module Draft Proposal
Hi Sheriffo, The proposal looks very good. I have added one small comment. Please make sure you do the final submission before the deadline. Good luck with your submission :) Regards Kevin On Sun, Mar 24, 2019 at 2:14 AM Sheriffo Ceesay wrote: > Hi All, > > As advised by Kevin Ratnasekera and Furkan Kamaci, please find below the > link to the draft proposal for Gora benchmark module. > > Please let me if you have any comments to improve the document. (Don't > worry, I have another copy.) > > > https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit?usp=sharing > > > Please add your comments using the sidebar comment functionality if > possible. > > Thank you. > > > **Sheriffo Ceesay** >
Re: Apache Gora Benchmark
I have updated the Benchmark Module after some suggestions from Renato. So basically, the suggestion was to consider extending YCSB to include Gora since YCSB already have an implementation of other KV stores. So it will be great if a potential mentor could have look at this and give me some feedback. We are currently in the proposal submission period of GSoC timeline, so any comment on the document will really help. Please find below link to the shared Google doc. https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit **Sheriffo Ceesay** On Mon, Mar 25, 2019 at 12:30 PM Sheriffo Ceesay wrote: > Hi Renato, > > Thanks for the reply and the comments on the Google doc. > > I think, adding Gora to YCBS framework will be the best approach. Like, I > mentioned in the shared doc, I will dig more into this and update the > proposal accordingly. > > Thank you. > > > **Sheriffo Ceesay** > > > On Mon, Mar 25, 2019 at 12:05 PM Renato Marroquín Mogrovejo < > renatoj.marroq...@gmail.com> wrote: > >> Hey Sheriffo, >> >> Thanks for sharing this. I went quickly over it, and it looks good >> overall. >> One question I have is the one I left on the proposal as well. The >> proposal >> is about implementing a benckmarking module but why aren't we >> using/integrating with something like YCSB? >> >> I am asking this because it has a few benefits: >> - Most of the operations one would be interested in kv-stores are already >> modeled by YCSB (as you know) >> - With this we would already get support for most key-value stores and we >> wouldn't have to implement it(or support it) later on. >> - We get a benchmark module that is already accepted and understood by >> people using key-value stores. >> >> The resulting deliverables could be the integration (adding Gora to YCSB, >> the module could live in Gora and also could live in YCSB if they want to >> take it), and the scripts to run it. >> What do you guys think? >> >> >> Best, >> >> Renato M. >> >> El dom., 24 mar. 2019 a las 13:05, Sheriffo Ceesay (< >> sneceesa...@gmail.com>) >> escribió: >> >> > Hi Renato, >> > >> > Thanks for the reply. As far as I am concerned all options are on the >> > table. I have shared my draft project proposal with the dev email list >> for >> > comments. I will visit it again and see how best your ideas can be >> added to >> > the implementation. >> > >> > Below is the Google doc file, please feel free to add comments. >> > >> > >> > >> https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit?usp=sharing >> > >> > Thank you. >> > >> > **Sheriffo Ceesay** >> > >> > >> > On Sun, Mar 24, 2019 at 11:08 AM Renato Marroquín Mogrovejo < >> > renatoj.marroq...@gmail.com> wrote: >> > >> > > Hi Sheriffo, >> > > >> > > Thanks for your interest in Gora and in this project. >> > > We have discussed this a bit already and what the important bit is to >> > > figure out Gora's overhead compared to using just the kv stores. >> > > Obviously, we incurr in overheads, but it'd be interesting to know >> where >> > > exactly (most likely serialization) and not just say how slow Gora is. >> > > Ideally, one could fix the easy performance bugs but this might be >> out of >> > > the scope, but anyway, that would be nice. >> > > Another idea would be to actually get the final benchmark run as part >> of >> > > CI? So we know how every change impacts performance. >> > > >> > > >> > > Best, >> > > >> > > Renato M. >> > > El mié., 20 mar. 2019 a las 17:15, sneceesa...@gmail.com (< >> > > sneceesa...@gmail.com>) escribió: >> > > > >> > > > >> > > > >> > > > On 2017/12/23 20:17:12, Furkan KAMACI >> wrote: >> > > > > Hi Fellows, >> > > > > >> > > > > As you know that our project is defined as: >> > > > > >> > > > > "*The Apache Gora™ open source framework provides an in-memory >> data >> > > model >> > > > > and persistence for big data.*[1]" >> > > > > >> > > > > I believe that Apache Gora is a special project and it touches >> many >> > > > > projects. I always wonder the performance of NoSQL DBs as >> individual >> > > and >> > > > > accessed via Apache Gora. >> > > > > >> > > > > I think that we should make a benchmark and publish it, and >> Yahoo!’s >> > > Cloud >> > > > > Serving Benchmark (YCSB) [2] is the most suitable tool for such a >> > > purpose. >> > > > > I found a recent research about Object-NoSQL Database Mapper >> (ONDM) >> > > > > benchmark [3] which includes Apache Gora and they have produced >> the >> > > > > benchmark source code as ASF 2.0 licensed [4]. >> > > > > >> > > > > Here is an example from Apache Accumulo which is based on YCSB too >> > [5]. >> > > > > >> > > > > What do you think about it? Who wants to join that work apart from >> > me? >> > > > > >> > > > > Kind Regards, >> > > > > Furkan KAMACI >> > > > > >> > > > > >> > > > > [1] https://gora.apache.org >> > > > > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R. >> > > Benchmarking >> > > > > cloud serving system
Re: Apache Gora Benchmark
Hi Renato, Thanks for the reply and the comments on the Google doc. I think, adding Gora to YCBS framework will be the best approach. Like, I mentioned in the shared doc, I will dig more into this and update the proposal accordingly. Thank you. **Sheriffo Ceesay** On Mon, Mar 25, 2019 at 12:05 PM Renato Marroquín Mogrovejo < renatoj.marroq...@gmail.com> wrote: > Hey Sheriffo, > > Thanks for sharing this. I went quickly over it, and it looks good overall. > One question I have is the one I left on the proposal as well. The proposal > is about implementing a benckmarking module but why aren't we > using/integrating with something like YCSB? > > I am asking this because it has a few benefits: > - Most of the operations one would be interested in kv-stores are already > modeled by YCSB (as you know) > - With this we would already get support for most key-value stores and we > wouldn't have to implement it(or support it) later on. > - We get a benchmark module that is already accepted and understood by > people using key-value stores. > > The resulting deliverables could be the integration (adding Gora to YCSB, > the module could live in Gora and also could live in YCSB if they want to > take it), and the scripts to run it. > What do you guys think? > > > Best, > > Renato M. > > El dom., 24 mar. 2019 a las 13:05, Sheriffo Ceesay ( >) > escribió: > > > Hi Renato, > > > > Thanks for the reply. As far as I am concerned all options are on the > > table. I have shared my draft project proposal with the dev email list > for > > comments. I will visit it again and see how best your ideas can be added > to > > the implementation. > > > > Below is the Google doc file, please feel free to add comments. > > > > > > > https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit?usp=sharing > > > > Thank you. > > > > **Sheriffo Ceesay** > > > > > > On Sun, Mar 24, 2019 at 11:08 AM Renato Marroquín Mogrovejo < > > renatoj.marroq...@gmail.com> wrote: > > > > > Hi Sheriffo, > > > > > > Thanks for your interest in Gora and in this project. > > > We have discussed this a bit already and what the important bit is to > > > figure out Gora's overhead compared to using just the kv stores. > > > Obviously, we incurr in overheads, but it'd be interesting to know > where > > > exactly (most likely serialization) and not just say how slow Gora is. > > > Ideally, one could fix the easy performance bugs but this might be out > of > > > the scope, but anyway, that would be nice. > > > Another idea would be to actually get the final benchmark run as part > of > > > CI? So we know how every change impacts performance. > > > > > > > > > Best, > > > > > > Renato M. > > > El mié., 20 mar. 2019 a las 17:15, sneceesa...@gmail.com (< > > > sneceesa...@gmail.com>) escribió: > > > > > > > > > > > > > > > > On 2017/12/23 20:17:12, Furkan KAMACI > wrote: > > > > > Hi Fellows, > > > > > > > > > > As you know that our project is defined as: > > > > > > > > > > "*The Apache Gora™ open source framework provides an in-memory data > > > model > > > > > and persistence for big data.*[1]" > > > > > > > > > > I believe that Apache Gora is a special project and it touches many > > > > > projects. I always wonder the performance of NoSQL DBs as > individual > > > and > > > > > accessed via Apache Gora. > > > > > > > > > > I think that we should make a benchmark and publish it, and > Yahoo!’s > > > Cloud > > > > > Serving Benchmark (YCSB) [2] is the most suitable tool for such a > > > purpose. > > > > > I found a recent research about Object-NoSQL Database Mapper (ONDM) > > > > > benchmark [3] which includes Apache Gora and they have produced the > > > > > benchmark source code as ASF 2.0 licensed [4]. > > > > > > > > > > Here is an example from Apache Accumulo which is based on YCSB too > > [5]. > > > > > > > > > > What do you think about it? Who wants to join that work apart from > > me? > > > > > > > > > > Kind Regards, > > > > > Furkan KAMACI > > > > > > > > > > > > > > > [1] https://gora.apache.org > > > > > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R. > > > Benchmarking > > > > > cloud serving systems with YCSB. In: Proceedings of the 1st ACM > > > symposium > > > > > on Cloud computing - SoCC ’10. Association for Computing Machinery > > > (ACM): > > > > > 2010. p. 143–154, doi:10.1145/1807128.1807152. > > > > > http://dx.doi.org/10.1145/1807128.1807152. > > > > > [3] https://doi.org/10.1186/s13174-016-0052-x > > > > > [4] https://github.com/vreniers/ONDM-Benchmarker > > > > > [5] > https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf > > > > > > > > > > > > > Hi All, I was advised by Kevin Ratnasekera to start or reignite this > > > discussion. I am currently going over the documentation, installation > and > > > familiarising myself with the code base. Any good pointers here will be > > > helpful. > > > > > >
Re: Apache Gora Benchmark
Hey Sheriffo, Thanks for sharing this. I went quickly over it, and it looks good overall. One question I have is the one I left on the proposal as well. The proposal is about implementing a benckmarking module but why aren't we using/integrating with something like YCSB? I am asking this because it has a few benefits: - Most of the operations one would be interested in kv-stores are already modeled by YCSB (as you know) - With this we would already get support for most key-value stores and we wouldn't have to implement it(or support it) later on. - We get a benchmark module that is already accepted and understood by people using key-value stores. The resulting deliverables could be the integration (adding Gora to YCSB, the module could live in Gora and also could live in YCSB if they want to take it), and the scripts to run it. What do you guys think? Best, Renato M. El dom., 24 mar. 2019 a las 13:05, Sheriffo Ceesay () escribió: > Hi Renato, > > Thanks for the reply. As far as I am concerned all options are on the > table. I have shared my draft project proposal with the dev email list for > comments. I will visit it again and see how best your ideas can be added to > the implementation. > > Below is the Google doc file, please feel free to add comments. > > > https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit?usp=sharing > > Thank you. > > **Sheriffo Ceesay** > > > On Sun, Mar 24, 2019 at 11:08 AM Renato Marroquín Mogrovejo < > renatoj.marroq...@gmail.com> wrote: > > > Hi Sheriffo, > > > > Thanks for your interest in Gora and in this project. > > We have discussed this a bit already and what the important bit is to > > figure out Gora's overhead compared to using just the kv stores. > > Obviously, we incurr in overheads, but it'd be interesting to know where > > exactly (most likely serialization) and not just say how slow Gora is. > > Ideally, one could fix the easy performance bugs but this might be out of > > the scope, but anyway, that would be nice. > > Another idea would be to actually get the final benchmark run as part of > > CI? So we know how every change impacts performance. > > > > > > Best, > > > > Renato M. > > El mié., 20 mar. 2019 a las 17:15, sneceesa...@gmail.com (< > > sneceesa...@gmail.com>) escribió: > > > > > > > > > > > > On 2017/12/23 20:17:12, Furkan KAMACI wrote: > > > > Hi Fellows, > > > > > > > > As you know that our project is defined as: > > > > > > > > "*The Apache Gora™ open source framework provides an in-memory data > > model > > > > and persistence for big data.*[1]" > > > > > > > > I believe that Apache Gora is a special project and it touches many > > > > projects. I always wonder the performance of NoSQL DBs as individual > > and > > > > accessed via Apache Gora. > > > > > > > > I think that we should make a benchmark and publish it, and Yahoo!’s > > Cloud > > > > Serving Benchmark (YCSB) [2] is the most suitable tool for such a > > purpose. > > > > I found a recent research about Object-NoSQL Database Mapper (ONDM) > > > > benchmark [3] which includes Apache Gora and they have produced the > > > > benchmark source code as ASF 2.0 licensed [4]. > > > > > > > > Here is an example from Apache Accumulo which is based on YCSB too > [5]. > > > > > > > > What do you think about it? Who wants to join that work apart from > me? > > > > > > > > Kind Regards, > > > > Furkan KAMACI > > > > > > > > > > > > [1] https://gora.apache.org > > > > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R. > > Benchmarking > > > > cloud serving systems with YCSB. In: Proceedings of the 1st ACM > > symposium > > > > on Cloud computing - SoCC ’10. Association for Computing Machinery > > (ACM): > > > > 2010. p. 143–154, doi:10.1145/1807128.1807152. > > > > http://dx.doi.org/10.1145/1807128.1807152. > > > > [3] https://doi.org/10.1186/s13174-016-0052-x > > > > [4] https://github.com/vreniers/ONDM-Benchmarker > > > > [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf > > > > > > > > > > Hi All, I was advised by Kevin Ratnasekera to start or reignite this > > discussion. I am currently going over the documentation, installation and > > familiarising myself with the code base. Any good pointers here will be > > helpful. > > >
Re: Apache Gora Benchmark
Hi Renato, Thanks for the reply. As far as I am concerned all options are on the table. I have shared my draft project proposal with the dev email list for comments. I will visit it again and see how best your ideas can be added to the implementation. Below is the Google doc file, please feel free to add comments. https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit?usp=sharing Thank you. **Sheriffo Ceesay** On Sun, Mar 24, 2019 at 11:08 AM Renato Marroquín Mogrovejo < renatoj.marroq...@gmail.com> wrote: > Hi Sheriffo, > > Thanks for your interest in Gora and in this project. > We have discussed this a bit already and what the important bit is to > figure out Gora's overhead compared to using just the kv stores. > Obviously, we incurr in overheads, but it'd be interesting to know where > exactly (most likely serialization) and not just say how slow Gora is. > Ideally, one could fix the easy performance bugs but this might be out of > the scope, but anyway, that would be nice. > Another idea would be to actually get the final benchmark run as part of > CI? So we know how every change impacts performance. > > > Best, > > Renato M. > El mié., 20 mar. 2019 a las 17:15, sneceesa...@gmail.com (< > sneceesa...@gmail.com>) escribió: > > > > > > > > On 2017/12/23 20:17:12, Furkan KAMACI wrote: > > > Hi Fellows, > > > > > > As you know that our project is defined as: > > > > > > "*The Apache Gora™ open source framework provides an in-memory data > model > > > and persistence for big data.*[1]" > > > > > > I believe that Apache Gora is a special project and it touches many > > > projects. I always wonder the performance of NoSQL DBs as individual > and > > > accessed via Apache Gora. > > > > > > I think that we should make a benchmark and publish it, and Yahoo!’s > Cloud > > > Serving Benchmark (YCSB) [2] is the most suitable tool for such a > purpose. > > > I found a recent research about Object-NoSQL Database Mapper (ONDM) > > > benchmark [3] which includes Apache Gora and they have produced the > > > benchmark source code as ASF 2.0 licensed [4]. > > > > > > Here is an example from Apache Accumulo which is based on YCSB too [5]. > > > > > > What do you think about it? Who wants to join that work apart from me? > > > > > > Kind Regards, > > > Furkan KAMACI > > > > > > > > > [1] https://gora.apache.org > > > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R. > Benchmarking > > > cloud serving systems with YCSB. In: Proceedings of the 1st ACM > symposium > > > on Cloud computing - SoCC ’10. Association for Computing Machinery > (ACM): > > > 2010. p. 143–154, doi:10.1145/1807128.1807152. > > > http://dx.doi.org/10.1145/1807128.1807152. > > > [3] https://doi.org/10.1186/s13174-016-0052-x > > > [4] https://github.com/vreniers/ONDM-Benchmarker > > > [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf > > > > > > > Hi All, I was advised by Kevin Ratnasekera to start or reignite this > discussion. I am currently going over the documentation, installation and > familiarising myself with the code base. Any good pointers here will be > helpful. >
Re: Apache Gora Benchmark
Hi Sheriffo, Thanks for your interest in Gora and in this project. We have discussed this a bit already and what the important bit is to figure out Gora's overhead compared to using just the kv stores. Obviously, we incurr in overheads, but it'd be interesting to know where exactly (most likely serialization) and not just say how slow Gora is. Ideally, one could fix the easy performance bugs but this might be out of the scope, but anyway, that would be nice. Another idea would be to actually get the final benchmark run as part of CI? So we know how every change impacts performance. Best, Renato M. El mié., 20 mar. 2019 a las 17:15, sneceesa...@gmail.com (< sneceesa...@gmail.com>) escribió: > > > > On 2017/12/23 20:17:12, Furkan KAMACI wrote: > > Hi Fellows, > > > > As you know that our project is defined as: > > > > "*The Apache Gora™ open source framework provides an in-memory data model > > and persistence for big data.*[1]" > > > > I believe that Apache Gora is a special project and it touches many > > projects. I always wonder the performance of NoSQL DBs as individual and > > accessed via Apache Gora. > > > > I think that we should make a benchmark and publish it, and Yahoo!’s Cloud > > Serving Benchmark (YCSB) [2] is the most suitable tool for such a purpose. > > I found a recent research about Object-NoSQL Database Mapper (ONDM) > > benchmark [3] which includes Apache Gora and they have produced the > > benchmark source code as ASF 2.0 licensed [4]. > > > > Here is an example from Apache Accumulo which is based on YCSB too [5]. > > > > What do you think about it? Who wants to join that work apart from me? > > > > Kind Regards, > > Furkan KAMACI > > > > > > [1] https://gora.apache.org > > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R. Benchmarking > > cloud serving systems with YCSB. In: Proceedings of the 1st ACM symposium > > on Cloud computing - SoCC ’10. Association for Computing Machinery (ACM): > > 2010. p. 143–154, doi:10.1145/1807128.1807152. > > http://dx.doi.org/10.1145/1807128.1807152. > > [3] https://doi.org/10.1186/s13174-016-0052-x > > [4] https://github.com/vreniers/ONDM-Benchmarker > > [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf > > > > Hi All, I was advised by Kevin Ratnasekera to start or reignite this discussion. I am currently going over the documentation, installation and familiarising myself with the code base. Any good pointers here will be helpful.
Re: Apache Gora Benchmark
On 2017/12/23 20:17:12, Furkan KAMACI wrote: > Hi Fellows, > > As you know that our project is defined as: > > "*The Apache Gora™ open source framework provides an in-memory data model > and persistence for big data.*[1]" > > I believe that Apache Gora is a special project and it touches many > projects. I always wonder the performance of NoSQL DBs as individual and > accessed via Apache Gora. > > I think that we should make a benchmark and publish it, and Yahoo!’s Cloud > Serving Benchmark (YCSB) [2] is the most suitable tool for such a purpose. > I found a recent research about Object-NoSQL Database Mapper (ONDM) > benchmark [3] which includes Apache Gora and they have produced the > benchmark source code as ASF 2.0 licensed [4]. > > Here is an example from Apache Accumulo which is based on YCSB too [5]. > > What do you think about it? Who wants to join that work apart from me? > > Kind Regards, > Furkan KAMACI > > > [1] https://gora.apache.org > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R. Benchmarking > cloud serving systems with YCSB. In: Proceedings of the 1st ACM symposium > on Cloud computing - SoCC ’10. Association for Computing Machinery (ACM): > 2010. p. 143–154, doi:10.1145/1807128.1807152. > http://dx.doi.org/10.1145/1807128.1807152. > [3] https://doi.org/10.1186/s13174-016-0052-x > [4] https://github.com/vreniers/ONDM-Benchmarker > [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf > Hi All, I was advised by Kevin Ratnasekera to start or reignite this discussion. I am currently going over the documentation, installation and familiarising myself with the code base. Any good pointers here will be helpful.
Re: Apache Gora Benchmark
Hi, I've created https://issues.apache.org/jira/browse/GORA-532 for creating a benchmarking module. Kind Regards, Furkan KAMACI On Thu, Feb 15, 2018 at 8:00 AM, Furkan KAMACI wrote: > Hi Fellows, > > Let’s decide what do we need for the new release? > > Kind Regards, > Furkan KAMACI > > > 26 Ara 2017 Sal, saat 22:53 tarihinde Furkan KAMACI < > furkankam...@gmail.com> şunu yazdı: > >> Hi Lewis, >> >> That's cool! We can make a release and than work on that. >> >> BTW, here is the improved version of YCSB: YCSB++ [1]. We can use it for >> our purpose. >> >> [1] http://www.cs.cmu.edu/~wtantisi/files/tablebenchmark-pdl11-talk.pdf >> >> >> Kind Regards, >> Furkan KAMACI >> >> On Tue, Dec 26, 2017 at 4:38 PM, lewis john mcgibbney > > wrote: >> >>> Hi Furkan, >>> I think it's an excellent idea. I don't have much time to do it however I >>> do have a comment. >>> One issue I see is that we should really be benchmarking recent Gora >>> modules. This requires quite a few updates across the board so that is >>> where I suggest we start. >>> If we can begin working our way through this, make a release of Gora, >>> then >>> we can come back and seriously approach a paper for a workshop or >>> journal. >>> What do you think? >>> lewis >>> >>> On Sun, Dec 24, 2017 at 12:59 PM, >>> wrote: >>> >>> > >>> > From: Furkan KAMACI >>> > To: "" >>> > Cc: >>> > Bcc: >>> > Date: Sat, 23 Dec 2017 22:17:12 +0200 >>> > Subject: Apache Gora Benchmark >>> > Hi Fellows, >>> > >>> > As you know that our project is defined as: >>> > >>> > "*The Apache Gora™ open source framework provides an in-memory data >>> model >>> > and persistence for big data.*[1]" >>> > >>> > I believe that Apache Gora is a special project and it touches many >>> > projects. I always wonder the performance of NoSQL DBs as individual >>> and >>> > accessed via Apache Gora. >>> > >>> > I think that we should make a benchmark and publish it, and Yahoo!’s >>> Cloud >>> > Serving Benchmark (YCSB) [2] is the most suitable tool for such a >>> purpose. >>> > I found a recent research about Object-NoSQL Database Mapper (ONDM) >>> > benchmark [3] which includes Apache Gora and they have produced the >>> > benchmark source code as ASF 2.0 licensed [4]. >>> > >>> > Here is an example from Apache Accumulo which is based on YCSB too [5]. >>> > >>> > What do you think about it? Who wants to join that work apart from me? >>> > >>> > Kind Regards, >>> > Furkan KAMACI >>> > >>> > >>> > [1] https://gora.apache.org >>> > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R. >>> Benchmarking >>> > cloud serving systems with YCSB. In: Proceedings of the 1st ACM >>> symposium >>> > on Cloud computing - SoCC ’10. Association for Computing Machinery >>> (ACM): >>> > 2010. p. 143–154, doi:10.1145/1807128.1807152. >>> > http://dx.doi.org/10.1145/1807128.1807152. >>> > [3] https://doi.org/10.1186/s13174-016-0052-x >>> > [4] https://github.com/vreniers/ONDM-Benchmarker >>> > [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf >>> > >>> > >>> >> >>
Re: Apache Gora Benchmark
Hi Fellows, Let’s decide what do we need for the new release? Kind Regards, Furkan KAMACI 26 Ara 2017 Sal, saat 22:53 tarihinde Furkan KAMACI şunu yazdı: > Hi Lewis, > > That's cool! We can make a release and than work on that. > > BTW, here is the improved version of YCSB: YCSB++ [1]. We can use it for > our purpose. > > [1] http://www.cs.cmu.edu/~wtantisi/files/tablebenchmark-pdl11-talk.pdf > > > Kind Regards, > Furkan KAMACI > > On Tue, Dec 26, 2017 at 4:38 PM, lewis john mcgibbney > wrote: > >> Hi Furkan, >> I think it's an excellent idea. I don't have much time to do it however I >> do have a comment. >> One issue I see is that we should really be benchmarking recent Gora >> modules. This requires quite a few updates across the board so that is >> where I suggest we start. >> If we can begin working our way through this, make a release of Gora, then >> we can come back and seriously approach a paper for a workshop or journal. >> What do you think? >> lewis >> >> On Sun, Dec 24, 2017 at 12:59 PM, >> wrote: >> >> > >> > From: Furkan KAMACI >> > To: "" >> > Cc: >> > Bcc: >> > Date: Sat, 23 Dec 2017 22:17:12 +0200 >> > Subject: Apache Gora Benchmark >> > Hi Fellows, >> > >> > As you know that our project is defined as: >> > >> > "*The Apache Gora™ open source framework provides an in-memory data >> model >> > and persistence for big data.*[1]" >> > >> > I believe that Apache Gora is a special project and it touches many >> > projects. I always wonder the performance of NoSQL DBs as individual and >> > accessed via Apache Gora. >> > >> > I think that we should make a benchmark and publish it, and Yahoo!’s >> Cloud >> > Serving Benchmark (YCSB) [2] is the most suitable tool for such a >> purpose. >> > I found a recent research about Object-NoSQL Database Mapper (ONDM) >> > benchmark [3] which includes Apache Gora and they have produced the >> > benchmark source code as ASF 2.0 licensed [4]. >> > >> > Here is an example from Apache Accumulo which is based on YCSB too [5]. >> > >> > What do you think about it? Who wants to join that work apart from me? >> > >> > Kind Regards, >> > Furkan KAMACI >> > >> > >> > [1] https://gora.apache.org >> > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R. >> Benchmarking >> > cloud serving systems with YCSB. In: Proceedings of the 1st ACM >> symposium >> > on Cloud computing - SoCC ’10. Association for Computing Machinery >> (ACM): >> > 2010. p. 143–154, doi:10.1145/1807128.1807152. >> > http://dx.doi.org/10.1145/1807128.1807152. >> > [3] https://doi.org/10.1186/s13174-016-0052-x >> > [4] https://github.com/vreniers/ONDM-Benchmarker >> > [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf >> > >> > >> > >
Re: Apache Gora Benchmark
Hi Lewis, That's cool! We can make a release and than work on that. BTW, here is the improved version of YCSB: YCSB++ [1]. We can use it for our purpose. [1] http://www.cs.cmu.edu/~wtantisi/files/tablebenchmark-pdl11-talk.pdf Kind Regards, Furkan KAMACI On Tue, Dec 26, 2017 at 4:38 PM, lewis john mcgibbney wrote: > Hi Furkan, > I think it's an excellent idea. I don't have much time to do it however I > do have a comment. > One issue I see is that we should really be benchmarking recent Gora > modules. This requires quite a few updates across the board so that is > where I suggest we start. > If we can begin working our way through this, make a release of Gora, then > we can come back and seriously approach a paper for a workshop or journal. > What do you think? > lewis > > On Sun, Dec 24, 2017 at 12:59 PM, wrote: > > > > > From: Furkan KAMACI > > To: "" > > Cc: > > Bcc: > > Date: Sat, 23 Dec 2017 22:17:12 +0200 > > Subject: Apache Gora Benchmark > > Hi Fellows, > > > > As you know that our project is defined as: > > > > "*The Apache Gora™ open source framework provides an in-memory data model > > and persistence for big data.*[1]" > > > > I believe that Apache Gora is a special project and it touches many > > projects. I always wonder the performance of NoSQL DBs as individual and > > accessed via Apache Gora. > > > > I think that we should make a benchmark and publish it, and Yahoo!’s > Cloud > > Serving Benchmark (YCSB) [2] is the most suitable tool for such a > purpose. > > I found a recent research about Object-NoSQL Database Mapper (ONDM) > > benchmark [3] which includes Apache Gora and they have produced the > > benchmark source code as ASF 2.0 licensed [4]. > > > > Here is an example from Apache Accumulo which is based on YCSB too [5]. > > > > What do you think about it? Who wants to join that work apart from me? > > > > Kind Regards, > > Furkan KAMACI > > > > > > [1] https://gora.apache.org > > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R. > Benchmarking > > cloud serving systems with YCSB. In: Proceedings of the 1st ACM symposium > > on Cloud computing - SoCC ’10. Association for Computing Machinery (ACM): > > 2010. p. 143–154, doi:10.1145/1807128.1807152. > > http://dx.doi.org/10.1145/1807128.1807152. > > [3] https://doi.org/10.1186/s13174-016-0052-x > > [4] https://github.com/vreniers/ONDM-Benchmarker > > [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf > > > > >
Re: Apache Gora Benchmark
Hi Furkan, I think it's an excellent idea. I don't have much time to do it however I do have a comment. One issue I see is that we should really be benchmarking recent Gora modules. This requires quite a few updates across the board so that is where I suggest we start. If we can begin working our way through this, make a release of Gora, then we can come back and seriously approach a paper for a workshop or journal. What do you think? lewis On Sun, Dec 24, 2017 at 12:59 PM, wrote: > > From: Furkan KAMACI > To: "" > Cc: > Bcc: > Date: Sat, 23 Dec 2017 22:17:12 +0200 > Subject: Apache Gora Benchmark > Hi Fellows, > > As you know that our project is defined as: > > "*The Apache Gora™ open source framework provides an in-memory data model > and persistence for big data.*[1]" > > I believe that Apache Gora is a special project and it touches many > projects. I always wonder the performance of NoSQL DBs as individual and > accessed via Apache Gora. > > I think that we should make a benchmark and publish it, and Yahoo!’s Cloud > Serving Benchmark (YCSB) [2] is the most suitable tool for such a purpose. > I found a recent research about Object-NoSQL Database Mapper (ONDM) > benchmark [3] which includes Apache Gora and they have produced the > benchmark source code as ASF 2.0 licensed [4]. > > Here is an example from Apache Accumulo which is based on YCSB too [5]. > > What do you think about it? Who wants to join that work apart from me? > > Kind Regards, > Furkan KAMACI > > > [1] https://gora.apache.org > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R. Benchmarking > cloud serving systems with YCSB. In: Proceedings of the 1st ACM symposium > on Cloud computing - SoCC ’10. Association for Computing Machinery (ACM): > 2010. p. 143–154, doi:10.1145/1807128.1807152. > http://dx.doi.org/10.1145/1807128.1807152. > [3] https://doi.org/10.1186/s13174-016-0052-x > [4] https://github.com/vreniers/ONDM-Benchmarker > [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf > >
Re: Apache Gora Benchmark
Hi, Furkan. Thank you very much for the links and the information. I find them VERY interesting :) I have my hands full, but I would like to stay updated =) Maybe I will have some time in the mid term. I really appreciate your links! Regards, Alfonso 2017-12-23 19:17 GMT-01:00 Furkan KAMACI : > Hi Fellows, > > As you know that our project is defined as: > > "*The Apache Gora™ open source framework provides an in-memory data model > and persistence for big data.*[1]" > > I believe that Apache Gora is a special project and it touches many > projects. I always wonder the performance of NoSQL DBs as individual and > accessed via Apache Gora. > > I think that we should make a benchmark and publish it, and Yahoo!’s Cloud > Serving Benchmark (YCSB) [2] is the most suitable tool for such a purpose. > I found a recent research about Object-NoSQL Database Mapper (ONDM) > benchmark [3] which includes Apache Gora and they have produced the > benchmark source code as ASF 2.0 licensed [4]. > > Here is an example from Apache Accumulo which is based on YCSB too [5]. > > What do you think about it? Who wants to join that work apart from me? > > Kind Regards, > Furkan KAMACI > > > [1] https://gora.apache.org > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R. Benchmarking > cloud serving systems with YCSB. In: Proceedings of the 1st ACM symposium > on Cloud computing - SoCC ’10. Association for Computing Machinery (ACM): > 2010. p. 143–154, doi:10.1145/1807128.1807152. > http://dx.doi.org/10.1145/1807128.1807152. > [3] https://doi.org/10.1186/s13174-016-0052-x > [4] https://github.com/vreniers/ONDM-Benchmarker > [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf >