Hi Renato,

Thanks for the reply. As far as I am concerned, all options are on the table. I have shared my draft project proposal with the dev email list for comments. I will revisit it and see how best your ideas can be added to the implementation.

Below is the Google doc file, please feel free to add comments.

https://docs.google.com/document/d/1djelY4yVwTuWPA310E_JBinOPnt5PJh3x67z0ZxgBLg/edit?usp=sharing

Thank you.

**Sheriffo Ceesay**

On Sun, Mar 24, 2019 at 11:08 AM Renato Marroquín Mogrovejo <renatoj.marroq...@gmail.com> wrote:

> Hi Sheriffo,
>
> Thanks for your interest in Gora and in this project.
> We have discussed this a bit already, and the important bit is to
> figure out Gora's overhead compared to using just the kv stores.
> Obviously, we incur overheads, but it'd be interesting to know where
> exactly (most likely serialization) and not just say how slow Gora is.
> Ideally, one could fix the easy performance bugs; this might be out of
> scope, but it would be nice anyway.
> Another idea would be to actually get the final benchmark run as part
> of CI, so we know how every change impacts performance.
>
> Best,
>
> Renato M.
>
> On Wed, Mar 20, 2019 at 17:15, sneceesa...@gmail.com
> <sneceesa...@gmail.com> wrote:
>
> > On 2017/12/23 20:17:12, Furkan KAMACI <furkankam...@gmail.com> wrote:
> > > Hi Fellows,
> > >
> > > As you know, our project is defined as:
> > >
> > > "*The Apache Gora™ open source framework provides an in-memory data
> > > model and persistence for big data.*[1]"
> > >
> > > I believe that Apache Gora is a special project and it touches many
> > > projects. I always wonder about the performance of NoSQL DBs when
> > > used individually and when accessed via Apache Gora.
> > >
> > > I think that we should make a benchmark and publish it, and Yahoo!’s
> > > Cloud Serving Benchmark (YCSB) [2] is the most suitable tool for
> > > such a purpose. I found a recent study on an Object-NoSQL Database
> > > Mapper (ONDM) benchmark [3] which includes Apache Gora, and the
> > > authors have released the benchmark source code as ASF 2.0
> > > licensed [4].
> > >
> > > Here is an example from Apache Accumulo which is based on YCSB
> > > too [5].
> > >
> > > What do you think about it? Who wants to join that work apart from
> > > me?
> > >
> > > Kind Regards,
> > > Furkan KAMACI
> > >
> > > [1] https://gora.apache.org
> > > [2] Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R.
> > > Benchmarking cloud serving systems with YCSB. In: Proceedings of the
> > > 1st ACM symposium on Cloud computing - SoCC ’10. Association for
> > > Computing Machinery (ACM): 2010. p. 143–154,
> > > doi:10.1145/1807128.1807152.
> > > http://dx.doi.org/10.1145/1807128.1807152
> > > [3] https://doi.org/10.1186/s13174-016-0052-x
> > > [4] https://github.com/vreniers/ONDM-Benchmarker
> > > [5] https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf
> >
> > Hi All,
> >
> > I was advised by Kevin Ratnasekera to start or reignite this
> > discussion. I am currently going over the documentation and
> > installation, and familiarising myself with the code base. Any good
> > pointers here will be helpful.