On Mon, Jul 3, 2017 at 1:35 AM, Enrico Olivelli <eolive...@gmail.com> wrote:

> 2017-07-03 7:00 GMT+02:00 Sijie Guo <guosi...@gmail.com>:
> > Hi all,
> >
> > It has been almost 6-7 years since Apache BookKeeper was born. Apache
> > BookKeeper has already grown beyond a WAL system. Both Twitter and Yahoo
> > have used it as their storage foundation for their messaging systems,
> > Salesforce is using it for storage service. We also started talking
> Apache
> > BookKeeper as a storage service since 2016 ([1][2]).
> >
> > I am thinking of changing the description of Apache BookKeeper from a WAL
> > system to "a High Performance and Low Latency Storage Service (that
> > optimized for immutable/append-only data)" in the new website that we are
> > building for BP-11
> > <https://cwiki.apache.org/confluence/pages/viewpage.
> action?pageId=71012301>.
> > This will help us to bring more use cases/adoptions to the project and
> help
> > grow the community.
> >
> > Any thoughts?
>
> My two cents
>
> Honestly when I found BookKeeper I was very happy because I found an
> "original" building block to build replicated state machines.
> I think that the main soul of BK is exactly to be a WAL and this is
> really "original".
>
> From my point of view the "key features" of BookKeeper are "Fencing"
> and "Last-Add-Confirmed protocol"
>
> BookKeeper is really good at storing data, but IMHO it is because it
> has been designed and implemented by very skilled engineers,
> BookKeeper needs to be "fast", because in order to provide a fast WAL
> you have to give an ultra-fast storage, because the essence of a WAL
> is  "durability" and usually "durable" comes together with 'sync' and
> so with 'slow' .
>
> I am not a "marketing expert" but IMHO we should stress on the
> distinctive features of BK in respect to other softwares.
>
> I am not against the proposed change but as an user I wanted to point
> that I happy with BK because it is the most powerful distributed WAL
> (and maybe it is the unique in the opensource/free world)
>
> I would like to write in the website that BookKeeper is the real
> answer to whom who are looking for a distributed WAL.


Agree, we should make a clear case for distributed WAL.

It is worth just putting down all the use cases that BookKeeper has
supported.

- WAL (e.g. HDFS NameNode)
- Message Store (e.g. Apache Pulsar, Twitter Pub/Sub via DistributedLog)
- Offset/Cursor Store (e.g. Apache Pulsar stores cursors in ledgers)
- Object/Blob Store (e.g. in replicated state machine, storing state
machine snapshots in ledgers. We used this pattern at distributedlog based
replicated state machines.)
- ...

They are not all typical WAL use cases. But the common thing on all these
use cases - they are using bookkeeper as an append-only/immutable store.

- Sijie




>
>
> -- Enrico
>
>
>
> >
> > [1]
> > https://www.slideshare.net/hustlmsp/apache-bookkeeper-a-
> high-performance-and-low-latency-storage-service
> > [2] https://www.slideshare.net/jujjuri/apache-con2016final
>

Reply via email to