Hi folks,

This proposal is to add tiered storage to pulsar.

Storing backlogs on bookies for a long time can get expensive. If
there are other cheaper forms of storage available (S3/HDFS), capex
can be reduced by moving older data to this storage.

The proposed design proposes an interface to do this, with an initial
implementation targeted at S3. However the interface is general enough
that HDFS could also be used.

Design doc: 
https://docs.google.com/document/d/1UTMlYA-1XmqIFeyyWPSzva0AmI7vNRsVCEkkde8WYEM/edit?usp=sharing
Prototype/PoC: 
https://github.com/ivankelly/incubator-pulsar/tree/s3-offload-proto

Cheers,
Ivan

Reply via email to