Hi folks,

This proposal is to add tiered storage to Pulsar.
Storing backlogs on bookies for a long time can get expensive. If other, cheaper forms of storage are available (S3/HDFS), capex can be reduced by moving older data to that storage. The design proposes an interface to do this, with an initial implementation targeted at S3. However, the interface is general enough that HDFS could also be used.

Design doc: https://docs.google.com/document/d/1UTMlYA-1XmqIFeyyWPSzva0AmI7vNRsVCEkkde8WYEM/edit?usp=sharing
Prototype/PoC: https://github.com/ivankelly/incubator-pulsar/tree/s3-offload-proto

Cheers,
Ivan