nodece opened a new issue #14529:
URL: https://github.com/apache/pulsar/issues/14529


   <!---
   Instructions for creating a PIP using this issue template:
   
    1. The author(s) of the proposal will create a GitHub issue ticket using 
this template.
       (Optionally, it can be helpful to send a note discussing the proposal to
       [email protected] mailing list before submitting this GitHub issue. 
This discussion can
       help developers gauge interest in the proposed changes before 
formalizing the proposal.)
    2. The author(s) will send a note to the [email protected] mailing list
       to start the discussion, using subject prefix `[PIP] xxx`. To determine 
the appropriate PIP
       number `xxx`, inspect the mailing list 
(https://lists.apache.org/[email protected])
       for the most recent PIP. Add 1 to that PIP's number to get your PIP's 
number.
    3. Based on the discussion and feedback, some changes might be applied by
       the author(s) to the text of the proposal.
    4. Once some consensus is reached, there will be a vote to formally approve
       the proposal. The vote will be held on the [email protected] 
mailing list. Everyone
       is welcome to vote on the proposal, though it will considered to be 
binding
       only the vote of PMC members. It will be required to have a lazy 
majority of
       at least 3 binding +1s votes. The vote should stay open for at least 48 
hours.
    5. When the vote is closed, if the outcome is positive, the state of the
       proposal is updated and the Pull Requests associated with this proposal 
can
       start to get merged into the master branch.
   
   -->
   
   ## Motivation
   
   The cursor data is managed by ZooKeeper/etcd metadata store. When cursor 
data becomes more and more, the data size will increase and will take a lot of 
time to pull the data. Therefore, it is necessary to add compression for the 
cursor, which can reduce the size of data and reduce the time of pulling data.
   
   ## Goal
   
   Support use the LZ4/ZLIB/ZSTD/SNAPPY to compress the ManagedCursorInfo.
   
   ## Implementation
   
   - Cursor compression format
   ```
   [MAGIC_NUMBER] + [METADATA_SIZE] + [METADATA_PAYLOAD] + 
[MANAGED_CURSOR_INFO_PAYLOAD]
   ```
   
   - MAGIC_NUMBER
   Ox4779
   
   - METADATA
   Add a named `ManagedCursorInfoMetadata` message to `MLDataFormats.proto`:
   
   ```
   message ManagedCursorInfoMetadata {
       required CompressionType compressionType = 1;
       required int32 uncompressedSize = 2;
   }
   ```
   
   Currently, these compressions have been supported, we only need to deal with 
compression and decompression of the `ManagedCursorInfo` data:
   
   - Get CursorInfo from the metadata store
    We will check the cursor data header, if it is compressed, we will parse 
the bytes data by compressed format, otherwise by the original way.
   
    - Add/Update CursorInfo to the metadata store
   The default is to use compression if the compression type is specified.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to