poorbarcode commented on code in PR #21118:
URL: https://github.com/apache/pulsar/pull/21118#discussion_r1329630099


##########
pip/pip-299.md:
##########
@@ -0,0 +1,83 @@
+# Background knowledge
+
+The config `managedLedgerMaxUnackedRangesToPersist`
+
+Indicates the number of `acknowledgment holes` that are going to be 
persistently stored. When acknowledging out of order, a consumer will leave 
holes that are supposed to be quickly filled by acking all the messages. The 
information of which messages are acknowledged is persisted by compressing in 
`ranges` of messages that were acknowledged. After the maximum number of ranges 
is reached, the information will only be tracked in memory, and messages will 
be redelivered in case of crashes.
+
+The cursor metadata contains the following three data:
+- Subscription properties(Usually, this is small); this data part only 
persists to the ZK node.
+- The last sequence ID of each producer. It only exists for the cursor 
`pulsar.dedup`. If a topic has many, many producers, this part of the data will 
be large. See 
[PIP-6:-Guaranteed-Message-Deduplication](https://github.com/apache/pulsar/wiki/PIP-6:-Guaranteed-Message-Deduplication)
 for more details.
+- Individual Deleted Messages(including the acknowledgment of batched 
messages). This part of the data occupies most of the cursor metadata's space, 
which is the focus of this proposal.
+
+Differ with Kafka: Pulsar supports [individual 
acknowledgment](https://pulsar.apache.org/docs/2.11.x/concepts-messaging/#acknowledgment)
 (just like ack `{pos-1, pos-3, pos-5}`), so instead of a pointer(acknowledged 
on the left and un-acknowledged on the right), Pulsar needs to persist the 
acknowledgment state of each message, we call these records `Individual Deleted 
Messages.`
+
+The current persistence mechanism of the cursor metadata(including `Individual 
Deleted Messages`) works like this:
+1. Write the data of cursor metadata(including `Individual Deleted Messages`) 
to BK in one Entry; by default, the maximum size of the Entry is 5MB.
+2. Write the data of cursor metadata(optional to include `Individual Deleted 
Messages`) to the Metadata Store(such as ZK) if BK-Write fails; data of a 
Metadata Store Node that is less than 10MB is recommended. Since writing large 
chunks of data to the Metadata Store frequently makes the Metadata Store work 
unstable, this is only a backstop measure.
+
+Is 5MB enough? `Individual Deleted Messages` consists of Position_Rang(each 
Position_Rang occupies 32 bytes; the implementation will not be explained in 
this proposal). This means that the Broker can persist `5m / 32bytes` number of 
Position_Rang for each Subscription, and there is an additional compression 
mechanism at work, so it is sufficient for almost all scenarios except the 
following three scenarios:
+- Client Miss Acknowledges: Clients receive many messages, and ack some of 
them, the rest still need to be acknowledged due to errors or other reasons. As 
time goes on, more and more records will be staying there.
+- Delay Messages: Long-delayed and short-delayed messages are mixed, with only 
the short-delayed message successfully consumed and the long-delayed message 
not delivered. As time goes on, more and more records will be staying there.
+- Large Number of Consumers: If the number of consumers is large and each has 
some discrete ack records, all add up to a large number.
+- Large Number of Producers: If the number of producers is large, there might 
be a large data of Last Sequence ID to persist. This scenario only exists on 
the `pulsar.dedup` cursor.
+
+The config `managedLedgerMaxUnackedRangesToPersist`
+If the cursor metadata is too large to persist, the Broker will persist only 
part of the data according to the following priorities.
+- Subscription Properties. This part can't be split up; persist will fail if 
this part is too large to persist.
+- Last sequence ID of producers. This part can't be split up; persist will 
fail if this part is too large to persist.
+- Individual Deleted Message. If it is too large, only one part persists, and 
then the other part is maintained only in memory
+
+# Motivation
+
+Since the frequent persistence of `Individual Deleted Messages` will magnify 
the amount of BK Written and increase the latency of ack-response, the Broker 
does not immediately persist it when receiving a consumer's acknowledgment but 
persists it regularly. 
+
+The data of cursor metadata is recommended to be less than 5MB; if a 
subscription's `Individual Deleted Messages` data is too large to persist, as 
the program grows for a long time, there will be more and more non-persistent 
data. Eventually, there will be an unacceptable amount of repeated consumption 
of messages when the Broker restarts.
+
+# Goal
+
+## In Scope
+
+To avoid repeated consumption due to the cursor metadata being too large to 
persist.
+
+## Out of Scope
+
+This proposal will not care about this scenario: if so many producers make the 
metadata of cursor `pulsar.dedup` cannot persist, the task `Take Deduplication 
Snapshot` will be in vain due to the inability to persist.
+
+# High-Level Design
+
+Provide a new config named `dispatcherPauseOnAckPersistentStateEnabled` for a 
new feature: stop dispatch messages to clients when reaching the limitation 
`managedLedgerMaxUnackedRangesToPersist`.
+- If the user does not care about that Individual Deleted Messages can not be 
fully persistent, resulting in a large number of repeated message consumption, 
then it can be set to `false`.
+- If the user cares about repeated consumption, at can accept a decline in 
consumption speed when cursor metadata is too large to persist, it can be set 
to `true`.
+
+
+# Detailed Design
+### Public API
+
+**broker.conf**
+```
+/**
+ * If true, the dispatcher will pause the delivery of messages to clients if 
the cursor metadata(including ack state) reaches 
${managedLedgerMaxAckStateInBytesToPersist}.
+ * The default value is "true".
+ * Note: If you set it to "false," when cursor metadata reaches 
${managedLedgerMaxAckStateInBytesToPersist}, it can not persist, which might 
lead to repeated consumption.

Review Comment:
   Already corrected the description. Thanks



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to