klevy-toast opened a new pull request #14125:
URL: https://github.com/apache/pulsar/pull/14125
<!--
### Contribution Checklist
- Name the pull request in the form "[Issue XYZ][component] Title of the
pull request", where *XYZ* should be replaced by the actual issue number.
Skip *Issue XYZ* if there is no associated github issue for this pull
request.
Skip *component* if you are unsure about which is the best component.
E.g. `[docs] Fix typo in produce method`.
- Fill out the template below to describe the changes contributed by the
pull request. That will give reviewers the context they need to do the review.
- Each pull request should address only one issue, not mix up code from
multiple issues.
- Each commit in the pull request has a meaningful commit message
- Once all items of the checklist are addressed, remove the above text and
this checklist, leaving only the filled out template below.
**(The sections below can be removed for hotfixes of typos)**
-->
Fixes #11100
### Motivation
We believe that the use of `scheduleAtFixedRate` in the java producer's
batch timer can result in unnecessarily high thread usage, which can become
especially problematic for applications that start many producers.
### Modifications
Replaced the use of `scheduleAtFixedRate` with `scheduleWithFixedDelay`,
which is the same behavior as previously in 2.6.x. The producer's parameter
`batchingMaxPublishDelay` implies the use of the "delay" method instead of
"rate" method as well.
### Verifying this change
- [ ] Make sure that the change passes the CI checks.
This change is already covered by existing tests, such as existing pulsar
client producer tests.
Testing of the performance regression can be demonstrated by using
[this](https://github.com/klevy-toast/dropwizard-pulsar-test) artifact and
comparing a recent release of pulsar client with a manually built SNAPSHOT
version with this change:
#### Version 2.7.1 CPU & thread behavior
- While sending messages
<img width="1632" alt="image"
src="https://user-images.githubusercontent.com/42187013/152588959-8ee4beb9-70f3-4ad8-9132-240d4498dda5.png">
- While running idle producers
<img width="1613" alt="image"
src="https://user-images.githubusercontent.com/42187013/152589079-b45fce49-757a-4bfd-8ddd-c438774ecf41.png">
- 30 second profile while sending messages
<img width="1295" alt="image"
src="https://user-images.githubusercontent.com/42187013/152589222-54732bf3-44d7-40b8-8c6b-03b54ba01090.png">
#### Version 2.10.0-SNAPSHOT CPU & thread behavior
- While sending messages
<img width="1615" alt="image"
src="https://user-images.githubusercontent.com/42187013/152589391-ae243e7a-5f1f-40b7-a77c-7e3d12a84c8e.png">
- While running idle producers
<img width="1603" alt="image"
src="https://user-images.githubusercontent.com/42187013/152589436-784d9c56-043e-41fa-95e8-6a721e0adc78.png">
- 30 second profile while sending messages
<img width="1289" alt="image"
src="https://user-images.githubusercontent.com/42187013/152589619-f274545d-b9f9-48e8-8b02-e226c6dec59e.png">
These samples show fewer threads running with this change compared to 2.7.1,
less time spend in `batchMessageAndSend`, and overall lower CPU usage -- note
that this testing was done on a desktop machine, and we have observed, along
with [other
users](https://github.com/apache/pulsar/issues/11100#issuecomment-1007487433)
that this CPU regression can be much worse with more producers, smaller batch
intervals, and on deployed cloud applications.
### Does this pull request potentially affect one of the following parts:
*If `yes` was chosen, please highlight the changes*
- Dependencies (does it add or upgrade a dependency): (no)
- The public API: (no)
- The schema: (no)
- The default values of configurations: (no)
- The wire protocol: (no)
- The rest endpoints: (no)
- The admin cli options: (no)
- Anything that affects deployment: (no)
### Documentation
Check the box below or label this PR directly (if you have committer
privilege).
Need to update docs?
- [x] `no-need-doc`
The general behavior of the batch timer feature should not be changing
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]