Re: [PATCH 5 of 6] Upstream: allow any worker to resolve upstream servers

Aleksei Bavshin Thu, 09 Feb 2023 08:45:21 -0800

On 2/5/2023 7:01 PM, J Carter wrote:

Hi Aleksei,
Why not permanently assign the task of resolving a given upstream servergroup (all servers/peers within it) to a single worker?
It seems that this approach would resolve the SRV issues, and remove theneed for the shared queue of tasks.
The load would still be spread evenly for the most realistic scenarios -which is where there are many upstream server groups of few servers, asopposed to few upstream server groups of many servers.

The intent of the change was exactly opposite, to avoid any permanentassignment of periodic tasks to a worker and allow another processes toresume resolving if the original assignee exits, no matter if normallyor abnormally. I'm not even doing enough for that -- I should've keptin-progress tasks at the end of the queue with expires = resolvertimeout + a small constant, and retry from another process when thetimeout is reached, but the idea was abandoned for a minusculeimprovement of insertion time. I expect to be asked to reconsider, aspatch 6/6 does not cover all the possible situations where we want torecover a stale task.

A permanent assignment of a whole upstream would also require notifyinganother processes that the upstream is no longer assigned if the workerexits or consistently recovering that assignment over a restart ofsingle worker (e.g. after a crash - not a regular situation, but one weshould take into account nonetheless). And the benefit is not quiteobvious - I mentioned that resolving SRVs with a lot of records may takelonger to update the list of peers, but the situation with contention isnot expected to change significantly* if we pin these tasks to a singleworker as another worker may be doing the same for another upstream.Most importantly, this isn't even a bottleneck. It only slightlyexacerbates an existing problem with certain balancers that alreadysuffer from the overuse of locks, in a configuration that wasspecifically crafted to amplify and highlight the difference and is farfrom these most realistic scenarios.


* Pending verification on a performance test stand.
_______________________________________________
nginx-devel mailing list
nginx-devel@nginx.org
https://mailman.nginx.org/mailman/listinfo/nginx-devel

Re: [PATCH 5 of 6] Upstream: allow any worker to resolve upstream servers

Reply via email to