[jira] [Commented] (SOLR-15146) Distribute Collection API command execution

David Smiley (Jira) Mon, 01 Aug 2022 21:45:05 -0700


    [ 
https://issues.apache.org/jira/browse/SOLR-15146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17574033#comment-17574033
 ]


David Smiley commented on SOLR-15146:
-------------------------------------

I'm thinking of the ConfigSet processing in particular and it seems 
particularly pointless that it ever ran via the Overseer at all.  Was it for 
some sort of atomic locking so that no configSet processing happens 
concurrently perhaps?  There is lots of code involved in translating API 
endpoints to these Map/JSON messages that could disappear if we simply forget 
about serializing this to a queue.  Supporting both modes retains this... 
"debt" in my mind.

> Distribute Collection API command execution
> -------------------------------------------
>
>                 Key: SOLR-15146
>                 URL: https://issues.apache.org/jira/browse/SOLR-15146
>             Project: Solr
>          Issue Type: Sub-task
>          Components: SolrCloud
>    Affects Versions: 9.0
>            Reporter: Ilan Ginzburg
>            Assignee: Ilan Ginzburg
>            Priority: Major
>              Labels: collection-api, overseer
>          Time Spent: 40m
>  Remaining Estimate: 0h
>
> Building on the distributed cluster state update changes (SOLR-14928), this 
> ticket will distribute the Collection API so that commands can execute on any 
> node (i.e. the node handling the request through {{CollectionsHandler}}) 
> without having to go through a Zookeeper queue and the Overseer.
>  This is the second step (first was SOLR-14928) after which the Overseer 
> could be removed (but the code keeps existing execution options so completion 
> by no means Overseer is gone, but it could be removed in a future release).
> -There is a dependency on the distributed cluster state changes because the 
> Overseer locking protecting same collection (or same shard) Collection API 
> commands from executing concurrently will be replaced by optimistic locking 
> of the collection {{state.json}} znodes (or other znodes that will eventually 
> replace/augment {{state.json}}).-
> The goal of this ticket is threefold:
>  * Simplify the code (running synchronously and not going through the 
> Zookeeper queues and the Overseer dequeue logic is much simpler),
>  * Lead to improved performance for most/all use cases (although this is a 
> secondary goal, as long as performance is not degraded) and
>  * Allow a future change (in another future Jira) to the way cluster state is 
> cached on the nodes of the cluster (keep less information, be less dependent 
> on Zookeeper watches, do not care about collections not present on the node). 
> This future work will aim to significantly increase the scale (amount of 
> collections) supported by SolrCloud.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (SOLR-15146) Distribute Collection API command execution

Reply via email to