Re: Intra-cluster communication; too many ways to do it

David Smiley Mon, 18 May 2026 13:58:31 -0700

Protocol/transport agnosticism seems very conceptual and dubious
practicality IMO.


RE async: See HttpSolrClient.requestAsync.  Since Solr 9.x, the API
for requestAsync looks fairly standard, returning a CompletableFuture.  As
nice as it is, I have observed it is *barely* even used as measured by # of
call-sites compared to, say, request(), even though there are places that
would clearly benefit but instead go create ExecutorService just to fan-out
something (e.g. DistribFileStore somewhere).  So it's a bit unmotivating
IMO to bring it elsewhere but it's a chicken-and-egg situation, I guess.
 If you want to attempt to elevate that to SolrClient, it's premature: next
step is adding to CloudSolrClient.  Please be my guest :-)

On Mon, May 18, 2026 at 3:56 PM Gus Heck <[email protected]> wrote:

> hmm, missed this originally.
>
> Is it possible to go further and standardize on sending the request to a
> single, transport agnostic API? Basically, never give out ia SolrClient of
> any type or HttpClient. Code should just say, "Here's a request object,
> make it so," and then some code that "does the request thing" should return
> a cancellable future that the calling code can wait for, cancel or return
> to later as needed. All of those should be easy, I'm very strongly against
> anything that looks like an attempt to force everything async! Just make
> either one easy for the calling code. This would then make it possible to
> consistently use the same protocol, or write an entirely different
> transport (I've toyed with the idea of something based on web sockets for
> example).
>
> We don't need to make it pluggable immediately, but if everything talks to
> the same interface to issue a request, the class handling such calls can be
> made pluggable later.
>
>
>
> On Mon, May 18, 2026 at 9:41 AM David Smiley <[email protected]> wrote:
>
> > Following up: This epic has mostly completed.CoreContainer & ZkController
> > have a default HttpSolrClient & CloudSolrClient respectively.
> > HttpSolrClient can be used to make request to specific URLs now.
> > SolrCloudManager is only used from "cloud" classes, except for one little
> > ugly exception in IndexSchemaFactory.  SolrCloudManager is used less...
> > good or bad is a matter of opinion but the creator of it is no longer
> very
> > active.  SolrClientCache is now only used when solrj-streaming is used.
> > I'd like to see the getter gone from CoreContainer; it's deprecated at
> > least.  HttpShardHandler: no change but it's a fairly separate matter.
> >
> > On Thu, Sep 5, 2024 at 3:09 PM David Smiley <[email protected]> wrote:
> >
> > > There are too many ways to submit an intra-cluster request within Solr.
> > > It’s been gnawing at me so I did an audit and made recommendations:
> > >
> > > There are too many ways to submit an intra-cluster request within Solr.
> > > It’s been gnawing at me so I did an audit and made recommendations:
> > >
> > > A. Creating new Http SolrClient (with or without Cloud) from
> > > org.apache.solr.update.UpdateShardHandler#getDefaultHttpClient — very
> > > popular, 14 usages.
> > > Recommendation: This is a poor place for such a client; will be moved
> > > shortly in SOLR-16503 to CoreContainer and return an Http2SolrClient
> > > instead of HttpClient.
> > >
> > > B. org.apache.solr.client.solrj.cloud.SolrCloudManager#request
> (default
> > > impl is essentially a CloudLegacySolrClient).  3 usages.
> > > I’m suspicious we need this; either embrace or abandon IMO.
> > > Recommendation: embrace but migrate to getCloudSolrClient and add
> > > getHttpSolrClient too, the latter sourced from CoreContainer.  Then
> > allows
> > > requestAsync usage and hopefully will also have a URL-specific method
> for
> > > SOLR-17256
> > > right next to it, there’s an httpRequest method that is now unused.
> > > Recommendation: remove
> > >
> > > C. org.apache.solr.update.UpdateShardHandler#getUpdateOnlyHttpClient
> for
> > > “updates” only.  2 usages
> > >
> > > D. org.apache.solr.update.UpdateShardHandler#getRecoveryOnlyHttpClient
> > for
> > > “recovery” only.  4 usages.
> > >
> > > E. org.apache.solr.client.solrj.io.SolrClientCache wow, 86 usages!!  68
> > > are in solrj-streaming since this is where this thing originated from,
> > thus
> > > leaving 18 usages elsewhere but I’m over-counting.  The unique classes
> > > using (excluding streaming, excluding a benchmark that tests streaming
> > too)
> > > are 6 usages (classes).
> > >
> > > * Some usages are generic to a configurable SolrCloud cluster
> > > (solrj-streaming definitely):
> > >  ReindexCollection and SolrReporter (metrics).
> > > This is the primary purpose/value of SolrClientCache; should continue.
> > > * Some of the other cases just need a cluster-local CloudSolrClient:
> > > SchemaDesigner (2 classes), CollectionsRepairEventListener.
> > > Recommendation: Use SolrCloudManager.
> > > * The other 4 usages are to get an Http SolrClient for a URL.
> > > Recommendation: Use CoreContainer.getDefaultHttpSolrClient moving from
> > > UpdateShardHandler (or similar recommended in SolrCloudManager).
> > Requires
> > > solution to SOLR-17256 like an Http2SolrClient specific request method
> > > specifying an additional URL parameter.
> > >
> > > F. org.apache.solr.handler.component.HttpShardHandler  / ShardHandler.
> > > Originally built for distributed-search, which requires supporting
> > multiple
> > > replicas that are equivalent to serve a request for a shard.  Later a
> > > number of Overseer/cluster commands started using/abusing it.  There
> are
> > > some API tech-debt / issues I see here as a result of this.  Now that
> > > Http2SolrClient has async support (it didn’t when HttpShardHandler was
> > > first created), I wonder if some Overseer usages should use that
> instead.
> > > I think it could be simpler.
> > > Recommendation: Add javadocs warning users to avoid using it outside of
> > > distributed search.  Point out Http2SolrClient.requestAsync.
> > >
> > > As an aside, I want to mention that EmbeddedSolrServer is pretty cool;
> I
> > > could imagine expanding its use within Solr in some cases to avoid an
> > HTTP
> > > layer and disconnected caller chain.  Like to issue an admin call,
> even a
> > > SolrCloud one (e.g. to move a shard.  But it doesn't do collection
> > > routing.  It could be neat if the node-local CloudSolrClient proposed
> > above
> > > could somehow detect such an admin call and route through locally to
> the
> > > node.
> > >
> > > ~ David Smiley
> > > Apache Lucene/Solr Search Developer
> > > http://www.linkedin.com/in/davidwsmiley
> > >
> >
>
>
> --
> http://www.needhamsoftware.com (work)
> https://a.co/d/b2sZLD9 (my fantasy fiction book)
>

Re: Intra-cluster communication; too many ways to do it

Reply via email to