Hi all,

TL;DR

Does anyone know what's the responsibility of
DistributedUpdateProcessorFactory?
This is what I found in the source code:

// NOT mt-safe... create a new processor for each add thread
// TODO: we really should not wait for distrib after local? unless a
certain replication factor is
// asked for

https://github.com/apache/solr/blob/main/solr/core/src/java/org/apache/solr/update/processor/DistributedUpdateProcessor.java

---
Long version

Today I realized that in SolrCloud when disabling the schemaless mode, we
are going to disable also:

- LogUpdateProcessorFactory,
- DistributedUpdateProcessorFactory,
- RunUpdateProcessorFactory

Please look at this snipped taken from solrconfig.xml default configuration:

  <updateRequestProcessorChain name="add-unknown-fields-to-the-schema"
default="${update.autoCreateFields:false}"

 
processor="uuid,remove-blank,field-name-mutating,parse-boolean,parse-long,parse-double,parse-date,add-schema-fields">
    <processor class="solr.LogUpdateProcessorFactory"/>
    <processor class="solr.DistributedUpdateProcessorFactory"/>
    <processor class="solr.RunUpdateProcessorFactory"/>
  </updateRequestProcessorChain>

Those processors don't seem to be related to the schemaless mode, in
particular DistributedUpdateProcessorFactory.

So I'm curious to understand what it does, and why it has to be activated
when schemaless mode is on.

Best regards,
Vincenzo


-- 
Vincenzo D'Amore

Reply via email to