[
https://issues.apache.org/jira/browse/NIFI-15519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
David Handermann updated NIFI-15519:
------------------------------------
Fix Version/s: 2.8.0
Resolution: Fixed
Status: Resolved (was: Patch Available)
> Non-deterministic processor running state on Nifi startup
> ---------------------------------------------------------
>
> Key: NIFI-15519
> URL: https://issues.apache.org/jira/browse/NIFI-15519
> Project: Apache NiFi
> Issue Type: Bug
> Components: Core Framework
> Affects Versions: 2.6.0, 2.7.0, 2.7.1, 2.7.2
> Reporter: Philipp Freyer
> Assignee: Pierre Villard
> Priority: Major
> Fix For: 2.8.0
>
> Attachments: anonymized.nifi-app_2026-01-27_22.0-1.log
>
> Time Spent: 20m
> Remaining Estimate: 0h
>
> h2. Issue:
> All processors are either started or stopped after a Nifi instance restart
> when they should always be started.
> This is not deterministic behavior and the logs do not show any noticeable
> indication on why this is.
> h2. Background:
> We are running an Instance of Apache Nifi for years now with rather complex
> flows on it. These flows were working rock-solid for this time, even when
> restarting the Nifi instance and even through Nifi upgrades all the way from
> 1.X to the current Nifi version.
> But since the time we activated Github versioning (GitHubFlowRegistryClient
> 2.6.0), the processors on this instance are not starting reliably after Nifi
> instance startup. Only around 25% of the times the instances start up we see
> running processors on the instance. In about 75% of the time all processors
> on the instance are stopped.
> We use GitHub versioning extensively on the Nifi isntance with the root
> process group containing one (versioned) process group that then contains all
> flow logic. Since the versioned file became too big to restore with default
> Nifi timeout settings, we multiplied these timeouts by 10 (in
> nifi.properties) and also introduced versioned sub-process groups to split
> the files exported to GitHub.
> The log files do not show any log output indicating any issues related to
> processor startup (as far as we could see).
> What we do see is errors loading the versioned process group info for a
> while, when the registry client is not ready yet - these errors get resolved
> as soon as the registry client is initialized properly, though (and they
> occur every startup):
> {code:java}
> 2026-01-27 22:18:44,248 ERROR [Framework Task Thread-1]
> o.a.nifi.groups.StandardProcessGroup Failed to synchronize
> StandardProcessGroup[identifier=6134e32f-f850-3427-76ec-bad51ec7dd01,name=mycompany
> Group AG] with Flow Registry because could not retrieve version
> e9fd9e94f7dd2e0e26371ca2e3d26ef2bdfcd07e of flow with identifier
> MycompanyGroupAg in bucket default
> org.apache.nifi.registry.flow.FlowRegistryException:
> GitHubFlowRegistryClient[id=a0e83df6-019a-1000-47c4-c03b90820d7f] cannot
> currently be used to synchronize with Flow Registry because it is currently
> validating
> at
> org.apache.nifi.groups.StandardProcessGroup.synchronizeWithFlowRegistry(StandardProcessGroup.java:3752)
> at
> org.apache.nifi.groups.StandardProcessGroup.lambda$setVersionControlInformation$27(StandardProcessGroup.java:3591)
> at
> org.apache.nifi.controller.scheduling.StandardProcessScheduler.lambda$wrapTask$1(StandardProcessScheduler.java:189)
> at java.base/java.lang.VirtualThread.run(VirtualThread.java:329){code}
> A log showing the instance shutdown (for backup by copying the entire
> Nifi file system) and restart with stopped processors after startup is
> attached.
> We will gladly discuss details (or a testing session) in Slack, if that helps
> resolving this issue.
> We wil also gladly provide more information, if possible.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)