janhoy opened a new pull request, #3940: URL: https://github.com/apache/solr/pull/3940
This test has failed 8% of the time since start of December, see https://develocity.apache.org/scans/tests?search.rootProjectNames=solr-root&search.timeZoneId=Europe%2FOslo&tests.container=org.apache.solr.search.TestQueryLimits&tests.sortField=FLAKY&tests.test=testAdjustShardRequestLimits <img width="1257" height="373" alt="Skjermbilde 2025-12-10 kl 16 24 18" src="https://github.com/user-attachments/assets/ee701285-f5ef-42b1-85c4-90d1e6f496ce" /> My AI buddy believes it fails due to a too low timeAllowed in the initial request: ## Root Cause The test was failing due to a timing race condition in distributed grouped queries: 1. Test Setup: Uses timeAllowed=500ms with sleep=100ms per request 2. Grouped Query Phases: Distributed grouped queries require 2 phases 3. Actual Timing (from failed run logs): - Phase 1: ~100ms (matching sleep) - Phase 2: ~436ms (sleep + query processing, especially term iteration) - Total: ~536ms > 500ms timeout 4. The Issue: The 500ms budget was barely sufficient: - 200ms consumed by sleep operations - Only 300ms margin for query processing, network overhead, and system variability - Phase 2 query processing (term iteration in SolrRangeQuery) is variable and can exceed this tight margin -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
