PSA: potential new source of intermittent test failures - and how to work around it

Jonathan Kew Tue, 05 Jan 2021 11:32:49 -0800

Do you write Gecko/Firefox patches or testcases, or monitor the Mozillatrees?

If so, you may run into new intermittent test failures due to a recent(intentional) behavior change.

On 2020-12-31, a patch landed inhttps://bugzilla.mozilla.org/show_bug.cgi?id=1676966 that changed howfont fallback is handled. Previously, if the fonts specified in a page'sCSS or through the browser prefs did not support a character present inthe text, Gecko would potentially search all installed fonts to try andfind one that could render the character. The first time this happens,it can be quite expensive, as it involves loading data from everyinstalled font file, of which there may be thousands. Result: unpleasantjank.

To avoid this performance issue, we no longer block layout on theexhaustive search of all the fonts; instead, we start a background taskto load the required character mappings from all the fonts, but proceedwith layout using whatever fallbacks we may find, or just missing-glyphboxes. Once the font data is all loaded, we trigger a reflow everywhereso that content will be refreshed using the proper fonts.

Why does this matter for tests? It may result in two main types offailure in tests that are otherwise fine:

(1) If the test includes content -- such as text in a lesser-usedUnicode script or unusual symbols -- that depends on font fallback, itmay render with a different fallback font or not render at all duringthe initial pageload/reflow, if all the necessary font data has not yetbeen loaded. The rendering will be automatically corrected once theasync font loading completes, but if the reftest harness has alreadytaken a snapshot by that time, it may be too late, and the test fails.

(2) If async font data loading was triggered by the testcase, or by oneshortly preceding it, an "unexpected" extra reflow event will happenwhen the loading completes. This can interfere with tests that arespecifically concerned with event handling and expect aprecisely-defined pattern of behavior, or are watching things like framedimensions for changes.

Because the font fallback behavior is asynchronous (and the actual workhappens in the parent process, while your testcase is usually runningindependently in a content process), the timing of all this cannot beaccurately predicted, and failures may be intermittent.

(Note also that this async behavior only happens once per browsersession, the first time content triggers a global font search. Thismeans that which testcases are affected may depend on the chunking oftest suites, and could change over time.)

If you have tests that are impacted by this, you can disable the asyncbehavior -- reverting to the previous behavior where global fontfallback, if needed, will block layout -- for them by setting the pref'gfx.font_rendering.fallback.async' to false via a test manifestannotation or similar metadata.

We could simply run all tests with the pref set to false, to avoid theseissues, but I'd prefer not to do that as we then wouldn't be testing theconfiguration we ship to users. So let's try to handle this byselectively disabling the new behavior only in cases where we see itcausing actual problems. Thanks!


JK

_______________________________________________
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform

PSA: potential new source of intermittent test failures - and how to work around it

Reply via email to