The GitHub Actions job "Java CI with Maven" on stormcrawler.git/main has 
succeeded.
Run started by GitHub user rzo1 (triggered by rzo1).

Head commit for run:
3d55b2bdd09578b005cf59e248bb54b426e60ce3 / Richard Zowalla 
<[email protected]>
feat(playwright): add JsRenderingDetector parse filter & bolt (#1898)

* feat(playwright): add JsRenderingDetector parse filter

Heuristically flags URLs whose content looks JavaScript-rendered by
inspecting SPA framework fingerprints, noscript blocks, empty hydration
roots, and a thin-content fallback. Sets a routing metadata key so
DelegatorProtocol can dispatch subsequent fetches to Playwright while
the bulk of the crawl stays on a cheap HTTP client.

* feat(playwright): add JsRenderingRedirectionBolt and free-form match list

Pairs JsRenderingDetector with a bolt that reads the routing flag and
emits to StatusStreamName for an immediate refetch through Playwright,
suppressing the cheap fetch's stub from the index. Adds a requiredMessages
parameter to the detector for free-form JS-required / loader / cookie
prompts that don't fit the noscript pattern.

* style(playwright): apply google-java-format to detector and bolt

Report URL: https://github.com/apache/stormcrawler/actions/runs/25670030597

With regards,
GitHub Actions via GitBox

Reply via email to