Gabriella Lotz has posted comments on this change. ( http://gerrit.cloudera.org:8080/23548 )
Change subject: Enable REST API endpoints in webserver crawl test ...................................................................... Patch Set 3: (2 comments) http://gerrit.cloudera.org:8080/#/c/23548/2//COMMIT_MSG Commit Message: http://gerrit.cloudera.org:8080/#/c/23548/2//COMMIT_MSG@12 PS2, Line 12: Also added exclusion logic for large JavaScript bundles (like : swagger-ui-bundle.js) to prevent test failures. The HTML parser : (gumbo-query) incorrectly treats minified JavaScript as parseable : HTML conten > How much bloat is this? About 1.4MiByte? AFAIK, we do have 'heavy' pages a I need to correct my initial description, this isn't about bloat, but rather a bug fix. The test fails without the exclusion because the HTML parser incorrectly extracts JavaScript code fragments as malformed URLs, causing curl errors. The file is ~1.5 MiB, and runtime impact is minimal (~1-2s). The JS file is still validated via HTTP fetch, we simply skip parsing it as HTML. I've updated both the code comment and commit message to reflect the actual issue. http://gerrit.cloudera.org:8080/#/c/23548/2/src/kudu/integration-tests/webserver-crawl-itest.cc File src/kudu/integration-tests/webserver-crawl-itest.cc: http://gerrit.cloudera.org:8080/#/c/23548/2/src/kudu/integration-tests/webserver-crawl-itest.cc@224 PS2, Line 224: causing test failures when curl > This seems to be dependent on https://gerrit.cloudera.org/c/23224 That commit has been merged since, shifting focus to this patch again. -- To view, visit http://gerrit.cloudera.org:8080/23548 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-Project: kudu Gerrit-Branch: master Gerrit-MessageType: comment Gerrit-Change-Id: I5be128536d4f104e2d3bfa55c02b81e6f114d5fc Gerrit-Change-Number: 23548 Gerrit-PatchSet: 3 Gerrit-Owner: Gabriella Lotz <[email protected]> Gerrit-Reviewer: Alexey Serbin <[email protected]> Gerrit-Reviewer: Ashwani Raina <[email protected]> Gerrit-Reviewer: Gabriella Lotz <[email protected]> Gerrit-Reviewer: Kudu Jenkins (120) Gerrit-Reviewer: Marton Greber <[email protected]> Gerrit-Comment-Date: Thu, 06 Nov 2025 13:28:32 +0000 Gerrit-HasComments: Yes
