Branch: refs/heads/main
  Home:   https://github.com/WebKit/WebKit
  Commit: 9883ba9167830378e0913fd31dfcd48a2480307d
      
https://github.com/WebKit/WebKit/commit/9883ba9167830378e0913fd31dfcd48a2480307d
  Author: Wenson Hsieh <[email protected]>
  Date:   2026-02-13 (Fri, 13 Feb 2026)

  Changed paths:
    A LayoutTests/fast/text-extraction/debug-text-extraction-sanitization-expected.txt
    A LayoutTests/fast/text-extraction/debug-text-extraction-sanitization.html
    M Source/WebKit/Shared/TextExtractionToStringConversion.cpp

  Log Message:
  -----------
  [AutoFill Debugging] Add more filtering rules for debug text extraction
https://bugs.webkit.org/show_bug.cgi?id=307741
rdar://170228978

Reviewed by Abrar Rahman Protyasha.

Add a few more post-processing steps to further streamline debug text 
extraction:
- Remove zero-width joiners and other non-rendered Unicode characters.
- Remove identical and repetitive lines.
- When extracting markdown, remove empty list items.
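
The three steps above can be illustrated with a minimal standalone sketch. This is not the code from TextExtractionToStringConversion.cpp: the helper names (stripZeroWidth, isEmptyListItem, sanitizeLines), the list of filtered code points, and the reading of "identical and repetitive lines" as consecutive duplicates are all assumptions made for illustration, and the sketch uses std::string with UTF-8 input rather than WebKit's own string types.

```cpp
#include <array>
#include <cstddef>
#include <string>
#include <string_view>
#include <utility>
#include <vector>

// Hypothetical helper: strips a few zero-width code points from UTF-8 text.
// The set below is an assumption, not the set the actual patch filters.
std::string stripZeroWidth(std::string_view text)
{
    static constexpr std::array<std::string_view, 5> zeroWidth = {
        "\xE2\x80\x8B", // U+200B ZERO WIDTH SPACE
        "\xE2\x80\x8C", // U+200C ZERO WIDTH NON-JOINER
        "\xE2\x80\x8D", // U+200D ZERO WIDTH JOINER
        "\xE2\x81\xA0", // U+2060 WORD JOINER
        "\xEF\xBB\xBF", // U+FEFF ZERO WIDTH NO-BREAK SPACE
    };

    std::string result;
    result.reserve(text.size());
    for (std::size_t i = 0; i < text.size();) {
        bool skipped = false;
        for (auto sequence : zeroWidth) {
            if (text.substr(i, sequence.size()) == sequence) {
                i += sequence.size();
                skipped = true;
                break;
            }
        }
        if (!skipped)
            result.push_back(text[i++]);
    }
    return result;
}

// Hypothetical helper: true for a line holding only a list marker,
// e.g. "-", "  * ", "3." or "12)".
bool isEmptyListItem(std::string_view line)
{
    auto first = line.find_first_not_of(" \t");
    if (first == std::string_view::npos)
        return false;
    auto last = line.find_last_not_of(" \t");
    auto marker = line.substr(first, last - first + 1);

    if (marker == "-" || marker == "*" || marker == "+")
        return true;

    // Ordered markers: one or more digits followed by '.' or ')'.
    if (marker.size() < 2)
        return false;
    char terminator = marker.back();
    if (terminator != '.' && terminator != ')')
        return false;
    auto digits = marker.substr(0, marker.size() - 1);
    return digits.find_first_not_of("0123456789") == std::string_view::npos;
}

// Applies the three steps in order: strip zero-width characters, drop empty
// list items, then drop a line identical to the previously kept line.
std::vector<std::string> sanitizeLines(const std::vector<std::string>& lines)
{
    std::vector<std::string> result;
    for (const auto& line : lines) {
        auto cleaned = stripZeroWidth(line);
        if (isEmptyListItem(cleaned))
            continue;
        if (!result.empty() && result.back() == cleaned)
            continue;
        result.push_back(std::move(cleaned));
    }
    return result;
}
```

In this sketch the zero-width characters are stripped before the comparison, so two lines that differ only by an invisible character collapse into one.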

Test: fast/text-extraction/debug-text-extraction-sanitization.html

* LayoutTests/fast/text-extraction/debug-text-extraction-sanitization-expected.txt: Added.
* LayoutTests/fast/text-extraction/debug-text-extraction-sanitization.html: Added.
* Source/WebKit/Shared/TextExtractionToStringConversion.cpp:
(WebKit::removeZeroWidthCharacters):
(WebKit::isEmptyMarkdownListItem):
(WebKit::TextExtractionAggregator::takeResults):
(WebKit::addJSONTextContent):

Canonical link: https://commits.webkit.org/307470@main


