https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7579
--- Comment #5 from Giovanni Bechis <[email protected]> --- (In reply to John Hardin from comment #1) > That a PDF has a URI (clickable or not) doesn't seem a terribly useful datum > in isolation. I'd suggest it would be _much_ more useful to extract the URIs > and add them to the pool that feeds uri rules and URIBL checks. > > Even better if heuristics similar to what's used for body text would pull > non-clickable URIs out of the PDF text, but doing that might best be > controlled by a config option. Looking at my spam collection, a pdf named Invoice.pdf with a clickable uri is very probably spam. Anyway I am looking at extracting URIs from attachments and adding them to the pool of uris to be checked. -- You are receiving this mail because: You are the assignee for the bug.
