https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7579

--- Comment #5 from Giovanni Bechis <[email protected]> ---
(In reply to John Hardin from comment #1)
> That a PDF has a URI (clickable or not) doesn't seem a terribly useful datum
> in isolation. I'd suggest it would be _much_ more useful to extract the URIs
> and add them to the pool that feeds uri rules and URIBL checks.
> 
> Even better if heuristics similar to what's used for body text would pull
> non-clickable URIs out of the PDF text, but doing that might best be
> controlled by a config option.

Looking at my spam collection, a PDF named Invoice.pdf with a clickable URI is
very probably spam.
Anyway, I am looking at extracting URIs from attachments and adding them to
the pool of URIs to be checked.
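
As a rough illustration of that extraction step (not SpamAssassin code, and
written in Python rather than Perl purely for brevity), the sketch below pulls
clickable link targets out of raw PDF bytes and merges them into a URI pool.
The names extract_pdf_uris and add_to_uri_pool are hypothetical. It assumes the
link annotations are stored uncompressed as literal strings, e.g.
"/URI (http://...)"; URIs inside compressed object streams, hex strings, or
strings with unescaped nested parentheses would need a real PDF parser.

```python
import re

# Matches a PDF URI action such as "/URI (http://example.com)".
# Assumption: the target is a literal string in an uncompressed object.
_URI_ACTION = re.compile(rb"/URI\s*\(((?:\\.|[^\\()])*)\)")


def extract_pdf_uris(pdf_bytes):
    """Return clickable URIs found in uncompressed link annotations."""
    uris = []
    for match in _URI_ACTION.finditer(pdf_bytes):
        raw = match.group(1)
        # Undo the backslash escapes PDF uses for parentheses and backslashes.
        raw = re.sub(rb"\\([()\\])", rb"\1", raw)
        uri = raw.decode("latin-1")
        if uri not in uris:
            uris.append(uri)
    return uris


def add_to_uri_pool(pool, pdf_bytes):
    """Merge URIs from one attachment into the set of URIs to be checked."""
    pool.update(extract_pdf_uris(pdf_bytes))
    return pool


if __name__ == "__main__":
    sample = (
        b"<< /Type /Annot /Subtype /Link "
        b"/A << /S /URI /URI (http://example.com/invoice) >> >>"
    )
    print(add_to_uri_pool({"http://example.org/body-link"}, sample))
```

Keeping extraction separate from the pool update means the same list could
feed both a "PDF has a clickable URI" rule and the normal URI/URIBL checks.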
