http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5780
------- Additional Comments From [EMAIL PROTECTED]  2008-01-17 20:00 -------
Since the original problem was parsing putative URIs from plain garbage, how
about some sanity checks around the extraction to see if the stuff nearby even
seems like words? If there aren't any spaces in the line you pulled a "uri" out
of the middle of, then your guess at a URI is probably wrong, for instance.
English tends to average around 5 letters per whitespace, German rather longer.
Other checks might be a fuzzy match for "copy" or "paste" or "browser"
somewhere near the putative URI.

------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
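
To make the suggested checks concrete, here is a rough Python sketch. It is not SpamAssassin code. The function names, the 12-letter average word-length ceiling, and the 0.8 fuzzy-match cutoff are all assumptions picked for illustration; only the three hint words and the "no spaces means probably not a URI" rule come from the comment above.

```python
import difflib
import re

# Hint words named in the comment; everything else here is illustrative.
HINT_WORDS = ("copy", "paste", "browser")


def looks_like_prose(text, max_avg_word_len=12.0):
    """True if the whitespace-separated tokens have a plausible average length.

    English averages roughly 5 letters per word and German somewhat more,
    so the default ceiling of 12 is a deliberately generous assumption.
    """
    words = text.split()
    if not words:
        return False
    return sum(len(w) for w in words) / len(words) <= max_avg_word_len


def has_hint_word(text, cutoff=0.8):
    """True if any token fuzzily matches "copy", "paste" or "browser"."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return any(
        difflib.get_close_matches(hint, tokens, n=1, cutoff=cutoff)
        for hint in HINT_WORDS
    )


def plausible_uri_context(line, uri):
    """Decide whether the text around a candidate URI looks like real words."""
    # No whitespace anywhere in the line: the "uri" was probably cut out of
    # the middle of a blob of garbage.
    if not any(ch.isspace() for ch in line.strip()):
        return False
    # Judge only the surrounding text, not the candidate URI itself.
    context = line.replace(uri, " ")
    return looks_like_prose(context) or has_hint_word(context)
```

A caller would run plausible_uri_context() on each candidate and discard the ones that return False. Both thresholds would need tuning against real mail before being trusted.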
