We run 'antispamseeder.exe' daily on messages designated as spam to extract URLs for the 'url-domain-bl.txt' file. As part of the extraction process, 'antispamseeder.exe' generates non-compliant URL 'fragments' (for lack of a better word). A sampling of the fragments from our 'url-domain-bl.txt' file are (there are many, many more):
_fatacmt.yscdpe _ude.eope _hdiecyr.rwrxpe _carenth.'re _avorik.ofrure _abel.exclaiming.adulate _vince.compete _lfira.nenhte _kfii.oite' _nf.bmue _loope.ulgkewe _hsabsp.triiwe _vanrdo.isbuxe _eddvaio.nize _ivuheqb.byrze _what're.org _they're.org _com-click.comph _academia.%ng_url _theoretic.webm _leftrightaboutturn..com _didn't.com _couldn't.com _leadbullet..net _�x�w�Ыκ���.tw _�x�w�Цa��.tw NOTE: underscore prefixed to each fragment to prevent them from getting trapped by the antispam engine. Since 'antispamseeder.exe' extracted these non-compliant URL 'fragments', we kept them in the 'url-domain-bl.txt' file thinking that the antispam engine would still match on them. Does anybody know if that is true? Will they cause any problems if left in the file? Or, should they be deleted from the 'url-domain-bl.txt' file? How does everyone else handle these 'fragments'? Just wondering... Thanks in advance, Kim San Diego, CA To Unsubscribe: http://www.ipswitch.com/support/mailing-lists.html List Archive: http://www.mail-archive.com/imail_forum%40list.ipswitch.com/ Knowledge Base/FAQ: http://www.ipswitch.com/support/IMail/
