Stevan Bajić wrote: >> Is this list correct? Is there anything I missed? >> >> Content-Type: >> text/plain >> text/html (stripped of html) >> message/* >> unknown parts >> >> Content-Type-Encoding: >> 7bit >> 8bit >> quoted-printable >> base64 >> > Yes. You miss the point that any word longer then 50 characters will NOT be > tokenized. Most data from attachments fall into that category and will not be > tokenized.
Does DSPAM consider the removal of an HTML tag as a word break? Thanks, Ed -- Ed Szynaka Network/Systems Manager LocalNet Corp./CoreComm Internet Services ------------------------------------------------------------------------------ This SF.net email is sponsored by Make an app they can't live without Enter the BlackBerry Developer Challenge http://p.sf.net/sfu/RIM-dev2dev _______________________________________________ Dspam-user mailing list Dspam-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspam-user