> http://en.wikipedia.org/wiki/Locality-sensitive_hashing
 
> corpus, now, the above may represent a solution, in such a case, the
> rebuild may just "skip" messages if they're repeated too often, so
> allowing to keep the corpus balanced
 
> also say that the email was recognized as "spam" (or ham, whatever);
> now, a second incoming mail hitting the same "LSH bucket" as the first

Also, there's is an opensource library implementing LSH

http://lshkit.sourceforge.net/

the library is written in C++ and can be built for windows, linux and
BSD; this means that, implementing LSH in ASSP may just mean putting
together some Perl "glue" code leveraging the LSH library calls :) !

------------------------------------------------------------------------------
Try before you buy = See our experts in action!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-dev2
_______________________________________________
Assp-test mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/assp-test

Reply via email to