Its there for years now, but was rarly used.

Thomas





Von:    K Post <nntp.p...@gmail.com>
An:     ASSP development mailing list <assp-test@lists.sourceforge.net>
Datum:  02.09.2017 02:00
Betreff:        Re: [Assp-test] fixes in assp 2.5.6 *Fortess* build 17244



What's the purpose of the codename variable?

On Fri, Sep 1, 2017 at 12:19 PM, Thomas Eckardt <
thomas.ecka...@thockar.com> wrote:
Hi all, 

fixed in assp 2.5.6 *Fortress* build 17244: 

- a domain name in an URL that starts with number and dash like "2017-", 
was wrong detected as strong obfuscated IP address 

changed: 

- The over a year existing hidden configuration parameter 'HTMLParser' is 
now changed to a regular configuration parameter. 

**** ATTENTION **** 
 If you still use this hidden parameter, remove any related code from the 
startup script or CorrectASSPcfg.pm BEFORE you upgrade to this version! 
 Configure the parameter after the first start in the GUI! 
******************* 

 'HTMLParser','Use this HTML 
Parser','0:buildin|1:HTML::Strip|2:HTML::TreeBuilder', 
  
  Commonly HTML/XML is used in emails. The HTML/XML tags are too variable 
to use them for Bayesian- and Hidden Markov Model analysis. 
  For this reason, these tags are removed from the HTML/XML content to get 
the clean text of the email. 
  The assp buildin regular expression HTML-parser is now used for decades. 
It got large improvements over the time, how ever - the correctness is 
only 95%. 
  But assp is able to use HTML::Strip or HTML::TreeBuilder, which are 
powerfull perl modules to parse HTML code nearly 100% correct. 
  HTML::Strip and HTML::TreeBuilder are getting there best result, if the 
full HTML code is provided. In case you select any of the both modules, 
  it is recommended to set MaxBytes to 50000 (be carefull on heavy load 
systems - spam bomb regular expressions will take longer using 50000!). 
  HTML::Strip is the fastest module and the default setting, because it is 
written in C. If you can not install it, use the buildin or 
HTML::TreeBuilder. 
  HTML::TreeBuilder is the slowest way to parse HTML code, the assp 
buildin processing is three times faster, HTML::Strip is five times faster 
than HTML::TreeBuilder. 
  If you select any of the perl modules and this module is not installed, 
fails to load or it returns no content, assp falls back to the buildin 
code. 
  
 **** Switching from the buildin HTML parser to HTML::Strip, a 10% faster 
rebuildspamdb task is expected (for MaxBytes = 50000). 
 **** HTML::Strip improves the word processing for Bayesian and HMM, 
because of a much better language detection in the word stemming engine. 
 **** SpamBombs will work more accurate, if HTML::Strip is used 

  To provide any of the perl modules HTML::Strip and HTML::TreeBuilder you 
need to install them using PPM or CPAN. 
  The mod_inst.pl and mod_inst_ocr.pl got an update to version 2.03 to 
install both modules. 
  
  ASSP_AFC.pm version 4.60 is available. It got an improvement by an 40 
seconds timeout watchdog ($ASSP_AFC::maxProcessTime), to prevent stucking 
workers. 
  
Thomas


DISCLAIMER:
*******************************************************
This email and any files transmitted with it may be confidential, legally 
privileged and protected in law and are intended solely for the use of the 

individual to whom it is addressed.
This email was multiple times scanned for viruses. There should be no 
known virus in this email!
*******************************************************


------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Assp-test mailing list
Assp-test@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/assp-test

------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Assp-test mailing list
Assp-test@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/assp-test




DISCLAIMER:
*******************************************************
This email and any files transmitted with it may be confidential, legally 
privileged and protected in law and are intended solely for the use of the 

individual to whom it is addressed.
This email was multiple times scanned for viruses. There should be no 
known virus in this email!
*******************************************************

------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Assp-test mailing list
Assp-test@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/assp-test

Reply via email to