Good catch. I found dspam script under /etc/cron.daily which was pointing to the old place. I fixed that. That should take care of it.
Thanks a lot. On Thu, Jan 28, 2010 at 7:14 PM, Stevan Bajić <[email protected]> wrote: > On Thu, 28 Jan 2010 19:05:13 -0500 > Roman Gelfand <[email protected]> wrote: > >> I don't think I have sent you the following message I found in syslog. >> Actually, this message appears in spurts of may be 50 lines. I am >> not sure why dspam is looking for hash tables. >> >> mail dspam[2272]: hash table >> /usr/local/var/dspam/data/[email protected]/[email protected] full >> > Could it be that you use the cron job from contrib? If so, then you should > use the one currently available in GIT HEAD. > >> On Thu, Jan 28, 2010 at 6:55 PM, Stevan Bajić <[email protected]> wrote: >> > On Thu, 28 Jan 2010 18:26:15 -0500 >> > Roman Gelfand <[email protected]> wrote: >> > >> >> I haven't really dealt with the utilities much. When you say drop the >> >> old data, do you mean physically go into the db delete data on all >> >> dspam tables or use a utility? If use a utility, which one? >> >> >> > TRUNCATE `dspam_signature_data`; >> > TRUNCATE `dspam_stats`; >> > TRUNCATE `dspam_token_data`; >> > >> > >> >> >> >> On Thu, Jan 28, 2010 at 6:14 PM, Stevan Bajić <[email protected]> wrote: >> >> > On Thu, 28 Jan 2010 11:55:48 -0500 >> >> > Roman Gelfand <[email protected]> wrote: >> >> > >> >> >> # >> >> >> # Training Mode: The default training mode to use for all operations, >> >> >> when >> >> >> # one has not been specified on the commandline or in the user's >> >> >> preferences. >> >> >> # Acceptable values are: >> >> >> # toe Train on Error (Only) >> >> >> # teft Train Everything (Trains on every message) >> >> >> # tum Train Until Mature (Train only tokens without enough >> >> >> data) >> >> >> # notrain Do not train or store signatures (large ISP systems, >> >> >> post-train) >> >> >> # >> >> >> TrainingMode teft >> >> >> >> >> > Please switch that to "toe"! Using "teft" is old school and one part of >> >> > your problem. >> >> > >> >> > >> >> >> # >> >> >> # Features: Specify features to activate by default; can also be >> >> >> specified >> >> >> # on the commandline. See the documentation for a list of available >> >> >> features. >> >> >> # If _any_ features are specified on the commandline, these are >> >> >> ignored. >> >> >> # >> >> >> #Feature noise >> >> >> Feature whitelist >> >> >> >> >> > Enable "noise". It's a good thing that will help you. >> >> > >> >> > >> >> >> # Training Buffer: The training buffer waters down statistics during >> >> >> training. >> >> >> # It is designed to prevent false positives, but can also dramatically >> >> >> reduce >> >> >> # dspam's catch rate during initial training. This can be a number >> >> >> from 0 >> >> >> # (no buffering) to 10 (maximum buffering). If you are paranoid about >> >> >> false >> >> >> # positives, you should probably enable this option. >> >> >> # >> >> >> #Feature tb=5 >> >> >> >> >> > Depending on the data you already have learned, it could be beneficial >> >> > to enable this option. >> >> > >> >> > >> >> >> # >> >> >> # Tokenizer: Specify the tokenizer to use. The tokenizer is the piece >> >> >> # responsible for parsing the message into individual tokens. >> >> >> Depending on >> >> >> # how many resources you are willing to trade off vs. accuracy, you may >> >> >> # choose to use a less or more detailed tokenizer: >> >> >> # word uniGram (single word) tokenizer >> >> >> # Tokenizes message into single individual words/tokens >> >> >> # example: "free" and "viagra" >> >> >> # chain biGram (chained tokens) tokenizer (default) >> >> >> # Single words + chains adjacent tokens together >> >> >> # example: "free" and "viagra" and "free viagra" >> >> >> # sbph Sparse Binary Polynomial Hashing tokenizer >> >> >> # Creates sparse token patterns across sliding window of >> >> >> 5-tokens >> >> >> # example: "the quick * fox jumped" and "the * * fox jumped" >> >> >> # osb Orthogonal Sparse biGram tokenizer >> >> >> # Similar to SBPH, but only uses the biGrams >> >> >> # example: "the * * fox" and "the * * * jumped" >> >> >> # >> >> >> Tokenizer chain >> >> >> >> >> > That is the main part of your problem. It is no surprise that you >> >> > retrain and retrain and retrain and still don't get the data to flip >> >> > the state. Please use "osb". It's way better for your situation. >> >> > >> >> > >> >> >> # >> >> >> # Preferences: Specify any preferences to set by default, unless >> >> >> otherwise >> >> >> # overridden by the user (see next section) or a default.prefs file. >> >> >> # If user or default.prefs are found, the user's preferences will >> >> >> override any >> >> >> # defaults. >> >> >> # >> >> >> Preference "trainingMode=TEFT" # { TOE | TUM | TEFT | >> >> >> NOTRAIN } -> default:teft >> >> >> >> >> > Set this to "TOE" >> >> > >> >> > ------------------------------------------------------------------------------ >> >> > The Planet: dedicated and managed hosting, cloud storage, colocation >> >> > Stay online with enterprise data centers and the best network in the >> >> > business >> >> > Choose flexible plans and management services without long-term >> >> > contracts >> >> > Personal 24x7 support from experience hosting pros just a phone call >> >> > away. >> >> > http://p.sf.net/sfu/theplanet-com >> >> > _______________________________________________ >> >> > Dspam-user mailing list >> >> > [email protected] >> >> > https://lists.sourceforge.net/lists/listinfo/dspam-user >> >> > >> >> >> > >> > ------------------------------------------------------------------------------ >> > The Planet: dedicated and managed hosting, cloud storage, colocation >> > Stay online with enterprise data centers and the best network in the >> > business >> > Choose flexible plans and management services without long-term contracts >> > Personal 24x7 support from experience hosting pros just a phone call away. >> > http://p.sf.net/sfu/theplanet-com >> > _______________________________________________ >> > Dspam-user mailing list >> > [email protected] >> > https://lists.sourceforge.net/lists/listinfo/dspam-user >> > >> > > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, colocation > Stay online with enterprise data centers and the best network in the business > Choose flexible plans and management services without long-term contracts > Personal 24x7 support from experience hosting pros just a phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Dspam-user mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/dspam-user > ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Dspam-user mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dspam-user
