http://issues.apache.org/SpamAssassin/show_bug.cgi?id=3787
------- Additional Comments From [EMAIL PROTECTED] 2005-12-14 21:59 -------

I trimmed the duplicate UTF-8 warns out of the debug output below, leaving one warn for each unique rule in 70_sare_obfu.cf that triggers it. I wasn't running 70_sare_adult.cf or 70_sare_specific during this test, so the obfu rules in those files that also trigger the warn do not appear in this debug output.

# echo $LANG
en_US
# set | grep LC_
#
# perl -e 'use HTML::Parser; print HTML::Parser->VERSION . "\n"';
3.46
# svn info /tmp/spamassassin-trunk/
Path: /tmp/spamassassin-trunk
URL: http://svn.apache.org/repos/asf/spamassassin/trunk
Repository UUID: 13f79535-47bb-0310-9956-ffa450edef68
Revision: 356857
# spamassassin -V
SpamAssassin version 3.2.0-r356425
  running on Perl version 5.8.6
# ls -la /etc/mail/spamassassin/
-rw-r--r-- 1 root root 158513 Oct 1 15:00 70_sare_obfu.cf
-rw-r--r-- 1 root root 890 Sep 15 13:23 init.pre
-rw-r--r-- 1 root root 1208 Sep 15 13:23 local.cf
-rw-r--r-- 1 root root 2397 Sep 15 13:23 v310.pre
# ls -la /usr/share/spamassassin/
-rw-r--r-- 1 root root 5495 Dec 14 14:02 10_default_prefs.cf
-rw-r--r-- 1 root root 14312 Dec 14 14:02 20_dnsbl_tests.cf
-rw-r--r-- 1 root root 17642 Dec 14 14:02 20_html_tests.cf
-rw-r--r-- 1 root root 2164 Dec 14 14:02 20_net_tests.cf
-rw-r--r-- 1 root root 2334 Dec 14 14:02 23_bayes.cf
-rw-r--r-- 1 root root 420 Dec 14 14:02 25_accessdb.cf
-rw-r--r-- 1 root root 1345 Dec 14 14:02 25_antivirus.cf
-rw-r--r-- 1 root root 190 Dec 14 14:02 25_dcc.cf
-rw-r--r-- 1 root root 1947 Dec 14 14:02 25_domainkeys.cf
-rw-r--r-- 1 root root 2738 Dec 14 14:02 25_hashcash.cf
-rw-r--r-- 1 root root 189 Dec 14 14:02 25_pyzor.cf
-rw-r--r-- 1 root root 2201 Dec 14 14:02 25_razor2.cf
-rw-r--r-- 1 root root 2873 Dec 14 14:02 25_spf.cf
-rw-r--r-- 1 root root 352 Dec 14 14:02 25_textcat.cf
-rw-r--r-- 1 root root 6544 Dec 14 14:02 25_uribl.cf
-rw-r--r-- 1 root root 1116 Dec 14 14:02 60_awl.cf
-rw-r--r-- 1 root root 4906 Dec 14 14:02 60_whitelist.cf
-rw-r--r-- 1 root root 1726 Dec 14 14:02 60_whitelist_subject.cf
-rw-r--r-- 1 root root 101479 Dec 14 14:02 languages
-rw-r--r-- 1 root root 18944 Dec 14 14:02 triplets.txt
-rw-r--r-- 1 root root 1869 Dec 14 14:02 user_prefs.template
# cat /root/test | spamc
X-Spam-Checker-Version: SpamAssassin 3.2.0-r356425 (2005-12-12) on asset.nmgi.com
X-Spam-Level: ****
X-Spam-Status: No, score=4.0 required=5.0 tests=HTML_60_70,HTML_MESSAGE,
        HTML_MISSING_CTYPE,HTML_SHORT_LENGTH autolearn=no version=3.2.0-r356425
Content-Type: text/html; charset=us-ascii

<html><body> TUMS® Smoothies™ </body></html>
# spamd -D -L > spamd.out 2>&1
^C
# cat spamd.out
[19475] dbg: logger: adding facilities: all
[19475] dbg: logger: logging level is DBG
[19475] dbg: logger: trying to connect to syslog/unix...
[19475] dbg: logger: opening syslog with unix socket
[19475] dbg: logger: successfully connected to syslog/unix
[19475] dbg: logger: successfully added syslog method
[19475] dbg: spamd: creating INET socket:
[19475] dbg: spamd: Listen: 128
[19475] dbg: spamd: LocalAddr: 127.0.0.1
[19475] dbg: spamd: LocalPort: 783
[19475] dbg: spamd: Proto: 6
[19475] dbg: spamd: ReuseAddr: 1
[19475] dbg: spamd: Type: 1
[19475] dbg: logger: adding facilities: all
[19475] dbg: logger: logging level is DBG
[19475] dbg: generic: SpamAssassin version 3.2.0-r356425
[19475] dbg: config: score set 0 chosen.
[19475] dbg: dns: no ipv6
[19475] dbg: dns: is Net::DNS::Resolver available? yes
[19475] dbg: dns: Net::DNS version: 0.49
[19475] dbg: dns: name server: 172.17.1.10, LocalAddr: 0.0.0.0
[19475] dbg: spamd: Preloading modules with HOME=/tmp/spamd-19475-init
[19475] dbg: ignore: test message to precompile patterns and load modules
[19475] dbg: config: using "/etc/mail/spamassassin" for site rules pre files
[19475] dbg: config: read file /etc/mail/spamassassin/init.pre
[19475] dbg: config: read file /etc/mail/spamassassin/v310.pre
[19475] dbg: config: using "/usr/share/spamassassin" for sys rules pre files
[19475] dbg: config: using "/usr/share/spamassassin" for default rules dir
[19475] dbg: config: read file /usr/share/spamassassin/10_default_prefs.cf
[19475] dbg: config: read file /usr/share/spamassassin/20_dnsbl_tests.cf
[19475] dbg: config: read file /usr/share/spamassassin/20_html_tests.cf
[19475] dbg: config: read file /usr/share/spamassassin/20_net_tests.cf
[19475] dbg: config: read file /usr/share/spamassassin/23_bayes.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_accessdb.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_antivirus.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_dcc.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_domainkeys.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_hashcash.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_pyzor.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_razor2.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_spf.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_textcat.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_uribl.cf
[19475] dbg: config: read file /usr/share/spamassassin/60_awl.cf
[19475] dbg: config: read file /usr/share/spamassassin/60_whitelist.cf
[19475] dbg: config: read file /usr/share/spamassassin/60_whitelist_subject.cf
[19475] dbg: config: using "/etc/mail/spamassassin" for site rules dir
[19475] dbg: config: read file /etc/mail/spamassassin/70_sare_obfu.cf
[19475] dbg: config: read file /etc/mail/spamassassin/local.cf
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::URIDNSBL from @INC
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::URIDNSBL=HASH(0x8d3a2c0)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::Hashcash from @INC
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::Hashcash=HASH(0x8d28048)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::SPF from @INC
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::SPF=HASH(0x8d69c30)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::Pyzor from @INC
[19475] dbg: pyzor: local tests only, disabling Pyzor
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::Pyzor=HASH(0x8e5e4f4)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::SpamCop from @INC
[19475] dbg: reporter: local tests only, disabling SpamCop
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::SpamCop=HASH(0x8ed6dd0)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::AWL from @INC
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::AWL=HASH(0x8efd740)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::AutoLearnThreshold from @INC
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::AutoLearnThreshold=HASH(0x8f08558)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::WhiteListSubject from @INC
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::WhiteListSubject=HASH(0x8f14a80)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::MIMEHeader from @INC
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::MIMEHeader=HASH(0x8f1e3fc)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::ReplaceTags from @INC
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::ReplaceTags=HASH(0x8f2e1b0)
[19475] dbg: plugin: Mail::SpamAssassin::Plugin::ReplaceTags=HASH(0x8f2e1b0) implements 'finish_parsing_end'
[19475] dbg: replacetags: replacing tags
[19475] dbg: replacetags: done replacing tags
[19475] dbg: bayes: no dbs present, cannot tie DB R/O: /tmp/spamd-19475-init/.spamassassin/bayes_toks
[19475] dbg: config: score set 0 chosen.
[19475] dbg: message: ---- MIME PARSER START ----
[19475] dbg: message: main message type: text/plain
[19475] dbg: message: parsing normal part
[19475] dbg: message: added part, type: text/plain
[19475] dbg: message: ---- MIME PARSER END ----
[19475] dbg: bayes: no dbs present, cannot tie DB R/O: /tmp/spamd-19475-init/.spamassassin/bayes_toks
[19475] dbg: dns: is DNS available? 0
[19475] dbg: metadata: X-Spam-Relays-Trusted:
[19475] dbg: metadata: X-Spam-Relays-Untrusted:
[19475] dbg: message: no encoding detected
[19475] dbg: plugin: Mail::SpamAssassin::Plugin::URIDNSBL=HASH(0x8d3a2c0) implements 'parsed_metadata'
[19475] dbg: rules: local tests only, ignoring RBL eval
[19475] dbg: check: running tests for priority: 0
[19475] dbg: rules: running header regexp tests; score so far=0
[19475] dbg: plugin: registering glue method for check_hashcash_value (Mail::SpamAssassin::Plugin::Hashcash=HASH(0x8d28048))
[19475] dbg: plugin: registering glue method for check_hashcash_double_spend (Mail::SpamAssassin::Plugin::Hashcash=HASH(0x8d28048))
[19475] dbg: eval: all '*From' addrs: [EMAIL PROTECTED]
[19475] dbg: eval: all '*To' addrs:
[19475] dbg: plugin: registering glue method for check_subject_in_blacklist (Mail::SpamAssassin::Plugin::WhiteListSubject=HASH(0x8f14a80))
[19475] dbg: plugin: registering glue method for check_subject_in_whitelist (Mail::SpamAssassin::Plugin::WhiteListSubject=HASH(0x8f14a80))
[19475] dbg: rules: running body-text per-line regexp tests; score so far=0
[19475] dbg: uri: running uri tests; score so far=0
[19475] dbg: rules: running raw-body-text per-line regexp tests; score so far=0
[19475] dbg: rules: running full-text regexp tests; score so far=0
[19475] dbg: plugin: Mail::SpamAssassin::Plugin::URIDNSBL=HASH(0x8d3a2c0) implements 'check_tick'
[19475] dbg: check: running tests for priority: 500
[19475] dbg: plugin: Mail::SpamAssassin::Plugin::URIDNSBL=HASH(0x8d3a2c0) implements 'check_post_dnsbl'
[19475] dbg: rules: running meta tests; score so far=0
[19475] dbg: rules: running header regexp tests; score so far=0
[19475] dbg: rules: running body-text per-line regexp tests; score so far=0
[19475] dbg: uri: running uri tests; score so far=0
[19475] dbg: rules: running raw-body-text per-line regexp tests; score so far=0
[19475] dbg: rules: running full-text regexp tests; score so far=0
[19475] dbg: check: running tests for priority: 1000
[19475] dbg: rules: running meta tests; score so far=0
[19475] dbg: rules: running header regexp tests; score so far=0
[19475] dbg: plugin: registering glue method for check_from_in_auto_whitelist (Mail::SpamAssassin::Plugin::AWL=HASH(0x8efd740))
[19475] dbg: locker: safe_lock: created /tmp/spamd-19475-init/.spamassassin/auto-whitelist.lock.asset.nmgi.com.19475
[19475] dbg: locker: safe_lock: trying to get lock on /tmp/spamd-19475-init/.spamassassin/auto-whitelist with 0 retries
[19475] dbg: locker: safe_lock: link to /tmp/spamd-19475-init/.spamassassin/auto-whitelist.lock: link ok
[19475] dbg: auto-whitelist: tie-ing to DB file of type DB_File R/W in /tmp/spamd-19475-init/.spamassassin/auto-whitelist
[19475] dbg: auto-whitelist: db-based [EMAIL PROTECTED]|ip=none scores 0/0
[19475] dbg: auto-whitelist: AWL active, pre-score: 0, autolearn score: 0, mean: undef, IP: undef
[19475] dbg: auto-whitelist: DB addr list: untie-ing and unlocking
[19475] dbg: auto-whitelist: DB addr list: file locked, breaking lock
[19475] dbg: locker: safe_unlock: unlink /tmp/spamd-19475-init/.spamassassin/auto-whitelist.lock
[19475] dbg: auto-whitelist: post auto-whitelist score: 0
[19475] dbg: rules: running body-text per-line regexp tests; score so far=0
[19475] dbg: uri: running uri tests; score so far=0
[19475] dbg: rules: running raw-body-text per-line regexp tests; score so far=0
[19475] dbg: rules: running full-text regexp tests; score so far=0
[19475] dbg: check: is spam? score=0 required=5
[19475] dbg: check: tests=
[19475] dbg: check: subtests=
[19475] dbg: config: copying current conf to backup
[19475] info: spamd: server started on port 783/tcp (running version 3.2.0-r356425)
[19475] info: spamd: server pid: 19475
[19475] info: spamd: server successfully spawned child process, pid 19478
[19475] dbg: prefork: child 19478: entering state 0
[19478] dbg: prefork: sysread(8) not ready, wait max 300 secs
[19475] dbg: prefork: new lowest idle kid: none
[19479] dbg: prefork: sysread(9) not ready, wait max 300 secs
[19475] info: spamd: server successfully spawned child process, pid 19479
[19475] dbg: prefork: child 19479: entering state 0
[19475] dbg: prefork: new lowest idle kid: none
[19475] dbg: prefork: child 19478: entering state 1
[19475] dbg: prefork: new lowest idle kid: 19478
[19475] dbg: prefork: child reports idle
[19475] dbg: prefork: child 19479: entering state 1
[19475] dbg: prefork: new lowest idle kid: 19478
[19475] dbg: prefork: child reports idle
[19475] info: prefork: child states: II
[19475] dbg: prefork: ordered 19478 to accept
[19475] dbg: prefork: sysread(7) not ready, wait max 300 secs
[19478] info: spamd: connection from localhost.localdomain [127.0.0.1] at port 34629
[19475] dbg: prefork: child 19478: entering state 2
[19475] dbg: prefork: new lowest idle kid: 19479
[19478] info: spamd: setuid to root succeeded
[19478] dbg: info: user has changed
[19478] dbg: bayes: no dbs present, cannot tie DB R/O: /root/.spamassassin/bayes_toks
[19478] dbg: config: score set 0 chosen.
[19478] warn: spamd: still running as root: user not specified with -u, not found, or set to root, falling back to nobody at /usr/bin/spamd line 1152, <GEN5> line 4.
[19478] info: spamd: processing message (unknown) for root:99
[19478] dbg: dns: name server: 172.17.1.10, LocalAddr: 0.0.0.0
[19478] dbg: bayes: no dbs present, cannot tie DB R/O: /root/.spamassassin/bayes_toks
[19478] dbg: metadata: X-Spam-Relays-Trusted:
[19478] dbg: metadata: X-Spam-Relays-Untrusted:
[19478] dbg: message: ---- MIME PARSER START ----
[19478] dbg: message: main message type: text/html
[19478] dbg: message: parsing normal part
[19478] dbg: message: added part, type: text/html
[19478] dbg: message: ---- MIME PARSER END ----
[19478] dbg: message: no encoding detected
[19478] dbg: rules: local tests only, ignoring RBL eval
[19478] dbg: check: running tests for priority: 0
[19478] dbg: rules: running header regexp tests; score so far=0
[19478] dbg: eval: all '*From' addrs:
[19478] dbg: eval: all '*To' addrs:
[19478] dbg: rules: running body-text per-line regexp tests; score so far=0
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, immediately after start byte 0xcf) in pattern match (m//) at /etc/mail/spamassassin/70_sare_obfu.cf, rule __SARE_OBFU_PRICE1, line 1, <GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, immediately after start byte 0xd1) in pattern match (m//) at /etc/mail/spamassassin/70_sare_obfu.cf, rule SARE_OBFU_PRESCR_SPL1, line 1, <GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, immediately after start byte 0xd5) in pattern match (m//) at /etc/mail/spamassassin/70_sare_obfu.cf, rule __SARE_OBFU_SOFT2, line 1, <GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, immediately after start byte 0xce) in pattern match (m//) at /etc/mail/spamassassin/70_sare_obfu.cf, rule SARE_OBFU_VICODIN, line 1, <GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, immediately after start byte 0xd0) in pattern match (m//) at /etc/mail/spamassassin/70_sare_obfu.cf, rule __SARE_OBFU_CIALIS2, line 1, <GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, immediately after start byte 0xd1) in pattern match (m//) at /etc/mail/spamassassin/70_sare_obfu.cf, rule SARE_OBFU_PRESCRIP, line 1, <GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, immediately after start byte 0xce) in pattern match (m//) at /etc/mail/spamassassin/70_sare_obfu.cf, rule __SARE_OBFU_VISIT1, line 1, <GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, immediately after start byte 0xd2) in pattern match (m//) at /etc/mail/spamassassin/70_sare_obfu.cf, rule SARE_OBFU_XANAX, line 1, <GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, immediately after start byte 0xc4) in pattern match (m//) at /etc/mail/spamassassin/70_sare_obfu.cf, rule SARE_OBFU_GUARANTEE, line 1, <GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, immediately after start byte 0xd0) in pattern match (m//) at /etc/mail/spamassassin/70_sare_obfu.cf, rule __SARE_OBFU_MEDS2, line 1, <GEN5> line 10.
[19478] dbg: uri: running uri tests; score so far=0
[19478] dbg: rules: ran eval rule HTML_SHORT_LENGTH ======> got hit (1)
[19478] dbg: rules: ran eval rule __HTML_LENGTH_512 ======> got hit (1)
[19478] dbg: rules: ran eval rule HTML_60_70 ======> got hit (1)
[19478] dbg: bayes: no dbs present, cannot tie DB R/O: /root/.spamassassin/bayes_toks
[19478] dbg: bayes: not scoring message, returning undef
[19478] dbg: bayes: opportunistic call attempt failed, DB not readable
[19478] dbg: rules: ran eval rule HTML_MESSAGE ======> got hit (1)
[19478] dbg: rules: ran eval rule __HTML_LENGTH_384 ======> got hit (1)
[19478] dbg: rules: ran eval rule __HTML_LENGTH_0000_1024 ======> got hit (1)
[19478] dbg: rules: running raw-body-text per-line regexp tests; score so far=3
[19478] dbg: rules: running full-text regexp tests; score so far=3
[19478] dbg: check: running tests for priority: 500
[19478] dbg: rules: running meta tests; score so far=3
[19478] dbg: rules: running header regexp tests; score so far=4
[19478] dbg: rules: running body-text per-line regexp tests; score so far=4
[19478] dbg: uri: running uri tests; score so far=4
[19478] dbg: rules: running raw-body-text per-line regexp tests; score so far=4
[19478] dbg: rules: running full-text regexp tests; score so far=4
[19478] dbg: check: running tests for priority: 1000
[19478] dbg: rules: running meta tests; score so far=4
[19478] dbg: rules: running header regexp tests; score so far=4
[19478] dbg: rules: running body-text per-line regexp tests; score so far=4
[19478] dbg: uri: running uri tests; score so far=4
[19478] dbg: rules: running raw-body-text per-line regexp tests; score so far=4
[19478] dbg: rules: running full-text regexp tests; score so far=4
[19478] dbg: plugin: Mail::SpamAssassin::Plugin::AutoLearnThreshold=HASH(0x8f08558) implements 'autolearn_discriminator'
[19478] dbg: learn: auto-learn: currently using scoreset 0
[19478] dbg: learn: auto-learn: message score: 4, computed score for autolearn: 4
[19478] dbg: learn: auto-learn? ham=0.1, spam=12, body-points=3, head-points=0, learned-points=0
[19478] dbg: learn: auto-learn? no: inside auto-learn thresholds, not considered ham or spam
[19478] dbg: check: is spam? score=4 required=5
[19478] dbg: check: tests=HTML_60_70,HTML_MESSAGE,HTML_MISSING_CTYPE,HTML_SHORT_LENGTH
[19478] dbg: check: subtests=__HTML_LENGTH_0000_1024,__HTML_LENGTH_384,__HTML_LENGTH_512
[19478] info: spamd: clean message (4.0/5.0) for root:99 in 0.0 seconds, 99 bytes.
[19478] info: spamd: result: . 4 - HTML_60_70,HTML_MESSAGE,HTML_MISSING_CTYPE,HTML_SHORT_LENGTH scantime=0.0,size=99,user=root,uid=99,required_score=5.0,rhost=localhost.localdomain,raddr=127.0.0.1,rport=34629,mid=(unknown),autolearn=no
[19478] dbg: config: copying current conf from backup
[19475] dbg: prefork: child 19478: entering state 1
[19475] dbg: prefork: new lowest idle kid: 19478
[19475] dbg: prefork: child reports idle
[19475] info: prefork: child states: II
[19478] dbg: prefork: sysread(8) not ready, wait max 300 secs
[19475] info: spamd: server killed by SIGINT, shutting down
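
For anyone repeating this test, the trimming described at the top of this comment (one warn kept per unique rule) could probably be approximated with a small shell function rather than by hand. This is only a sketch, not what was actually run here: it assumes each log entry sits on its own line in spamd.out, and the function name dedupe_utf8_warns and the output file spamd.trimmed are made up for illustration and are not part of SpamAssassin.

```shell
#!/bin/sh
# Hypothetical helper: keep only the first "Malformed UTF-8" warn seen
# for each rule name and pass every other log line through unchanged.
# Assumes one log entry per line, as spamd normally writes them.
dedupe_utf8_warns() {
    awk '
        /warn: Malformed UTF-8 character/ {
            # The rule name follows the literal text ", rule " (7 characters).
            if (match($0, /, rule [A-Za-z0-9_]+/)) {
                rule = substr($0, RSTART + 7, RLENGTH - 7)
                if (seen[rule]++) next   # repeat for this rule: drop it
            }
        }
        { print }
    ' "$@"
}
```

It would be used along the lines of `dedupe_utf8_warns spamd.out > spamd.trimmed`. Keying on the rule name alone means two warns for the same rule with different start bytes collapse into one, which matches the "1 warn for every unique rule" trimming described above.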
