SpamBayes is a tool used to segregate unwanted mail (spam) from the mail you
want (ham). Before SpamBayes can be your spam filter of choice you need to
train it on representative samples of email you receive. After it's been
trained, you use SpamBayes to classify new mail according to its spamminess
and hamminess qualities.
This perl script generates top spam/ham rules fired for SpamAssassin
installations:
TOP SPAM RULES FIRED
----------------------------------------------------------------------
RANK RULE NAME COUNT %OFMAIL %OFSPAM %OFHAM
----------------------------------------------------------------------
1 CMAE_1 30455 26.68 70.34 0.03
2 BAYES_99 24478 21.98 56.54 0.89
3 RAZOR2_CHECK 11910 10.61 27.52 0.29
----------------------------------------------------------------------
TOP HAM RULES FIRED
----------------------------------------------------------------------
RANK RULE NAME COUNT %OFMAIL %OFSPAM %OFHAM
----------------------------------------------------------------------
1 BAYES_00 44253 46.60 20.71 62.40
2 ALL_TRUSTED 34534 49.42 50.61 48.70
3 SPF_PASS 10880 17.43 20.82 15.35
----------------------------------------------------------------------
iXhash is a plugin for the popular spam filter SpamAssassin.
It computes MD5 checksums of fragments of the body of an e-mail and compares
them to those of know spam. As such it works similar to the now standard
plugins that use the Pyzor, Razor and DCC software packages from within
SpamAssassin.
[ excerpt from developer's site ]
SpamOracle, a.k.a. "Saint Peter", is a tool to help detect and
filter away "spam" (unsolicited commercial e-mail). It proceeds
by statistical analysis of the words that appear in the e-mail,
comparing the frequencies of words with those found in a user-provided
corpus of known spam and known legitimate e-mail. The classification
algorithm is based on Bayes' formula, and is described in Paul
Graham's paper, A plan for spam.
This program is designed to work in conjunction with procmail. The
result of the analysis is output as an additional message header
X-Spam:, followed by yes, no or unknown, plus additional details.
A procmail rule can then test this X-Spam: header and deliver the
e-mail to the appropriate mailbox.
Akismet is a web service for recognising spam comments.
It promises to be almost 100% effective at catching
comment spam. They say that currently 81% of all comments
submitted to them are spam.
It's designed to work with the Wordpress Blog Tool, but
it's not restricted to that - so this is a Python
interface to the Akismet API.
You'll need a Wordpress Key to use it. This script will
allow you to plug akismet into any CGI script or web application,
and there are full docs in the code. It's extremely easy to use,
because the folks at akismet have implemented a nice and
straightforward REST API.
spampd is a program used within an e-mail delivery system to scan messages for
possible Unsolicited Commercial E-mail (UCE, aka spam) content.
It uses an excellent program called SpamAssassin (SA) to do the actual message
scanning. spampd acts as a transparent SMTP/LMTP proxy between two mail servers,
and during the transaction it passes the mail through SA. If SA decides the
mail could be spam, then spampd will ask SA to add some headers and a report to
the message indicating it's spam and why. spampd is written in Perl and should
theoretically run on any platform supported by Perl and SpamAssassin.
This plugin will place "Spam" and/or "Not Spam" buttons on the mailbox message
list page as well as on a single message view page. The action associated with
the buttons (as well as the button text) can be configured to suit most any
spam reporting system. Reporting by email, reporting by executing a command on
the server and reporting by moving (or copying) the message to a designated
folder are all supported. Any number of custom buttons may also be added, where
the associated action is completely customizable (for instance, adding the
message sender to a whitelist or blacklist).
DNS Flood Detector was developed to detect abusive usage levels
on high traffic nameservers and to enable quick response in
halting the use of one's nameserver to facilitate spam.
bmf is an extremely efficient Bayesian mail filter based on
Paul Graham's article, "A Plan for Spam." It's fast, small,
efficient, and more versatile than other Bayesian filters.
Rspamd is fast, modular and lightweight spam filter. It is designed to work
with big ammount of mail and can be easily extended with own filters written in
lua. This port provides development version of rspamd.