On 01/18/2011 03:25 PM, Dave Pooser wrote:
> On 1/18/11 12:52 AM, "Warren Togami Jr."<wtog...@gmail.com>  wrote:
>
>> I am seeking volunteers to help me build and administrate a "ham trap".
>> The idea is to subscribe a list of unique e-mail addresses to various
>> retailers, airlines, government and other legitimate bulk mail senders.
>
> The possible fly in the ointment I see is that you wouldn't necessarily have
> access to some sorts of transactional emails-- airline flight reminders and
> things of that nature. Would that be something where you'd be interested in
> getting mail cc:ed to a hamtrap address? For example, I use tagged email
> addresses for different airlines, and it would be trivial for me to have my
> server relay those messages to a hamtrap address as well as delivering to my
> personal email if that sort of thing would be useful.

You are correct that this isn't transactional mail. It is, however, a low-effort way to automatically collect a subset of the ham that real users receive, much of which is entirely missing from our nightly corpus.

https://fedorahosted.org/auto-mass-check/
As for the ham you mention, I strongly recommend running your own nightly masscheck and uploading the logs. This avoids privacy problems and lets you check and correct quality issues in your own corpus.

Warren
