Re: VGER does gradual SPF activation (FAQ matter)

From: Jeff Garzik
Date: Mon Jun 12 2006 - 10:51:31 EST


David Miller wrote:
We definitely need a better spam solution at vger, the reason is that
the current mechanism (ad-hoc by-hand regexp blocking) creates lots of
problems. For one thing, it means that people with names in languages
other than english get blocked when their emails are quoted in
postings. This is because we don't understand a lot of languages, so
we just regexp block multibyte characters typically assosciated with
that language in order to block spam written in that language.

That isn't acceptable in the long term.

Here's another vote against SPF.

FWIW, DomainKeys looks nice.


To be honest I'm all for some kind of bayesian filter at vger as long
as the rejected postings go somewhere into a folder I can scan every
couple of days looking for false positives.

Though this may not be your thing, I've often thought that this sort of task would be an -excellent- janitor task.

Create two simple web pages, one that shows the last 24 hours' worth of LKML posts, and another one that shows the last 24 hours' worth of spam. Allow any user on the Internet to report an LKML post as spam, or alternately, highlight a false positive as not-spam. (perhaps generate one of those wavy-text verify-you-are-a-human graphics)

Then you, as admin, only have to click a button that accepts or rejects the submission(s). If you want to scan it yourself for false positives, you just hit the same webpage as everybody else.

That feedback is then fed into the bayesian system, to train it using well-known methods.

Jeff



-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/