alexpgp: (Default)
alexpgp ([personal profile] alexpgp) wrote2003-03-18 09:31 pm

More on spambayes...

My subjective feeling regarding the spambayes spam-filtering software is that it is working, and working better as time goes on. Interestingly enough, a lot of the Nigerian 419 scam letters keep getting through, but that's probably the result of the letters looking so much like legitimate e-mail.

In reviewing incoming spam caught by spambayes, here's a breakdown of how many messages were caught by the spambayes, out of every 10 consecutive spam messages, for the first 300 messages in the file (arranged chronologically):

Messages# caught% of spam caught,
total
% of spam caught,
last 50 msgs
000-010330-
011-020645-
021-030753-
031-040553-
041-05024644
051-06064852
061-07085356
071-08055352
081-09035048
091-10065156
101-11055054
111-12075252
121-13075356
131-14045258
141-15095564
151-160105874
161-17065872
171-18045766
181-19055668
191-20085866
201-21075860
211-22096066
221-23066070
231-24066072
241-25045964
251-26075964
261-27096064
271-28086168
281-29096274
291-30076280

The "dips" correspond to when I check my mail and "train" spambayes by passing a message through its analysis engine, after which it is sent to the spam file.

I get the impression that there is an inexorable creep upward in the trend of spam messages caught by the software. Indeed, I'm noticing that I'm seeing (though I may not actually be receiving) a lot less spam than I used to, and that my main bugaboo right now is messages that aren't spam, but aren't terribly welcome, either. (Just call me Mr. Fussbudget.)

Cheers...

[identity profile] volkris.livejournal.com 2003-03-18 09:32 pm (UTC)(link)
I've been feeling for a while that legislation against spam is a mistake of trying to implement a legal solution for a technological problem.

I know my spam filters catch about 90% of it without any tuning and very few false positives. With multiple effective methods for tagging spam, attacking it through new laws is actually kind of lazy. It also goes in the troubling direction of telling people what they can and can't do with the internet connection they're paying for.

My feelings are similar on the legislation against telemarketers: surely we can solve the problem technologically if we really tried.