An interesting idea...
Jul. 1st, 2004 07:17 pm![[personal profile]](https://www.dreamwidth.org/img/silk/identity/user.png)
From a post titled Email Reloaded, from Matthew Mullenweg's blog. Matt is one of the lead developers of WordPress.
In fact, since repairing onegin, the CRM114 spam filter has winnowed out nearly 40 MB of spam. Granted, that includes a backlog of mail that had been accumulating (2000 messages) since onegin went down, but I seem to recall the figure was merely 25 MB this morning, after drawing down the backlog.
Lots of paper chased today, but the pile of stuff to be done seems to not have gone down that much. Ah, well.
Cheers...
So the long and short of it is, I’m loading all the email I receive into a database using a fun combination of Procmail, Spam Assassin, and a sprinkling of command line PHP. I’m very excited about this, more excited than I’ve been about a new project in a while. For me, email has been steadily waning in utility for the past year, and I want to breathe new life into it. I’m tired of folders. I’m tired of slow searching. I don’t want to hand my email over to someone else, even if it’s Google. I don’t want to deal with mbox or IMAP or maildir or any of that junk. Those are implementation details of various servers and clients.Amen to the text I bolded. Besides specific messages with specific attachments, email has certainly become more of a burden than anything as time goes by.
Mirroring my email into a MySQL database has some interesting ramifications. Imagine instant Gmail-type searching using FULLTEXT or LIKE. Imagine instant email backup using MySQL replication. Think email RSS feeds, keyed on searches or senders or anything. Don’t forget the interesting metrics that can be extracted from this as well. Right now I’ve replaced my timely dozen with an counter running since this morning. If you send me an email, you’ll see it increment live. If it increments the spam counter you may want to resend it and reword your mortgage suggestion. This is the most basic of a hundred interesting things that can be culled from this data.
[Emphasis mine.]
In fact, since repairing onegin, the CRM114 spam filter has winnowed out nearly 40 MB of spam. Granted, that includes a backlog of mail that had been accumulating (2000 messages) since onegin went down, but I seem to recall the figure was merely 25 MB this morning, after drawing down the backlog.
Lots of paper chased today, but the pile of stuff to be done seems to not have gone down that much. Ah, well.
Cheers...