E-Scribe : a programmer’s blog

About Me

I'm Paul Bissex. I build web applications using open source software, especially Django. Started my career doing graphic design for newspapers and magazines in the '90s. Then wrote tech commentary and reviews for Wired, Salon, Chicago Tribune, and others you never heard of. Then I built operations software at a photography school. Then I helped big media serve 40 million pages a day. Then I worked on a translation services API doing millions of dollars of business. Now I'm building the core platform of a global startup accelerator. Feel free to email me.

Book

I co-wrote "Python Web Development with Django". It was the first book to cover the long-awaited Django 1.0. Published by Addison-Wesley and still in print!

Colophon

Built using Django, served with gunicorn and nginx. The database is SQLite. Hosted on a FreeBSD VPS at Johncompanies.com. Comment-spam protection by Akismet.

Stuff I Use

bitbucket, Django, Emacs, FreeBSD, Git, jQuery, LaunchBar, Markdown, Mercurial, OS X, Python, Review Board, S3, SQLite, Sublime Text, Ubuntu Linux

Spam Report

At least 236507 pieces of comment spam killed since 2008, mostly via Akismet.

Spam stats

One technical interest I haven't written much about here is spam. I have a fairly aggressive anti-spam setup, and I have a simple spam statistics page that gives hourly breakdowns. But what I've wanted for a long while is some way to aggregate spam stats from other servers into a sort of spam weather report. There are all sorts of reasons why this is impossible to do perfectly -- people have different criteria for what constitutes spam, for one -- but I still think a useful model for sharing data could be worked out. People who are already generating spam stats could publish their data in a microformat, for example. Alternatively, they could submit periodic automatic reports to a central server, which would then make the stats available in machine-readable form. The key would be to make it easy for people to make their data available.
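To make the "central server" idea a bit more concrete, here is a rough sketch in Python of what a per-server report and a naive aggregation step might look like. Everything in it is an assumption for illustration: the function names (`hourly_counts`, `build_report`, `merge_reports`), the JSON field names, and the idea of including a free-text `criteria` field so that consumers can see how each server defines spam. It is not an existing format or standard.

```python
import json
from collections import Counter
from datetime import datetime, timezone


def hourly_counts(timestamps):
    """Bucket timezone-aware spam timestamps into UTC hours."""
    counts = Counter()
    for ts in timestamps:
        hour = ts.astimezone(timezone.utc).replace(
            minute=0, second=0, microsecond=0
        )
        counts[hour.isoformat()] += 1
    return dict(sorted(counts.items()))


def build_report(server_id, timestamps, criteria):
    """Build a JSON-serializable report for one server (hypothetical schema)."""
    return {
        "server": server_id,
        # Servers disagree on what counts as spam, so say what was measured.
        "criteria": criteria,
        "hourly": hourly_counts(timestamps),
    }


def merge_reports(reports):
    """Sum hourly counts across reports; a deliberately naive aggregation."""
    total = Counter()
    for report in reports:
        total.update(report["hourly"])
    return dict(sorted(total.items()))


if __name__ == "__main__":
    seen = [
        datetime(2005, 8, 14, 13, 5, tzinfo=timezone.utc),
        datetime(2005, 8, 14, 13, 50, tzinfo=timezone.utc),
        datetime(2005, 8, 14, 14, 1, tzinfo=timezone.utc),
    ]
    report = build_report("example-server", seen, "rejected at SMTP time")
    print(json.dumps(report, indent=2))
```

A server could produce a report like this from its mail logs on a schedule and either publish it at a known URL or submit it to a collector, which would run something like `merge_reports` over whatever came in. Summing raw counts ignores the differing-criteria problem entirely, which is exactly why the report carries a `criteria` field rather than pretending the numbers are comparable.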

This is sort of a lazyweb post. Does any project or standard for this already exist?

Sunday, August 14th, 2005

Comments are closed for this post. But I welcome questions/comments via email or Twitter.