E-Scribe News : a programmer’s blog

About Me

I'm Paul Bissex, and e-scribe.com is my consulting business. I build web applications using open source software, especially Django. In the '90s I did graphic design for newspapers and magazines. Then I wrote technology commentary and reviews for Wired, Salon.com, Chicago Tribune, and lots of little places you've never heard of. Feel free to email me.

Book

I'm co-author of "Python Web Development with Django", an excellent guide to my favorite web framework. Published by Addison-Wesley, it is available from Amazon and your favorite technical bookstore as well.

Colophon

Built using Django, served by Apache and mod_wsgi. The database is SQLite. The operating system is FreeBSD, on a VPS hosted at Johncompanies.com. Comment-spam protection by Akismet. Vintage topo imagery from the Maptech archive. The markup engine is Markdown.

Stuff I Use

Akismet, bitbucket, del.icio.us, Django, Emacs, FreeBSD, Git, jQuery, LaunchBar, Markdown, Mercurial, OS X, Postfix, Python, Review Board, S3, SQLite, TextMate, Ubuntu Linux

Spam Report

At least 96060 pieces of comment spam killed since January 2008, mostly via Akismet.

More on "Splogs"

"Splog" as a label for spam blogs seems to be taking off. I'm not crazy about it, because I think the challenges of fake-blog spam sites, and the possible solutions, overlap hugely with those of fake-portal and fake-search-engine link farms. The difference matters mostly to people who run blog indexing services.

Not to discount their needs or their efforts. J. Scott Johnson, CTO of Feedster, weighs in today with a piece in Online Media Daily. One of his best points: "Can the war on splogs be won? No." In other words, expect to deter and minimize blog spam, not to eliminate it.

For the past few days I've been involved in a small-group discussion of this issue; one participant is a technical staffer from a leading blog search/stats site. He's intrigued by a couple of ideas: a flagging system where participation would depend on endorsement by a known "good" flagger, and whitelisting of known "good" sites to defend against some of the many possible poisoning scenarios.
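To make those two ideas concrete, here is a rough sketch of how they might fit together. Nothing here comes from the discussion itself: the `FlagRegistry` class, its method names, and the example identities are all hypothetical, and a real system would need persistence, revocation, and some defense against a trusted flagger going bad.

```python
class FlagRegistry:
    """Hypothetical sketch: endorsement-gated flagging plus a site whitelist."""

    def __init__(self, seed_flaggers, whitelist=()):
        # Flaggers trusted from the start (assumed to be vetted by hand).
        self.trusted = set(seed_flaggers)
        # Known "good" sites that can never be flagged (guards against poisoning).
        self.whitelist = set(whitelist)
        # Maps each flagged site to the set of trusted flaggers who flagged it.
        self.flags = {}

    def endorse(self, endorser, newcomer):
        """Admit a new flagger, but only on the word of an already-trusted one."""
        if endorser not in self.trusted:
            return False
        self.trusted.add(newcomer)
        return True

    def flag(self, flagger, site):
        """Record a spam flag; reject untrusted flaggers and whitelisted sites."""
        if flagger not in self.trusted or site in self.whitelist:
            return False
        self.flags.setdefault(site, set()).add(flagger)
        return True
```

In this sketch an unknown party can't flag anything until someone already trusted vouches for them, and nobody, trusted or not, can flag a whitelisted site.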

But I don't hear any of the services talking about a distributed, shared, public system for filtering search results. Similar to, as I said before, Razor or Pyzor or DCC in the email-spam world. Maybe it's just not in the cards. Or maybe we just have to build it and convince them that way.

Imagine if there were a public service that you could feed a list of blog URLs and receive back a "cleaned" list that eliminated known spam blogs. With appropriate software support you could use this service to filter comments, regulate referrer spam, receive alerts of domain-jacking among the sites in your blogroll, filter the results of web searches you subscribe to via RSS, and so on. I have some specific implementation ideas that I'll share in a future post -- or maybe even in a proof-of-concept.
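Just to illustrate the shape of the "feed in a list, get back a cleaned list" idea, here is a minimal sketch. It is not the proof-of-concept: the `clean` and `host_of` functions and the hard-coded `KNOWN_SPAM_HOSTS` set are made up for illustration, and a real service would presumably consult a shared, public database rather than a local set.

```python
from urllib.parse import urlsplit

# Hypothetical stand-in for a shared, public list of known spam-blog hosts.
KNOWN_SPAM_HOSTS = {"cheap-pills.example", "free-ringtones.example"}


def host_of(url):
    """Return the lowercased hostname of a URL, minus any leading 'www.'."""
    host = urlsplit(url).hostname or ""
    return host[4:] if host.startswith("www.") else host


def clean(urls, spam_hosts=KNOWN_SPAM_HOSTS):
    """Return only the URLs whose host is not a known spam blog, in order."""
    return [u for u in urls if host_of(u) not in spam_hosts]
```

The same `clean` call could sit behind a comment filter, a referrer log, or an RSS search feed; the only thing that changes is where the list of URLs comes from.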

Tuesday, September 20th, 2005
1 comment

Comment from Nancy, 19 months later

Thanks
