gloomytrousers.co.uk

wpoison-gt

The original wpoison, by Ronald F. Guilmette, is a CGI script that generates random web content containing made-up e-mail addresses; the idea is that spammers' web crawlers, which trawl web sites looking for e-mail addresses, stumble upon it and start adding these made-up e-mail addresses to their lists. This either forces the spammer to dump the entire list or manually clean it (if they notice the poisoning) or waste their resources trying to spam non-existent e-mail addresses (if they don't).

However, the original version suffers from a number of problems:

  1. The content generated is in a fixed format which is trivial to identify.
  2. The usage restrictions specify that all installations must have a link to the wpoison site on their homepage (although it is no longer accessible); this makes it trivial for a crawler to skip any sites that have this.
  3. It is possible to determine if the script is in use by requesting a made-up URL inside the wpoison 'site' - wpoison will return content for anything, and never return a 404 error.
  4. Requesting the same page several times results in totally different content each time, unlike a real site.
  5. The content generated is devoid of punctuation and other formatting, with a limited set of HTML markup, making it easier to identify as bogus.
  6. There is no limit to the amount of e-mail addresses the script will issue to a single crawler; a sudden large number of new e-mail addresses may arouse the spammers' suspicion early on.
  7. Also because there is no limit, badly-written but legitimate crawlers may get 'stuck' in the 'site', increasing the load on the server unnecessarily.

So, in light of this, I produced an amended version, wpoison-gt, which does not suffer from issues 1 and 5. Full changes:

However, the original author is no longer maintaining wpoison, and does not wish to incorporate my modifications, so I am distributing it here. This script is based on the original wpoison code, and is suitable for use as a drop-in replacement, so the original installation instructions should all still apply. Please note the original licence still stands (this release is not made under the GPL), including the requirement to link to the original wpoison site (it was here, for what it's worth, but is now dead).

I may eventually update this version to address some of the remaining issues, as time/interest permits (but don't hold your breath!).

You can see this version of the script in action - the output has a pleasing Vogon poetry feel to it.

wpoison-gt-0.1.tar.gz (v0.1, 26 May 2007, 5.2KB)
Valid XHTML 1.0!
Valid CSS!
Non-humans only