Am I wrong to assume that any and all weird behavior can be blamed on a robot that's up to no good? If I am wrong, someone will kick this question to the right place.
Question: What kind of robot would attempt to crawl a mailto link? The answers that come to mind are "A very stupid one" and "One that doesn't speak English" (the unknown visitor is from Romania) but possibly I have overlooked something. I don't mean read an html page and trawl for addresses, I mean "GET", just like you'd get a page or image.
My site is 95% personal and 5% very, very, very specialized, so these are genuinely consecutive lines from the log, at five-second intervals. (The first visited page is one of the few that gets bona fide visits from real humans, though there's usually a referrer.) The parts I've shown as {one} and {two} are two different pairs of numbers.
79.112.{one} - - [09/Apr/2011:05:44:30 -0700] "GET /games/ HTTP/1.0" 200 11167 "-" "Mozilla/5.0 (Windows; U; Windows NT 6.0; en-US; rv:1.9b5) Gecko/2008032620 Firefox/3.0b5"
79.112.{two} - - [09/Apr/2011:05:44:35 -0700] "GET /games/../index.html HTTP/1.0" 200 3833 "-" "{same}"
79.112.{one} - - [09/Apr/2011:05:44:40 -0700] "GET /games/../mailto:webmaster@{mydomain}.com HTTP/1.0" 404 1496 "-" "{same}"
The /../ elements are exactly as in the raw logs. I had to try it in a browser to verify that a human would get the same responses: item 2 gets the main Index page (but how did they know it's called index.html? did they deduce it from the name of the actual page in item 1?), item 3 gets the 404 message.
The user-agent is "not a known robot", but robots can hide behind anything can't they? I tend to associate HTTP/1.0 (as opposed to 1.1) with elderly robots, but then, Romania isn't really at the technological cutting edge.