Forum Moderators: DixonJones
The UA is made up of random characters and spaces so it's difficult to block.
The page it took is an event calendar and has email addresses of the artists (100+) in charge of individual events. It's a pretty sweet cherry to pick, but my client wants them to be functional...
What methods do you use to have functional email addresses that won't get harvested? I'd hate to get blamed/be responsible for all these people getting spammed.
Thanks,
Adam
It looks like they're using Google searches (or another search engine that uses Google's database) to find pages that contain email addresses. And possibly web forums as well--I don't run any forums so I can't verify this... That's definitely how they're finding the page on my site that has email addresses, as the URLs they're using are ones that were originally grabbed by a Google crawler. (Ah, the joys of completely traceable URLs.)
Sometimes they use a User-Agent string of "
Mozilla/4.0 (compatible; MSIE 6.0; Windows 98)"; different versions of the same software, I'd guess. I also agree with the theories that it's related to the DSurf stuff. Lately I've seen many more instances of the Mozilla UA string--I've only had a couple of visits with the totally random UA string in the last two months, possibly because I've been completely blocking them.
They never use a valid
Referer:and the rest of the typical headers are bogus or missing as well, so if you feel a need to filter them out it's still pretty easy to do. Though I'm sure eventually they'll get smart enough to do all this stuff correctly...