Forum Moderators: open

Message Too Old, No Replies

Slurp and PPC ads

It's been happening for a year, but nobody seems to notice ...

         

StupidScript

11:36 pm on Nov 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



For nearly a year now, Slurp has been hitting our server for many hours at a time at a rate of 2+ hits per second. Each hit is NOT a normal spider hit, as each log entry reflects the unique "?source=XXXX" attribute that identifies our PPC ads. "Normal" spider hits would not include this attribute.

Here is a sample "normal" Slurp log entry (broke the line for reading):

72.30.134.145 - - [14/Nov/2005:02:30:03 -0800] "http://www.example.com/somepage.html"

200 1745 "-" "Mozilla/5.0 (compatible; Yahoo! Slurp; http:// help.yahoo .com/help/us/ysearch/slurp)"

Here is a sample "PPC" Slurp log entry (broke the line for reading):

72.30.134.145 - - [14/Nov/2005:02:30:03 -0800] "http://www.example.com/somepage.html?source=YAH21034"

200 1745 "-" "Mozilla/5.0 (compatible; Yahoo! Slurp; http:// help.yahoo .com/help/us/ysearch/slurp)"

The "source" attribute allows us to track individual PPC term performance, and every one of our 125,000+ terms in all of our campaigns has its own. It is clear from the log entries that Slurp is trolling through all of our links in our PPC campaigns ... even the links that are no longer included, like to an older domain that hasn't been in our campaigns for months. Often Slurp will run through all of our terms in all of our campaigns several times before stopping. During some sessions, Slurp requests to send an email message to itself with every other request which has the effect of disrupting our mail service, too.

This hammering will usually go on for only 3 or 4 hours, but occasionally, like today, it goes on for 12 to 18 hours or longer. The effect is that of a mild denial of service attack, as it slows down our server tremendously and we lose a lot of visitors as a result.

Have any of you experienced this?

I have asked before, but nobody responded. It's tough to believe that we are the only ones who have experienced this. I'm posting this here instead of to the Yahoo Search forum because this specifically hits our PPC ads, so I'm hoping other PPC advertisers will be able to verify it. Certainly none of our efforts to get info from Yahoo have resulted in anything at all. I even talked to an API tech guy who had no idea what might be happening.

I ban the entire Class B block of Slurp IPs whenever I have the time, for a total of 14 bans so far, however it keeps coming back with a new IP block. Neither .htaccess nor robots.txt instructions keep the hits from draining our resources, so I usually use ipchains/iptables and so forth to drop the packets and their requests entirely.

jdMorgan

12:14 am on Nov 15, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I notice that the request for somepage.html?source=xyz returns a 200-OK status. Have you tried redirecting these requests to somepage.html without the search query?

This looks like regular-old slurp to me, and I'd be checking to see how my dynamic links got 'exposed' so that Slurp finds them and spiders them if this was my site.

Alternately, Denying /somepage.html in robots.txt should stop them - eventually. If a 'bot finds a link, and it's not denied, it will spider it. So where's Slurp finding your links?

Jim

StupidScript

12:50 am on Nov 15, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



The only place where all of our terms in all of our campaigns are available in this way are from within our campaigns.

This simply must be Slurp running through our campaigns repeatedly. When it visits, our logs are full of Slurp entries as illustrated ... 2+ hits per second, and every one of our source attributes is included ... I've checked, even with the terms that have never generated an impression.

At first we assumed it was checking the validity of the links in our campaigns, but it has gone on for far too long and requested far too many inactive domains for this to be normal activity.

We get no visits whatsoever from Slurp requesting a normal page without the source attribute.

Perhaps coincidentally, all of our domains were dropped from the algorithm listings on Jan. 1 2005 when Yahoo dropped the Inktomi paid inclusion customers (like we used to be) and started their own similar service. I don't mean domains that may have shared a link or two ... but ALL of the domains where we appear in the whois info, regardless of the IP block, hosting service, age or content.

The best response we can get from Y is to sign up for Site Match and see what happens ... uh ... yuh ... sure.

It's been a very long year ... and thanks very much for commenting ... it's a first!

StupidScript

12:49 am on Nov 22, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I'm amazed that there aren't more reports of this. On one day last week, Slurp hit us as described over 113,000 times in one 6-hour period ... and we have a total of around 180 pages in the domain directory it was hitting. That's around 628 hits per page at around 5 hits per second.

I'd think one or two hits spaced over a couple of seconds or so would do the job, don't you?

Nobody has seen this Slurp behavior in their logs?!?

I mean, if so, then it's clear only our server is affected ... at least out of all of Yahoo's advertisers that frequent these forums. That in itself is pretty amazing, and it gives us the basis for a consipracy theory or something ... I don't know what to think, anymore.

Yeah, it's weird, but are we really alone in this?

sdani

12:15 am on Nov 29, 2005 (gmt 0)

10+ Year Member



you are not alone,

we get several thousand hits every day from slurp for our overture ad URLs.

sdani

StupidScript

12:41 am on Nov 29, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Thank goodness! And thank YOU!

Any thoughts on what's going on with Slurp during this activity?

sdani

1:08 am on Nov 29, 2005 (gmt 0)

10+ Year Member



the results are definitely not showing up in yahoo results. So, not sure what's going on. but my landing pages are 100% unique for overture and noone can book mark those pages -> it works like this:
mydomain/redirect/tag/overture/target/landingpage

This page sends a 301 redirect for
mydomain/landingpage.

so, noone really sees that overture specific url. I am 100% certain that slurp is getting it from overture database.

and its not the verification agent. The verification process for new urls (ads) really hots the server VERY HARD (not 2 per second). About 20-30 requests or even more per second and that comes from some RPT-HTTPclient.

sdani