Forum Moderators: open
I'm seeing query strings of this nature:
www.mysite.com/SlurpConfirm404/file/page.htm
I am Elvis Presley and I never really died at all. I AM telling the truth - I promise - cross my heart.
Granted the IP is hard to spoof but it's not impossible (the UA is similar to my statement above).I might send Y! an email and see what they say.
fish,
fiestagirl has provided some detailed insights in the past.
I would assume (on her behalf) that she has made this determination on both her log history and her sites content as well!
We have no reason to doubt what she presents.
Best of luck with Yahoo.
Don
best I can hope for from Y! is a cryptic or boilerplate response
IMO, that's the best we can hope for from any provider.
Automated responses and double-talk.
Cooperation and assitance to webmasters is rather rare.
I'm not sure how many mails I sent to providers to reports violations of their UAG's and including links to their UAG's in the process?
Only to be automated with a reporting procedure suggestion that I had already followed.
It's a WASTE of time.
Don
I have recently changed something that may have triggered this reaction in the bot. I used to mod_rewrite a number of old (html) pages to new (php) pages using a 301. I've recently switched some of these off (they'd been there about 6-12 months).
By default these will now be redirected to my 'directory' page with a 404 response.
Perhaps this is it - also "SlurpConfirm404" seems to tie in with a technique similar to what they're stating in the FAQ (above). Perhaps they should call it "Deliberate404" :)
Anyway, it's not killing my site with too many requests - I'm just heavily into bot monitoring and trapping IPs lately so it caught my eye.
Have you made similar changes recently fiestagirl?
The point of my first post was really that I appreciate knowing that the request is a 404 check and not a broken link put up by us or someone else.
I analyze every 404 and am happy to know that I can ignore those probes by Yahoo. Makes my life easier.
Just a tech-note, here:
It is possible to spoof an IP address, but if a client spoofs its IP address, then it will never receive a response from the server -- The server will send its response to the specified (spoofed) IP address -- which might be someone else's machine, or might not even exist.
So, a search-indexing robot, e-mail harvester, or site-scraper could conceivably spoof its IP, but it would never receive any of the content it was requesting, so that would be rather pointless.
Jim
Also, I got a response from Y!
I understand that you have a question about how our crawler Slurp checks
that a server returns a valid 404 error.Yes, this is most likely a request to check of your server returns valid
404s.
Signed by a real human and all!
I guess "most likely" is as good as a service desk person will be able to give.
It is possible to spoof an IP address, but if a client spoofs its IP address, then it will never receive a response from the server
You can spoof an IP address and receive a reply if you have access to a router in the packets' path. While not as many people have this kind of access, one should never assume it can't happen.