Forum Moderators: open
mypage.cgi - won't get crawled.
mypage.cgi?a=1&b=2 will get crawled.
I did a quick search of Google and couldn't find any straightforward .cgi pages in the SERPs. However pages that have extensions like: pl, asp and php are happily crawled if they don't have parameters.
Is this my imagination? Any reason for it?
Dynamic pages are crawled more slowly.
As a real example, why is:
boardpower.cgi not crawled, when:
boardpower.cgi?cookie=logout is?
The second one looks far more 'dodgy' than the first.
I've checked through my logs for the last couple of weeks and seen the same pattern throughout, so I don't think it's a case of sitting tight and waiting for GoogleBot to come back tomorrow.
Googlebot generally picks up a couple of hundred pages quite happily from my site each day, 99% of which have parameters, and yes, I'm quite glad Googlebot doesn't hammer the server to get them.
I guess I'm the only one seeing this. *shrug*
My point was this: CGIs are normally meant for parameters. Standalone ones look as if the parameters were forgotten, and Googlebot is always cautious about that (can you append some empty parameters?)
Google once crushed an online game session just by calling some CGIs (if I remember this story correctly).
Personally, I tend to write my CGI scripts so the 'meaty' bit (for want of a better word) will be displayed when calling the script itself (ie, .cgi without parameters), then any functionality is achieved by using parameters (ie, .cgi?action=dofunkystuff). I'd imagine a lot of other script writers do the same. I find it hard to believe Google intentionally ignore .cgi when they wouldn't ignore .pl, which could probably be the exact same script.
Adding on bogus parameters may work, but it's a bit of an ugly looking bodge. I can't help but think Googlebot just doesn't crawl dynamic pages quite aswell as I originally thought. :o/
That's a fact. See GoogleGuy's post.
[webmasterworld.com...]
They do what they want and when they want.
Woried about duplicate content or dodgy redirects, well no fear.
Search for caskets:
[google.com...]
The top result is no more than a 302 redirect to another site. It does have a listing from DMOZ so it must be right, even if its a link to a redirect. They are however good AdWord spenders, that must make it right.