Forum Moderators: open
Those URL's with the SID's have now got into googles new index on -fi. I was hoping that it would clear. It means that freshbot is now coming to the site and following 36,000 URL's all of which point to about 20 pages because of the SID's on the end!
Will google work this out at some point or is some manual intervention required do you think?
TJ
You could program the site so that it performs some checks before sending out a page, e.g.
A) recognises Googlebot as the useragent
B) checks to see if Googlebot has requested a URL with?sid= at the end of it
C) Issue a 301 redirect to the page WITHOUT the?sid
This will then tell Googlebot that the page it originally requested has been moved permenantly. This should then have the effect of removing that request from the index.
Of course, the other way you could clear out the index is to remove your whole site from the index by exclusion of googlebot by robots.txt, but this isn't generally a good idea ;)
JP