Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

Google Site Search and Organic show Disallow pages

         

Propools

9:07 pm on Nov 10, 2008 (gmt 0)

10+ Year Member



Our Robots.txt file contains in it, at least the following:
User-agent: *
Disallow: /additem.php*

We have Google Site Search on our site. I’ve seen on site search and Google site: search that when I search for a specific item number that it brings back:

www.oursite.com/additem.php?product=3005

This is not an expectant result, primarily because on our site search; it only shows that result.

I’ve always been under the impression that something like the “disallow” above would give ALL bots the directive to not index said page or anything prefaced with the /additem.php.

So, what to do? I’m primarily concerned about our Google site search and secondarily concerned about Google site: search. Lastly, I’m concerned if Google is just not following the rules, or do I understand them incorrectly?

tedster

9:45 pm on Nov 10, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



A robots.txt Disallow rule will keep the url from being spidered - and apparently it hasn't been spidered, or else you would see a title and description snippet of some kind. Instead you are seeing a "url-only" listing, which shows that Google is aware that url exists from links that point to it.

You can submit a url removal request through Webmaster Tools to make the url-only listing go away.

Propools

9:58 pm on Nov 10, 2008 (gmt 0)

10+ Year Member



Righto Tedster,
As soon as I read it I went "Dah Hah". So it looks like I need to have the nofollow and noindex code added to all shopping cart stuff, right?

BTW - Can you point me in the right direction for getting info on how to properly note that on the pages?

jimbeetle

10:05 pm on Nov 10, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



You want to use...

<meta name="robots" content="noindex, nofollow"

...or more simply "none" for the content attribute.

Also, be sure to remove the disallow directive from robots.txt as spiders have to be able to reach the page to read the robots meta.

Propools

10:12 pm on Nov 10, 2008 (gmt 0)

10+ Year Member



I was really looking for the code to put in the referring link.

tedster

11:41 pm on Nov 10, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



The only thing you can do in the referring link is add the rel="nofollow" attribute. If there are no other links anywhere that point to the url, and if the attribute is there from the beginning, then that would stop Google from using the link for "discovery"... but Google has already discovered the url, so you're out of luck there.

Propools

3:04 pm on Nov 11, 2008 (gmt 0)

10+ Year Member



I wonder if in using the Google Webmaster URL Removal Request tool if I were to use:
www.oursite.com/additem.php*
if that would be enough to instruct Google to remove all URLs which begin with "additem.php"?
I would think that using the * after the .php would be correct in doing this?