Robots.txt

Disallowed pages listed

unperturbed

2:09 am on Mar 20, 2005 (gmt 0)

10+ Year Member



Hi,

It’s quite strange. I’ve got this as my robots.txt:

User-agent: *
Disallow: /go/

When I search for allinurl:www.example.co.uk on Google, it lists a few URLs like www.example.co.uk/go/124. There are over a hundred similar URLs, but only a few are listed. Looking at my logs, Googlebot has obeyed robots.txt, as I can’t see any visits by Google to those pages.

How have they got into the index? Have I messed up the robots file?

Cheers

esllou

2:58 am on Mar 20, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



G started "listing" pages they weren't allowed to visit at some point in the recent past. No descriptions, just the url.

There was a major hoo-ha on these forums when they started doing it...

Lord Majestic

3:20 am on Mar 20, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Your robots.txt is fine.

G started "listing" pages they weren't allowed to visit at some point in the recent past. No descriptions, just the url.

Technically speaking, they don't list (or have) the pages in the index, just URLs pointing to those pages. These URLs can be found on pages that link to your website, and robots.txt does not control that process, so from a robots.txt compliance point of view it is all perfectly legit.
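
If you want to double-check the rules yourself, here is a minimal sketch using Python's standard urllib.robotparser. The helper name is_crawl_allowed and the /index.html path are just assumptions for illustration; the rules and the /go/124 URL are the ones posted above. It only tells you whether a compliant crawler may fetch a URL, not whether the bare URL can show up in an index.

```python
from urllib.robotparser import RobotFileParser

# The robots.txt rules exactly as posted in this thread.
ROBOTS_TXT = """\
User-agent: *
Disallow: /go/
"""


def is_crawl_allowed(robots_txt, user_agent, url):
    """Return True if these rules let user_agent fetch url.

    Hypothetical helper for illustration; it parses the rules from a
    string instead of downloading them from a live site.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in (
        "http://www.example.co.uk/go/124",     # URL from the original post
        "http://www.example.co.uk/index.html",  # assumed unblocked page
    ):
        print(url, is_crawl_allowed(ROBOTS_TXT, "Googlebot", url))
```

The /go/ URL should come back as not fetchable, which matches the logs showing no Googlebot visits to those pages.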

If you don't want a search engine to have the pages or URLs in its index, then I think you might have to use the robots META tag with noindex, nofollow. Of course, if your robots.txt prevents the search engine from fetching the actual pages, it can never see the META tag, so it would not have a clue and would assume that it's okay to do what it does.
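
For reference, the tag I mean looks like the one in the sample page below. This is a rough sketch, assuming a made-up page and hypothetical names (RobotsMetaParser, robots_directives), of how a crawler could read the directives once it is actually allowed to fetch the page. If robots.txt blocks the fetch, this step never happens.

```python
from html.parser import HTMLParser

# Assumed sample page; the markup is illustrative, not taken from the real site.
PAGE = """<html>
<head>
<title>Redirect</title>
<meta name="robots" content="noindex, nofollow">
</head>
<body></body>
</html>"""


class RobotsMetaParser(HTMLParser):
    """Collects the directives from every <meta name="robots"> tag."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() != "robots":
            return
        content = attr.get("content") or ""
        # Directives are comma separated, e.g. "noindex, nofollow".
        for part in content.split(","):
            part = part.strip().lower()
            if part:
                self.directives.add(part)


def robots_directives(html):
    """Return the set of robots META directives found in an HTML string."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


if __name__ == "__main__":
    print(sorted(robots_directives(PAGE)))
```

So the two mechanisms pull against each other: the META tag only works on pages the crawler is permitted to read.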

unperturbed

7:28 pm on Mar 20, 2005 (gmt 0)

10+ Year Member



Thanks guys.