Forum Moderators: open

Message Too Old, No Replies

Getting unspidered from Google

         

brucec

6:33 pm on Oct 6, 2004 (gmt 0)

10+ Year Member



I know this may sound crazy, but I now work for a high profile international bank who uses a public domain name to create web sites for its internal web clients and employees. I am not sure why they did not create an intranet web site, but nevertheless...

We actually do not want to be listed on Google.

After doing some looking around, we have not one meta tag on the home page, nor do we have anybody linking to us. I did a link check and nobody comes up.

Yet, Google listed us and the web site needs to be private. Does anybody know how to do this and how long it takes?

bcolflesh

6:35 pm on Oct 6, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



[google.com...]

hannamyluv

6:36 pm on Oct 6, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



You will need to create a robots.txt to ban all spider activity (See the robots.txt forum here for more info) This will keep the spiders out of the site.

But, if you have pages already in the site, you will have to request a remove, which is not as easy as it use to be (for very good reasons).

Here is a link to the information about how to remove your urls from Google.

[google.com...]

brucec

7:07 pm on Oct 6, 2004 (gmt 0)

10+ Year Member



Thanks to both of you.

I uploaded the robots.txt, but when I go the Automatic Removal Page which tells google to crawl your site without waiting to be spidered...

it gives the error message "The page you requested was not found."

Oh well. I will just wait to be spidered.

Teknorat

5:08 am on Oct 7, 2004 (gmt 0)

10+ Year Member



Yeah that page has not been found for quite a while now...

vincevincevince

8:51 am on Oct 7, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



You should only allow access from internal IPs. That would sort it at once. Depending on the server type, the method is different, but straightforward.

In this way, external persons, browsers or spiders, will not be able to access the site.