block cgi bin on a virtual host

cgi bin is not in root directory

         

Reid

10:42 am on Apr 15, 2005 (gmt 0)


OK, I am on a virtual host and my root directory is called httpdocs.

cgi-bin sits alongside httpdocs in the same parent directory. (I have limited access at that level: full access only to httpdocs, errordocs, and cgi-bin, through a form interface, etc.) So by all accounts httpdocs is my root directory, and robots.txt works fine there.

robots.txt is in httpdocs.

I want to stop Google (or anyone else) from indexing my tracking URLs:

w*w.mydomain.com/cgi-bin/click.cgi?id=x

If I use
Disallow: /cgi-bin/

would this do the trick? Or should I go with

Disallow: /cgi-bin/click.cgi?*

Currently there is nothing else in my cgi-bin. I may use it in the future, although I probably won't want anything in there crawled anyway.

mack

4:51 am on Apr 16, 2005 (gmt 0)


You could leave both in as a safety net, although the first version...
Disallow: /cgi-bin/
would work fine to stop it crawling anything in the cgi-bin.

Some URLs may still show up in results pages, though. Google has a bad habit of listing pages it knows exist but hasn't been allowed to crawl. In that case it will use any anchor text from a link as the title in the results page and show no description.
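
For anyone who wants to sanity-check the prefix rule before uploading it, here is a minimal sketch using Python's standard urllib.robotparser module. It assumes the robots.txt in httpdocs contains a "User-agent: *" line above the Disallow line (the thread only shows the Disallow line), and it uses www.mydomain.com as the placeholder host from the question. The helper name is_allowed is made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed contents of httpdocs/robots.txt; the "User-agent: *" line is an
# assumption, since the thread only quotes the Disallow line.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
"""


def is_allowed(robots_txt: str, url: str, agent: str = "Googlebot") -> bool:
    """Return True if `agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    # The tracking URL falls under the /cgi-bin/ prefix, so it is blocked.
    print(is_allowed(ROBOTS_TXT, "http://www.mydomain.com/cgi-bin/click.cgi?id=1"))  # False
    # Pages outside /cgi-bin/ are unaffected.
    print(is_allowed(ROBOTS_TXT, "http://www.mydomain.com/index.html"))  # True
```

This only checks crawl permission under simple prefix matching. It says nothing about whether a blocked URL still gets listed from anchor text, and the standard-library parser may not treat the * wildcard in the second rule the way Google does.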

Mack.

Reid

6:26 am on Apr 16, 2005 (gmt 0)


Thanks, mack.

You know, I should have blocked it right away, but better late than never, I guess.