Msg#: 24074 posted 11:39 pm on May 23, 2004 (gmt 0)
I have been experiencing this for a while... Pages excluded by robots.txt have been constantly appearing in SERPs as url-only listings and can be found by searches matching parts of the URL string.
I've noticed that just after a number of my new sites were crawled by G, many have been hit with an attempted php hack.
My logs tell me that the hacker entered the site via urls that I was able to find in the Google index with a search for part of the query string - for example: "/email.php?page" (undoubtedly used by the hackers to identify my sites as potential targets).
I understand that these pages have not themselves been crawled, but isn't it about time G got it right and not list url's of excluded resources?
Is there anything I can do in future to stop these results from appearing in the SERPs?
[edited by: Marcia at 2:38 am (utc) on May 24, 2004]