domain.com/dir/search.cgi?color=blue&size=small
Could I just use the following robots.txt in the root directory for the site:
User-agent: *
Disallow: /dir/search.cgi
Would that still let the spiders crawl all over except anything with search.cgi? This wouldn't keep it out of the /dir directory would it?
> User-agent: *
> Disallow: /dir/search.cgi
Yes.
> Would that still let the spiders crawl all over except anything with search.cgi?
That would still let the spiders crawl everything except URLs whose path starts with "/dir/search.cgi".
> This wouldn't keep it out of the /dir directory would it?
No.
The technical term for what robots do is "prefix-matching." The Disallow directive applies to any resource whose URL path begins with the given string. So your Disallow applies only to resources whose paths start with /dir/search.cgi, possibly followed by more characters (such as a query string), but never fewer. Everything else under /dir/ stays crawlable.
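
To make the prefix-matching idea concrete, here is a minimal sketch in Python. It is only an illustration, not how any particular crawler is implemented: the function name is_disallowed and the sample paths are made up for this example, and it ignores user-agent groups, Allow lines, and wildcard extensions.

```python
def is_disallowed(url_path, disallow_prefixes):
    """Return True if url_path begins with any of the Disallow prefixes.

    Hypothetical helper for illustration; a simplified model of
    robots.txt prefix-matching, not a full parser.
    """
    for prefix in disallow_prefixes:
        # An empty Disallow value blocks nothing, so skip it.
        if prefix != "" and url_path.startswith(prefix):
            return True
    return False


# The single rule from the robots.txt in this thread.
rules = ["/dir/search.cgi"]

# Sample paths (assumed for the example), including the query string.
samples = [
    "/dir/search.cgi?color=blue&size=small",
    "/dir/search.cgi",
    "/dir/",
    "/dir/page.html",
    "/dir/search.cg",
    "/other/search.cgi",
]

for path in samples:
    status = "blocked" if is_disallowed(path, rules) else "allowed"
    print(f"{status:8} {path}")
```

Only the first two sample paths are blocked, because only they begin with the full string /dir/search.cgi. The rest of /dir/, and a search.cgi in any other directory, remain open to spiders.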
Jim