Welcome to WebmasterWorld Guest from 188.8.131.52 , register , free tools , login , search , pro membership , help , library , announcements , recent posts , open posts Pubcon Platinum Sponsor 2014
Disallow: /*? - is it ok ? nex99
i wants to block all affiliate links, so i create my robots.txt as...
is it ok for google and other search engines ? if not, please guide me the correct one.
This will block all urls with the ? in the url. Is that what you want to do?
I have found this to be helpful:
...] google.com nex99
Yes. and thanks for the link. jimbeetle
Just be sure you know exactly what you want to do here. Robots.txt is used to block spiders from URLs on your site. They will still follow outgoing links unless the pages on your site on which the links appear are blocked. Receptional Andy
I've had no issues with that syntax. Note that the safest approach is to only disallow those bots known to understand the wildcard, which I believe is googlebot, msnbot and slurp of the majors.
Unfortunately, the robots standard has developed into not much of a standard at all. I believe the below
User-agent: googlebot User-agent: slurp User-agent: msnbot Disallow: /*? g1smd
Don't forget that you need at least one blank line after the last record too.
User-agent: googlebot User-agent: slurp User-agent: msnbot Disallow: /*?