Welcome to WebmasterWorld Guest from

Forum Moderators: goodroi

Message Too Old, No Replies

How to block this URLs

6:46 am on Jul 4, 2008 (gmt 0)

New User

5+ Year Member

joined:Jan 15, 2008
posts: 4
votes: 0

Hi All,

i have around 1000+ pages indexed in google for my site. Some affiliate campaign also going on for my site. Few of affiliate URL indexed in google from this affiliate campaign.

I already implement this code in my robots.txt

Disallow: /?agent_camp=

but still this affiliate URL crawled.
Can anyone suggest how to stop crawler to crawl this type of URL.


[edited by: encyclo at 4:30 pm (utc) on July 5, 2008]
[edit reason] switched to example.com, fixed formatting [/edit]

1:59 pm on July 5, 2008 (gmt 0)

Senior Member

joined:Jan 27, 2003
votes: 0

Robots exclusion is prefix matching, and the major engine support wildcards, so you can use lines like the below to block your affiliate URLS:

User-agent: *
Disallow: /*?agent_add
Disallow: /*?utm_source

Some would advise specifying the user-agent rather than blocking all spiders, since some don't support wildcards. I don't bother since the smaller engines are not significant and I believe most of them will treat the asterisk literally in any case.

2:26 pm on July 11, 2008 (gmt 0)

Senior Member

WebmasterWorld Senior Member g1smd is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:July 3, 2002
votes: 0

Your rule does not block the URLs you want to block.

You need to "match all from the left" until you have specified enough to cover all the URLs that need to be blocked, without still matching any URLs that need to be indexed.

See also: [webmasterworld.com...] (half way down the page).


Join The Conversation

Moderators and Top Contributors

Hot Threads This Week

Featured Threads

Free SEO Tools

Hire Expert Members