Forum Moderators: Robert Charlton & goodroi
I now have all the urls of my site indexed within Google, however after doing a site search I have noticed that our affiliate tracking urls are being indexed.
The affiliate is pointing directly to my homepage, therefore in Google I have indexed both
www.example.com
and
www.example.com/CAMPAIGN=widgets&KEYWORDS=widgets_text
Both these pages contain the exact same content. My question is will this have a detrimental effect on my homepage rankings due to spamming, and if so how do I remove the affiliate url from Google?
[edited by: pageoneresults at 1:46 pm (utc) on Nov. 12, 2007]
[edit reason] Examplified URI References [/edit]
Some people write a 301 redirect rule that removes the tracking information from the url, but they first need to be sure their analytics still picks up the original url requested for tracking purposes. The actual technical details are up to you, but definitely keep those urls out of the Google index.
To block Googlebot from block crawling any URL that includes a? (more specifically, any URL that begins with your domain name, followed by any string, followed by a question mark, followed by any string):
User-agent: GooglebotDisallow: /*?
[google.com...]
The robots.txt standard has not officially added wild card pattern matching, but Google and Yahoo do follow it.