I want to block URLs like these:
http://example.com/blog/page/6/?theme=xyz
http://example.com/blog/this-is-post/?theme=xyz
but I want to keep these URLs:
http://example.com/blog/page/6/
http://example.com/blog/this-is-post/
So basically I just want to block all pages whose URLs end with "?theme=xyz", as this is causing unnecessary content duplication.
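One possible approach, assuming the crawler supports the `*` wildcard and the `$` end anchor (Googlebot does, and both are described in RFC 9309), is a rule such as `Disallow: /*?theme=xyz$`. That rule is a suggestion, not something confirmed in this thread. The Python sketch below only imitates the matching so the pattern can be checked against the four URLs above; `rule_matches` is a hypothetical helper, not part of any library.

```python
import re
from urllib.parse import urlsplit

def rule_matches(pattern: str, url: str) -> bool:
    """Imitate robots.txt path matching with '*' and a trailing '$'."""
    parts = urlsplit(url)
    # robots.txt rules are compared against the path plus the query string.
    target = parts.path + ("?" + parts.query if parts.query else "")
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then let every '*' stand for any characters.
    regex = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start, like robots.txt prefix matching.
    return re.match(regex, target) is not None

RULE = "/*?theme=xyz$"  # assumed rule, not taken from the thread

for url in [
    "http://example.com/blog/page/6/?theme=xyz",
    "http://example.com/blog/this-is-post/?theme=xyz",
    "http://example.com/blog/page/6/",
    "http://example.com/blog/this-is-post/",
]:
    print(url, "blocked" if rule_matches(RULE, url) else "allowed")
```

Dropping the `$` would also block URLs where further parameters follow `theme=xyz`, which may or may not be wanted.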
/comments.php?id=3740&ocid=30562&replyid=0&catid=1 [remove]
/comments.php?id=3740&replyid=30574&catid=1 [remove]
/comments.php?shownews=3740 [OK]
I want to remove the first two URLs; the third one is my primary link.
I think I must use this code:
User-agent: googlebot
Disallow: /*id
Disallow: /*replyid
Is that correct?
Do you know if the parameters ever appear in a different order?
Do you know if URLs with shownews ever have additional parameters, and if they do, whether you would still want to leave those unblocked?
Otherwise, I would do:
User-agent: *
Disallow: /comments.php?id=
Disallow: /comments.php?action=
Disallow: /comments.php?highlight=
Disallow: /comments.php?ocid=
Disallow: /comments.php?replyid=
Disallow: /comments.php?catid=
Maybe others too?
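To sanity-check the rules above against the three example URLs, here is a minimal Python sketch. It assumes plain prefix matching, which is how a Disallow line without wildcards is normally interpreted; `is_blocked` is a hypothetical helper written for this illustration only.

```python
# The Disallow prefixes suggested above.
RULES = [
    "/comments.php?id=",
    "/comments.php?action=",
    "/comments.php?highlight=",
    "/comments.php?ocid=",
    "/comments.php?replyid=",
    "/comments.php?catid=",
]

def is_blocked(path_and_query: str) -> bool:
    """True if the URL (path plus query) starts with any Disallow prefix."""
    return any(path_and_query.startswith(rule) for rule in RULES)

for url in [
    "/comments.php?id=3740&ocid=30562&replyid=0&catid=1",
    "/comments.php?id=3740&replyid=30574&catid=1",
    "/comments.php?shownews=3740",
]:
    print(url, "blocked" if is_blocked(url) else "allowed")
```

Both unwanted URLs are caught by the `?id=` prefix alone; the other lines only matter when a different parameter comes first in the query string.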
You need to be aware of every possible format that could be requested.
You also need to be aware that the same parameters in a different order give you a duplicate URL for the same content.
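The parameter-order point can be illustrated with a short Python sketch that sorts the query parameters so two orderings compare equal. The reordered URL `b` is a made-up example, and `canonical_key` is a hypothetical helper, not something your site or any crawler actually runs.

```python
from urllib.parse import urlsplit, parse_qsl, urlencode

def canonical_key(url: str) -> str:
    """Return path plus query with the parameters sorted by name and value."""
    parts = urlsplit(url)
    params = sorted(parse_qsl(parts.query, keep_blank_values=True))
    return parts.path + "?" + urlencode(params)

a = "/comments.php?id=3740&replyid=30574&catid=1"
b = "/comments.php?catid=1&id=3740&replyid=30574"  # hypothetical reordering

print(a == b)                                # the raw strings differ
print(canonical_key(a) == canonical_key(b))  # same content once sorted
```

A crawler sees `a` and `b` as two separate URLs, and `b` no longer starts with `/comments.php?id=`, which is why each parameter that can come first needs its own Disallow line.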