My Question:
Apart from the index page, I only host dynamic scripts on my www2 & www3 servers. All the content is hosted only on the www server. So, to avoid duplication, if I DISALLOW * on my www2 & www3 servers, would that affect Google's crawling of my www server?
I checked Google's robots.txt on the www & www2 servers, and they are exactly the same. Not sure why other search engines don't bar them for duplicate pages.
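For reference, a robots.txt file applies only to the host it is served from, so rules on www2 are evaluated separately from rules on www. A minimal sketch of that behaviour using Python's standard `urllib.robotparser`, assuming "DISALLOW *" means `Disallow: /` under `User-agent: *`; the `example.com` hostnames and the helper names are placeholders, and this models a generic rule-following crawler, not Google's actual implementation:

```python
from urllib.robotparser import RobotFileParser

def parser_for(lines):
    """Build a parser from robots.txt lines without any network access."""
    rp = RobotFileParser()
    rp.parse(lines)
    return rp

# Hypothetical robots.txt contents, one file per host.
robots_by_host = {
    "www.example.com": parser_for(["User-agent: *", "Disallow:"]),    # allow everything
    "www2.example.com": parser_for(["User-agent: *", "Disallow: /"]),  # block everything
}

def allowed(host, path, agent="Googlebot"):
    # Each host is checked against its own robots.txt only.
    return robots_by_host[host].can_fetch(agent, f"http://{host}{path}")

print(allowed("www.example.com", "/page.html"))
print(allowed("www2.example.com", "/page.html"))
```

In this sketch the block on www2 has no effect on the answer for www, because the two files are never consulted together.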
<META NAME="robots" CONTENT="noindex">
or possibly
<META HTTP-EQUIV="refresh" CONTENT="0;URL=http://www.yourdomain.com/">
This sort of redirect is supposedly understood by Google. Haven't tried it myself. I don't think it's appropriate in your case, but it's your call.
Kaled.
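To check what the two tags above actually say to a program reading the page, here is a small sketch with Python's standard `html.parser`. The class and function names are made up for illustration, and it only extracts the directives; how any particular search engine acts on them is not something this code can show:

```python
from html.parser import HTMLParser

class MetaDirectives(HTMLParser):
    """Collect robots and refresh <meta> tags from an HTML page."""

    def __init__(self):
        super().__init__()
        self.noindex = False
        self.refresh_url = None

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag != "meta":
            return
        a = {k: (v or "") for k, v in attrs}
        content = a.get("content", "")
        if a.get("name", "").lower() == "robots":
            tokens = [t.strip().lower() for t in content.split(",")]
            self.noindex = self.noindex or "noindex" in tokens
        elif a.get("http-equiv", "").lower() == "refresh":
            # Content looks like "0;URL=http://..."; keep the part after "URL=".
            _, _, target = content.partition(";")
            key, _, url = target.strip().partition("=")
            if key.strip().lower() == "url":
                self.refresh_url = url.strip()

def directives(html):
    """Return (noindex flag, refresh target or None) for an HTML string."""
    parser = MetaDirectives()
    parser.feed(html)
    return parser.noindex, parser.refresh_url

print(directives('<META NAME="robots" CONTENT="noindex">'))
print(directives('<META HTTP-EQUIV="refresh" CONTENT="0;URL=http://www.yourdomain.com/">'))
```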
This is an excellent suggestion to just put noindex on the other two index pages. But:
1) I have the "NoIndex" option on my privacy policy page, but Google still lists that page in site:mysite.com, without title and description. So maybe by the time the crawler knows not to index this page, it has already crawled it.
2) You mentioned "Alternatively", so do you think my proposal of adding Disallow * on www2 & www3 servers would also work without causing any harm to www site?
Thanks
SDani
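One interaction between the two options is worth spelling out: a crawler that obeys robots.txt never fetches a disallowed page, so it cannot read a noindex tag inside it. The sketch below models only that logic with `urllib.robotparser`; the function name and the `example.com` URL are hypothetical, and whether this explains the privacy policy listing in point 1 is not established in this thread:

```python
from urllib.robotparser import RobotFileParser

def noindex_is_visible(robots_lines, url, page_has_noindex, agent="Googlebot"):
    """True only if a robots.txt-obeying crawler may fetch the page
    and the page carries a noindex tag for it to read."""
    rp = RobotFileParser()
    rp.parse(robots_lines)
    return rp.can_fetch(agent, url) and page_has_noindex

url = "http://www2.example.com/"  # placeholder host

# Everything disallowed: the noindex tag is never fetched, so never seen.
print(noindex_is_visible(["User-agent: *", "Disallow: /"], url, True))

# Nothing disallowed: the crawler can fetch the page and read the tag.
print(noindex_is_visible(["User-agent: *", "Disallow:"], url, True))
```

So, under these assumptions, combining a full Disallow with noindex on the same pages would leave the noindex unread.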
Thinking about it more, perhaps the redirect I mentioned might be appropriate.
I'm certainly not qualified to give definitive advice. With anything you try, you'll still need to cross your fingers and hope.
Kaled.