Forum Moderators: phranque
I am having same domain name and same content for URL's starting with www, www1, www2, www3 and sites are live.
Example: I have a domain's www.mydomainname.com ,www1.mydomainname.com , www2.mydomainname.com and www3.mydomainname.com with same content in all the domains.
Is this treated as spam? Will it effect SERP for mydomainname.com ..
Furthermore, today i run a command for 'site:www2.domain-name.com' on Google, many pages had been indexed by it yet. Our mainsite will be penalized by google?
Looking for your valuable suggestions.
Thanks in advance,
looka
The web servers www1, www2 and www3 are destinated to serve the same content. That said, It's planned on sharing the load between www1, www2 and www3 through www.mydomainname.com
I set up those sites for sales tracking. And any sites are 'opening' when user pick any one.
Any ideas? Does www.mydomainame.com will be penalized by Google? TIA
To protect your main site, you could use robots.txt to block spiders from your duplicate sites.
In my view, from what you are saying, these would not be mirrors, but clones; Google, in particular, does not like duplication.
There are exceptions, but I would be very surprised if your scheme was one of them.
Thanks for the reply.
Just a quick note to let you know the www1, www2 and www3 are a set of available mirrors of mydomainname.com, not clone sites.
Now the serious problem is several mirrors(like www2.mydomainname.com), google has indexed 90 pages. I just don't know those mirrors whether effect SERP for www.mydomainname.com? And how to use robots.txt to block spiders from my mirror sites?
Looka
As I said above: " There are better, safer ways to do your tracking than setting up duplicate sites, which will very likely cause you problems. To protect your main site, you could use robots.txt to block spiders from your duplicate sites."
When Google finds duplicates, after a while, all except one item will be dropped or become a 'supplementary result.
You have no control on which one will survive; could be the newest, oldest, blue-est, most linked-to, least linked-to ... no way to predict or control this.
So if you want one to survive, protect it by blocking the others.
I really don't know whether my advice is of any use to you. Try to find the robots.txt generator online programs. There are many on the net but I don't konw which one is the best. After generating the robots.txt from such application put this file in your root directory on the server. This will proabably solve your problem.
Mandar
<snip>
[edited by: engine at 8:30 am (utc) on June 29, 2006]
[edit reason] No sigs See TOS [webmasterworld.com] [/edit]
To exclude the mirrors, in the root directory of each of the subdomains, in the same folder where the main index page of each of the subdomains sits, upload a text file called robots.txt with this entry to keep all robots out of the whole subdomain:
User-agent: *
Disallow: /
Make sure the file extension is .txt and don't link to those subdomains from anyplace.
Thank you.
//google does not treat that as spam at all, even if you're hardcore spamming the serps with wildcard subdomains. at least google didn't treat that as spam for the last years and i am quite shure they won't treat it as spam in the future because their system can not handle that.//