Forum Moderators: Robert Charlton & goodroi
The sites got indexed and started ranking for a few longer tail terms, then in January the entire domain fell out of the Google index, that is, there are no pages on the entire domain (all four sub-sites) in the Google index, so it appears that they're banned. Now I’ve got absolutely no idea why the domain/sites would be banned – I have talked to the people who developed the sites (it’s been completely created in-house although I myself was not involved in the creation) and have been told of everything that has happened to the site, I’ve looked all round the sites, and I really cannot find anything that would cause it to be banned.
So anyway, I have gone to Google’s reinclusion request form to look into submitting a reinclusion request for the site. However, to submit such a request, you MUST agree to a declaration that the site has been spamming (you must agree to: “I believe this site has violated Google's quality guidelines in the past.”)
So what do I do? I don’t want to sign a declaration that says we’ve been spammers, because it isn’t true. I’m quite happy for people at Google to look into why the domain is not indexed, as I think it’s clean, but I don’t want to be forced to ‘admit’ to being a spammer when that’s not the case.
Do you think that it is reasonable for Google to demand an admission of liability for them to look into what may be an error on their part?
================================
theBear (many thanks) offered the following possibilities (my responses in Capitals) :
Are there any inbound links to those four subdomains from pages that aren't on that domain?
YES, NOT THOUSANDS, BUT ENOUGH.
Is a common robots.txt file in use and is it by any chance blocking access by Google?
TESTED AND FINE.
Is Google still visiting the subdomains?
VERY SELDOM NOW, BUT YES & TO PARTICULAR PAGES.
Are the subdomains sharing a single IP address and are they interlinked?
THEY DO SHARE AN IP AND ARE NOW INTERLINKED VIA A ROOT INDEX PAGE, BUT WERE NOT INTERLINKED AT THE TIME OF EXCLUSION.
Are the subdomains making use of ODP data?
Do the subdoains make extensive use of rel="nofollow" which could look like link manipulation?
ALMOST NO USAGE, CERTAINLY NOT INAPPROPRIATE
What was on the domain prior to it being used?
NOTHING, THIS WAS A NEW DOMAIN
================================
I'd appreciate any further ideas.
Do you think that it is reasonable for Google to demand an admission of liability for them to look into what may be an error on their part?
I think its unlikely there was an error on Google's part ... but at the same time, I think it is rather draconian to require anyone to sign their name to that statement prior to determining it is true.
If you submit a reinclusion request and the powers that be at Google look at the site and then determine that you are indeed guilty of violating Google's quality guidelines ... then fair enough, sign the document and make your ammends.
This requirement is more than a bit high handed though. Lots of people make honest mistakes without knowing they have done anything wrong. In law, ignorance of the law is no excuse ... but at least a person is allowed their day in court before being found guilty. The last time I checked though, Google doesn't write laws, just "guidelines".
If this is really important to you, I would ask a lawyer what this admission could mean to your business down the road. Who knows how this could affect your business in ten year's time should Google change their guidelines (without your knowledge) and you once again find yourself on the outside looking in.
The only reason I bring this up is because I once found myself on the outside for reasons I truly didn't understand. Google tweaked their algo and suddenly my site was penalized severely. It was nowhere! The day before, my site was in the top 5 for most of my important keywords, then suddenly, it was nowhere to be found. Long story short, it seems my keyword density was too high for the new algo. I eventually figured it out myself, fixed the site and it bounced right back, but I nearly went bankrupt in the meantime.
Personally, I would never sign that document because I know (in my case) that it just isn't true regardless of how Google changes their guidelines or their algo. The problem is that what is acceptable today may not necessarily be acceptable tomorrow.
I suggest you get an "expert" to have a look at the sites before you make any mistakes you may regret down the road. None of us knows how Google may use that prior admission against you in the future should they arbitrarily change their algo or guidelines. The two are not mutally exclusive.
[edited by: Liane at 11:48 am (utc) on Mar. 28, 2007]
I'd probably go to either SEW or [groups.google.com...]
Since you're doing OK on Y! it doesn't sound like an indexing glitch, if it's the way the subs are setup that's causing the problem you may want to reevaluate that strategy...
[... We] recently launched four small sites around a topic. The first site launched in June, two in August, and one in October. They are created as subdomains of a new domain [...]
If I were Google, the question I would ask is this:
Do all four sites cover the same topic? Why are four sites necessary?
If all four are owned/operated by the same entity, and all four cover the same basic topic, then that --in Google's eyes-- is spamming the index with four sites when one would be sufficient to provide their users with information on that subject. Four sites takes four times the space in their search servers and four times as much work by Googlebot and the back-end semantic-processing, ranking, and filtering programs, and potentially, four times as much space on the first page of search results. For this reason, Google wants a compelling reason to list pages from four sites.
They would see this situation as an attempt to gain a 'bigger footprint' in search.
Taking off the devil costume and assuming that yours is at worst a marginal and not blatant case of identical/similar site topics, I'm surprised that they didn't just filter all but one site out. I will follow this thread with interest.
Jim
They have picked a particular exception to this: where a site has previously been in violation of their rules, you can do a check-a-box thing to get them to re-review it.
I suspect this is an automated method that looks at what they have recorded as problems for the site and sees if those items are still there.
I repeat - this process is specifically designed for when there have been problems on your site, and you have fixed them.
Your case? because you have not done anything that is against the rules, a re-inclusion request is not going to be useful to you - this, I think, is why they have that nasty statement in the reinclusion process, to tell people what it is there for.
What should you do?
Google groups, as mentioned above, would seem to be an obvious one.
I would also agree with someone else above and get someone independent to have a look through your sites - its possible that one of your team members is hiding nasty stuff on the company site - it does happen :(
If none of that helps, I'm thinking the 'subdomain spam' diagnosis sounds possible. You might want to review whether separate subdomains are really justified for these sites.
You could also ask the mods permission to put your domain names in this thread for members to have a peek at, but I don't think they normally allow that.
You know what would be nice for webmaster tools? A listing of the rules a site is in breach of. But I suppose the spammers would abuse that to run close to the wind :(
No, they cover four very distinct areas of a loose topic. Without posting the URLs it is hard to explain, but I think that the specific topic that each covers is valid and they hang off the same domain because they vaguely 'fit' together. Certainly, whilst the style of each is similar, the content and subjects are not shared.
@leadegroot: Who should I ask permission from?
It's a new domain and right from the beginning you're creating four sub-domains.
Unfortunately for you, Google probably doesn't manually review most sites prior to banning them, so it has to recognise patterns that spammers use. It identified a pattern, had no trust or faith that you weren't a spammer, and banned you.
Unfortunately for you, Google probably doesn't manually review most sites prior to banning them, so it has to recognise patterns that spammers use. It identified a pattern, had no trust or faith that you weren't a spammer, and banned you.
I think they figure if we ban you and "you" say nothing, well, your guilty since you are not fighting the ban. That's why they don't manually review a site before banning. (also saves manpower).
Even if you'd get the subdomain ban lifted this time, if there's nothing on the main domain, a second and a third wave of penalties is almost guaranteed. I wouldn't bet on that the Google employee reviewing the site would be able to permanently flag it as "invulnerable to future subdomain related penalties because it's actually a great site, really".
I've had some fun of Google not being able to remember its own decisions on bans and un-bans. Had to go through the same procedure on an annual basis, mailing them back their own decisions of the same issue every twelve months. Not sure if I'd advise this route.
I'd say get the four sites to look like four sites on the domain level, and forget all your troubles. Including the reinclusion request.
[edited to say] ...what Bennie said.
[edited by: Miamacs at 10:09 pm (utc) on Mar. 29, 2007]