How to determine if a site on Google has been penalized

well established sites missing from Google

Panacea

11:49 pm on Apr 6, 2005 (gmt 0)



How do you determine if a site on Google has been penalized?

I have 4 unrelated sites that all fell off Google at the same time back in January. These sites are now all in the supplemental index, meaning that they get no traffic. The only way to find them is to search by domain name, and this just displays the domain name without a title or description. Here is a typical case from one of my sites:

If I search for: ‘www. mysite .com’
I see just the listing for my URL, with no title or description.

If I search for: ‘site:www. mysite .com’
My index page is missing. 50% of the pages from my site show a title and description, and the other 50% just show their URL. If I click "Cached" on one of the pages with a title and description, the result says: ‘did not match any documents’.

If I search for: ‘allinurl: www. mysite .com’
I see no index page, but I do see 2 pages from my site. The cached copy shows that these pages were retrieved on Mar 20, 2005.

Some facts about my sites:

1. All 4 sites disappeared from Google at the same time back in January.
2. All 4 sites use shared hosting with a big (cheap) domain registrar with a silly name! (I am sure you can guess who I mean.)
3. All 4 sites shared the same name server IP address block.
4. All 4 sites are still spidered by Google, but very infrequently now.
5. My hosting company admitted that they had a domain name server problem in January. However, they were not very specific about the details.
6. All 4 sites are PR5 or higher, are about 3 years old, and were very well positioned for their relevant keywords.

Questions:
Have my sites been penalized for some reason? The first time I emailed Google, their response indicated that there had been a spidering problem. Basically they said the Google index contains two types of pages, fully indexed and partially indexed, and that my page is currently partially indexed. Because their robots were unable to completely review its content during our last crawl, my site appears without a cached copy or detailed title. Instead, it's listed by its URL.

Six weeks later I contacted Google a second time and got a more cryptic response. They said that they are unable to send personal responses to all of the requests they receive to review individual website content. They said that websites can fall out of their index for many reasons, including penalization. Certain actions, such as buying or selling links to increase a site's PageRank value, or cloaking (writing text in such a way that it can be seen by search engines but not by users), can result in penalization.

My sites comply 100% with Google's guidelines. I have not engaged in buying links or cloaking. All my link exchanges are with appropriate sites on similar themes to my own, and in fact I have not exchanged links with anyone for at least six months.

Any guesses, advice, or opinions you could share would be most appreciated.

Panacea

4:59 am on Apr 8, 2005 (gmt 0)



Anyone have any ideas?

Bobby

6:40 am on Apr 8, 2005 (gmt 0)

10+ Year Member



Because their robots were unable to completely review its content during our last crawl, my site appears without a cached copy or detailed title

I'd try to determine why they were unable to spider your site.

Check to make sure you don't have a robots.txt file excluding the spider.

Was the server down at the time the spider came by?

Try pressing your hosting company to get more information on exactly what their domain name server problem was and if anyone else hosted on their servers has experienced the same problem.
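For the robots.txt check, here is a minimal sketch using Python's standard `urllib.robotparser`. The two robots.txt texts and the example.com URL are made up for illustration; paste in the contents of your own file to test it. It only tells you what the file permits, not whether the server was reachable when the spider came by.

```python
from urllib.robotparser import RobotFileParser


def googlebot_allowed(robots_txt: str, url: str, agent: str = "Googlebot") -> bool:
    """Return True if the given robots.txt text lets `agent` fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


# Hypothetical robots.txt that shuts Googlebot out of the whole site.
BLOCKING = """\
User-agent: Googlebot
Disallow: /
"""

# Hypothetical robots.txt that blocks nothing for any crawler.
OPEN = """\
User-agent: *
Disallow:
"""

# example.com is a placeholder, not the poster's domain.
print(googlebot_allowed(BLOCKING, "http://www.example.com/index.html"))
print(googlebot_allowed(OPEN, "http://www.example.com/index.html"))
```

If the first kind of rule turns up in your live file, that alone would explain URL-only listings.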

glengara

8:04 am on Apr 8, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



If your hosting company is GD, has there not been a well documented problem with Google?

Panacea

3:02 pm on Apr 8, 2005 (gmt 0)



glengara

Yes, my hosting company is GD. Have others had similar problems that you know of? Would you suggest I switch to another host with a better reputation?

Bobby

Nothing wrong with my robots.txt. GD is not being very helpful in giving specifics, or even in understanding the problem as I describe it to them.

BlackRaven

5:38 pm on Apr 8, 2005 (gmt 0)

10+ Year Member



Does GD stand for GoDaddy?

If so, I am on GD and have had no problems.

jnmconsulting

7:40 pm on Apr 8, 2005 (gmt 0)

10+ Year Member



I have not had any problems with GD!

Reid

11:09 am on Apr 9, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



all 4 sites share the same ip address block

Take a look at this thread
[webmasterworld.com...]

Panacea

3:15 pm on Apr 9, 2005 (gmt 0)



Reid,

I only mentioned that my four sites shared the same IP block to illustrate that a name server glitch could have affected all four sites. I am not fully convinced that a dedicated IP makes much difference, since 97% of all websites use shared IPs.

I did, however, move 3 of these missing sites to dedicated IPs about 4 weeks ago. Since then I have noticed absolutely no difference. Google is not spidering them any more than before, only about once every 7-10 days.
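One way to put a number on how often Googlebot visits is to count its requests per day in the server access log. The sketch below assumes the common Apache-style log layout with a `[dd/Mon/yyyy:hh:mm:ss zone]` timestamp and the user agent on the same line; the sample lines and IP addresses are invented for illustration, and matching on the string "Googlebot" does not prove the request really came from Google.

```python
import re
from collections import Counter

# Captures the date part of a timestamp such as [20/Mar/2005:06:12:01 +0000].
DATE_RE = re.compile(r"\[(\d{2}/[A-Za-z]{3}/\d{4}):")


def googlebot_hits_per_day(log_lines):
    """Count, per day, the log lines whose text mentions Googlebot."""
    hits = Counter()
    for line in log_lines:
        if "Googlebot" not in line:
            continue  # skip browsers and other crawlers
        match = DATE_RE.search(line)
        if match:
            hits[match.group(1)] += 1
    return hits


# Invented sample lines (assumed log format, documentation-range IPs).
SAMPLE_LOG = [
    '192.0.2.10 - - [20/Mar/2005:06:12:01 +0000] "GET / HTTP/1.1" 200 5120 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
    '192.0.2.10 - - [20/Mar/2005:06:12:09 +0000] "GET /about.html HTTP/1.1" 200 2048 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
    '192.0.2.77 - - [20/Mar/2005:09:30:00 +0000] "GET / HTTP/1.1" 200 5120 "-" "Mozilla/4.0 (compatible; MSIE 6.0)"',
    '192.0.2.10 - - [28/Mar/2005:04:01:44 +0000] "GET / HTTP/1.1" 200 5120 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
]

for day, count in googlebot_hits_per_day(SAMPLE_LOG).items():
    print(day, count)
```

Run against a real log, long gaps between the dates that show up would confirm the "once every 7-10 days" impression.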

window

7:35 am on May 12, 2005 (gmt 0)

10+ Year Member



Panacea,

I am facing the same problem. I have 5 sites that are interlinked with each other.
My rankings disappeared from Google in January.
They are all on the same hosting plan and share the same IP.
The index pages of my sites are not being crawled at all and show without title or description, though internal pages are crawled weekly.
Did you find any solution to this problem?

financialhost

4:42 pm on May 13, 2005 (gmt 0)

10+ Year Member



If you're on shared hosting, anyone on your IP block who has been banned will affect all the other sites.

Solution - change to another hosting company that offers a unique IP.

Powdork

5:53 pm on May 13, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I have a similar problem although I don't think it has to do with hosting. It certainly seems like the creeping death.

When doing a site:www.mysite.com search, the following happens.
1. It used to show 1500 pages indexed; now it says 800. The site now has over 2000 pages.
2. An increasing number of the pages listed are URL only. When searching for exact strings of text from these pages they are not returned even when there are no results to show (i.e. it's not a duplicate content thing).
3. My index page and many of the other high up pages only show up when &filter=0 is added to the string (include omitted results). It seems these pages are the ones with incoming external links. It could be the 302 issue. These pages do show up for exact phrases and occasionally for detailed search queries.
4. When searching for "Company Name" (yes, with quotes) I do not show up in the first 1000 results.
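For anyone who wants to repeat the `site:` check with and without `&filter=0`, here is a small sketch that builds both query URLs. The base address `https://www.google.com/search` and the helper name `site_query_url` are assumptions made for the example; only the `site:` operator and the `filter=0` parameter come from the description above.

```python
from urllib.parse import urlencode

# Assumed base address for the search page.
SEARCH_BASE = "https://www.google.com/search"


def site_query_url(domain: str, include_omitted: bool = False) -> str:
    """Build a site: query URL, optionally adding filter=0 to include omitted results."""
    params = {"q": f"site:{domain}"}
    if include_omitted:
        params["filter"] = "0"
    return SEARCH_BASE + "?" + urlencode(params)


print(site_query_url("www.mysite.com"))
# https://www.google.com/search?q=site%3Awww.mysite.com
print(site_query_url("www.mysite.com", include_omitted=True))
# https://www.google.com/search?q=site%3Awww.mysite.com&filter=0
```

Comparing the two result pages by hand shows which pages only appear once omitted results are included.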

Background: the site was originally part of another site. The subdirectory of the old domain was 301'd to the new domain in June of 2004. The vast majority of the links have been updated, and I recently (about 1 month ago) removed the 301s. The pages have never ranked at any time since moving to the new domain.

TampaLou

6:22 pm on May 13, 2005 (gmt 0)

10+ Year Member



A problem I had, unrelated to "penalties", was when I changed my entire main personal homepage from .html to .shtml last year. It was a royal pain to do, but it was well worth it for the long-term value.

Unfortunately, Google couldn't get the old .html files out of its listing for a long, long time (some of them were commentary files dating back to 2000), and the new .shtml files were not being spidered by Google. Some still haven't been, nearly one year later (I did this in approx. August 2004).

One way I managed to get some of the best of the older commentary pages listed was to link to them individually from new commentary pages (which, from my main page, were being spotted right away). Another was manually submitting the "lost" classic commentary pages one at a time as the new .shtml files; a third was to create an .html file for those still in Google and make it an auto-forward to the new .shtml file.

Any way you slice it, it's been a pain to deal with, and the other major search engines out there have been able to catch up without too much trouble.

What's the lesson? If you're going to change your extensions en masse for an entire site, keep in mind that you might pay a serious price for the move, unless you have something set up to forward every old .html page to the corresponding .shtml page (which I didn't have easily available to me at the time).
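The "forward every old .html page" idea can be sketched as a tiny WSGI application that answers any request for a `.html` path with a permanent redirect to the same path ending in `.shtml`. This is an illustration under assumptions, not what was actually used on the site: the names `new_location` and `redirect_app` are hypothetical, and in practice a rewrite rule in the web server's own configuration would usually do the same job.

```python
from typing import Optional


def new_location(path: str) -> Optional[str]:
    """Map an old .html path to its .shtml counterpart, or None if it does not apply."""
    if path.endswith(".html"):
        return path[: -len(".html")] + ".shtml"
    return None


def redirect_app(environ, start_response):
    """WSGI app: 301 for old .html paths, 404 for everything else."""
    target = new_location(environ.get("PATH_INFO", ""))
    if target is not None:
        # A permanent redirect tells crawlers the page has moved for good.
        start_response("301 Moved Permanently", [("Location", target)])
        return [b""]
    start_response("404 Not Found", [("Content-Type", "text/plain")])
    return [b"not found"]


print(new_location("/commentary/2000-05.html"))   # /commentary/2000-05.shtml
print(new_location("/commentary/2000-05.shtml"))  # None
```

Paths that already end in `.shtml` are left alone, so the redirect cannot loop.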

FWIW, my main page (no .html or .shtml ending, just www.*****.com, with a PR4 from Google) has never had a problem appearing on Google. But if you tinker with the sub-pages, you might run into problems.

glengara

6:31 pm on May 13, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Powdork, slowly falling out of the index does sound like a 302, or a sudden devaluation of your links.

On the GD hosting thing, AFAIK they had blocked Googlebot from some of their servers, and sites were also slowly falling out of the G index.

Powdork

6:42 pm on May 13, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I have submitted the form at support.google.com with the subject of 'canonical pages' but haven't heard back yet. It hasn't been that long though. I am hopeful that Google is doing something behind the scenes to complement the window dressing they have given to the problem recently.

MLHmptn

7:08 am on May 14, 2005 (gmt 0)

10+ Year Member



The solution is to ban Googlebot in your robots.txt! I had this same problem happen to my sites, so I banned Googlebot and watched some of my pages fall out of their index. I gave it about 6 weeks, then removed the ban on Googlebot, and "voilà!" almost every page of my site was reindexed within a week! BAN GOOGLEBOT! I am serious! It worked for me! It sounds extremely stupid to ban Googlebot, but if Google can't update your supplemental pages to their regular indexed status, what good is Google doing you, the webmaster?