Forum Moderators: Robert Charlton & goodroi
"That's exactly the sort of sites I'm referring to"
Unfortunately some comments about this issue apparently can't be bothered to actually, horrors, look at the serps. Authority has a specific meaning with Google, and its plain that authority sites are what are commonly, mistakenly hit by this penalty. I don't think this is a good summary of the effect, but one simplistic way to look at would be to say authority sites with volume of quality in-links are being confused with spam sites with a volume of rotten quality in-links, sometimes.
One of the most interesting phenomenons is how an authority site can be #1 for red topic, blue topic, green topic and purple topic, but be 950 for orange topic, even though the linking, page structure and keyword usage is basically the same for all of them. Clearly a a ranking mistake is being made (either the 950 result, or all those #1's).
[edited by: tedster at 9:17 pm (utc) on Feb. 27, 2008]
One of the most interesting phenomenons is how an authority site can be #1 for red topic, blue topic, green topic and purple topic, but be 950 for orange topic, even though the linking, page structure and keyword usage is basically the same for all of them.
Precisely the situation I'm seeing. Very strange.
And poof, gone the next day.
Have you seen the same site repeat the top rank=>950=>top rank cycle more than once?
Anyone think redundancies on sites could be playing a part?
Do you mean redundancy in terms of how the navigation is set up on each page or do you mean something more?
authority sites with volume of quality in-links are being confused with spam sites with a volume of rotten quality in-links
Plus authority sites are bombarded with inbound links from scraper sites.
Have you seen the same site repeat the top rank=>950=>top rank cycle more than once?
I've seen pages on one site make six full cycles (drop, recover, drop, recover, drop, recover, drop, recover, drop, recover, and one final drop) since late June. When pages recover they are in the top 10, normally top 5. When the site drops I rarely check to see just how far, sometimes it is down around 150 while other times it is on the last page (950+).
And perish the though that we actually *GASP* title the page to describe what it is about, which is also coincidentally the same text incoming links have, incoming links which we very often HAVE NO CONTROL OVER?
And let's not optimize the page meta description or H1 using the same text, because then it becomes spam.
This just sounds ridiculous to me.
And the scraper sites are Google's problem. Google needs to find a way to fix it so they have no effect at all on the sites they are scraping. They just need to make the scrapers disappear as soon as they find them, and the scraper sites will cease to exist because they won't be worthwhile for the thieves who own them.
If you have a page about Blue Widgets, it's not too much to anticipate that the term "Blue Widgets" is actually going to appear on the page a few times.
It's pages, not a site. But yes, many times. Some pages maybe seven or eight times. Others four or five times. No page I know of has been penalized every single day since the September 2005 first sighting of the phenomenon.
I've seen pages on one site make six full cycles
Was there any appreciable variance in either the top or bottom positions with each cycle?
I'm just wondering if it's possible they are testing a routine that is making small refinements to the index with each pass through the data, since one of those papers referred to the monumental task that certain types of Phrase-Based Reranking would be.
And perish the though that we actually *GASP* title the page to describe what it is about
My only hope is that Google will figure out the damage they are doing to their serps and look at some alternative methods. Of course the real villains are the spam sites. I suppose someday they may just take over the Internet and it will be all gibberish.
Was there any appreciable variance in either the top or bottom positions with each cycle?
To be honest, I'm not sure. At first I thought pages were only falling back to positions somewhere around 100-200. But then I realized that the keyword tracking system didn't account for the fact that it was actually a different page on the site that was being returned in that position.
In other words take one page about Super Widgets. It used to rank in the top five. Then the keyword tracker would report my site ranked 175 for that phrase, however it was actually for a different page on my site that happened to mention Super Widgets. The page which is dedicated to Super Widgets was actually much, much further down in the results.
I can say that there wasn't much variation at the top. I don't know about the bottom though.... as far as G referral traffic goes there isn't much difference between being ranked 50 and being ranked 950.
Right now we are on the second round
top rank(months)=>950(3days)=>top rank(2weeks)=>950(+4days)=>?
One of our client who is also facing same situation and he is on Top Rank round. He just recovered yesterday only, hope this last for ever now. He cannot take such inconsistency any more.
thanks
lashkari
And the scraper sites are Google's problem. Google needs to find a way to fix it so they have no effect at all on the sites they are scraping. They just need to make the scrapers disappear as soon as they find them, and the scraper sites will cease to exist because they won't be worthwhile for the thieves who own them.
[edited by: Marcia at 9:52 am (utc) on Feb. 13, 2007]
Since then we have done three things:
- Removed duplicate text (but not the affiliate links).
- Added unique content that is not keyword dense.
- Slightly de-optimized keyword density on title, headers and internal links.
In the past few days we are starting to see a gain in the SERPs and now rank at around 20+ for the pages that have the new structure and have been crawled by Google. (Originally we where in the top 10).
HTH
If it's an anchor text thing you would think it would only effect the index page where alot of the anchor text links point to for alot of sites and not the inner pages where the content is.
Just removed the anchor text on a few links that are pointing to my site that is effected, see how it goes anyway.
Good luck.
However, IMHO I think changes applied to a Web site don't have an immediate effect.
Google should be making quite difficult for Webmasters to notice which changes have a direct impact on the ranking. Otherwise, it'll be pretty easy to figure out how to game Google.
That's ok. I'm not going anywhere.:-)
This particular site got filtered out once before so I removed one particular site-wide anchor text link and it came back in a few days. Co-incidence.. maybe?
Who's to say what has an immediate effect or not, highly crawled sites always have a better chance of having a quick turn around on tests that you decide to conduct.
I keep coming back because it is fascinating in its own peculiar way. It's a bit like watching the Google alternative dream morphing into the big bad Microsoft machine. They have turned into one big ugly robot that really doesn't care what it hits so long as it makes more and more money. If little guys with really cool sites are being battered, it is just collateral damage. I guess it's disappointing.
"If Google would stop PAYING people for putting up scraper sites.."
And that too. They could do that, but how motivated are they to do it? They don't seem to be at all.
[edited by: Pamela2 at 1:48 pm (utc) on Feb. 14, 2007]
BUT it tells me the pages are no longer way down at 900 something. Problem is that I don't know if I still need to reduce possible problem phrases or did I de-optomize the page so much that is only ranks in that extended cluster?
The first time the penalty (?) lasted 3 months (on 1 blog) and the second time (on another blog) one month.
They didn't change anything.
[edited by: Biggus_D at 11:40 pm (utc) on Feb. 14, 2007]
We have been hit with a phrase-related drop in Google since Feb. 1. For clarity, the same pages that have been dropped from one keyphrase serp (blue widgets) are easy to find if you make the search more specific (dark blue widgets). It seems to have affected only a few major keyphrase searches but not all our high ones. I think the drops are correlated with higher traffic keyphrases - the more referrals for that keyphrase, the more likely a drop.
First of all, I assume that there is a *phrase-based* point system in effect (similar to email spam filters) so that no small group of issues could explain anything in the SERPs (and yet people talk on the forums as if it could be one or two things). For instance, you do A and you gain X points, and you do B and you lose a few points; and at the same time, inbound links are doing C and you gain or lose points (e.g., lost wiki links), and so on.
As one major factor in your keyphrase score, I think Google is using more and more user data to decide what people like for certain search phrases. For instance, the more people bounce back quickly to try the next result, you lose points. Bounce rate and related issues are mentioned in the Google patent. I think user data is collected via the Google Toolbar, in addition to Google SERP reactions. It says right in the Toolbar info that they will use people's browsing statistics to help improve their system, unless you click or unclick some feature. SO, that said, if people with the Toolbar are going to your site, and the majority of them are not easily finding what they were looking for, your score will eventually go down for that phrase.
The user data factor helps explain why the higher traffic phrases are more affected. The more people that visit it and don't find something, the faster your keyphrase score will decline. The #1 spot would be the most susceptible to this because people click on it mindlessly. Just because you think it's useful doesn't mean the average searcher using a certain keyphrase will find it useful. Maybe their reading level is different or the site isn't user-friendly to them, etc.
When I compared our pages with the new top 10 serps, I could see why those sites might be more satisfying and more user-friendly for a lot of users. We are revising our layout, and this may or may not be related, but a few of our lost keyphrases started recovering, about a week after the changes. We still have some new layout improvements being rolled out shortly.
In sum, I think it is multiple factors working together, and I think user data input is probably one of the biggest and fastest-growing factors in the Google serps.
I would appreciate hearing any feedback or where you think I might have a flaw in my theory.
Thanks
Val
I'm absolutely certain in at least some cases (including my own) there is no possible way the bounce rate could be so low as to be penalized so badly. Look at some examples or browse some results yourself and you'll see a few sites at the back that simply defy logic if your theory is true.
Of course, I think it's fair to say that bounce rates could be (are?) a part of the overall score but not to the degree you are suggesting.
The #1 spot would be the most susceptible to this because people click on it mindlessly.
I've thought of this. The higher a page is in the serps the more likely it will get casual surfers who are just wandering about glancing at this and that.
Your regulars will stay longer along with those who are referred from a related site, through a mail list or forum or through a friend.
Of course some who are really interested in your topic will find the site or page through search engines but others will just surf on.
Here is why I think you are headed in the right direction...
Another symptom seen is that the rank of pages for a particular phrase can go way to the bottom one week, then climb back up the second week after a "data refresh". So what would cause the yo-yo?
Think about it like this. Let's pretend for a minute that the bounce rate from the SERPS is a much larger part of the score than it used to be. The bounce rate is something that would need to be continually counted and evaluated over time.
So let's say I have a page about blue widgets that ranks in the top 10. For whatever reason users were not highly impressed with the page and over the course of a couple of weeks the page got a high bounce rate. Google sends the page way back in the results.
Another week passes. Further back in the SERPS the page wouldn't accumulate a lot of bounces since not many people will go back that far in the SERPs. Without much more bounces being recorded for that page, it seemingly recovers without the webmaster having made any changes to the site.
This makes sense for a few other reasons as well. When MC has talked about sites impacted by this phenomena he consistently references making sure your site is useful to visitors. Many of us have taken that as a very vague answer. But perhaps it is more specific then we think. How could google measure how useful a visitor might think a site is? By measuring bounce rate.
Now the first argument against this theory is that Google wouldn't use something like this because it would be too easy to manipulate by users.... Just search for your own pages and click on your own pages in the results without going back to the original query to click on something else.
My response to that argument is that Google already has very sophisticated algorithms to detect "click fraud" from AdSense/AdWords. It would be easy for them to apply this knowledge (multiple clicks from the same IP, too many clicks too soon from different IPs, etc) to the SERPs.
Something else that supports this theory is that some people have had success taking the content of a particular page, putting it up under a different URI, and then the page seemingly recovers. In this case Google would no longer have the historical "bounce" data from the new URI and it would seemingly avoid the penalty... perhaps until new bounce data is gathered.
I think it is also worthy of looking at this from a data storage/retrieval/scoring view. These "data refreshes" only used to happen every few months. Then last summer they started to happen about once per month. During the fall they moved to about weekly, and then more recently MC says they happen almost daily.
So whatever factor is causing pages to take swings is something being calculated as part of the data refresh. Therefore whatever type of data we are trying to identify as causing the problem is something that can only be calculated over time... It doesn't seem like something that can be calculated "on the fly".
Thus, I think the answer to these data refreshes is to look at factors that would likely be something that they would need to calculate over time and not something that could be calculated "on the fly". Things like H1 tags, title tags, keyword density, etc could probably be looked at on the fly from the cache. Things like bounce rate, the number of recent new backlinks found for that page, etc are likely aspects that would need to be calculated behind the scenes and fed to other parts of the algo as the data is calculated.
Let's face it, many of us have looked at dozens of other sites impacted by this and we've found very few similarities between the sites. Therefore on-page factors seem unlikely. I think the answer lies in off-page data Google is calculating.
This can also start to explain why some pages on a site with the same template are impacted, but other pages are left alone... For whatever reason the user data Google is collecting is different for those pages.
And this could even explain why it is most often the more popular pages/phrases that get sent to the back of the results... More user data for Google to make an evaluation from.
I could be completely wrong on all of this.... but so far it is the only theory I've seen that makes sense for the majority of sites I've seen impacted by it.