Forum Moderators: Robert Charlton & goodroi
I could give zillions of examples of this where Google is completely off the mark but cant list them here due to WebmasterWorld policy so will have to try and use the widget example.
Lets say your detailed authority site has dedicated pages about "Blue Widgets" and also "Red Widgets" and you also have pages about "Blue-black Widgetvillers" and "Red-white Widgetvillers".
Now then, google has decided via its stemming information (probably due to webmasters buying adwords for both "Blue Widgets" and "Blue-black Widgetvillers") that these terms are associated when in fact they represent different terms (im not posting about plurals here, but different words altogether that G thinks are related).
When the search user types "Blue Black widgetvillers" into the search, google no longer returns the dedicated pages about this search as it used to, but it now delivers ANY page what so ever on the net from blue widgets to bluish widgets to bluey widgots to blue black widgetvillers for the search.
So, an authority site may find google listing its page about the blue widget rather than its dedicated page about the blue-black widgetvillers which the search user was looking for.
Moreover, a webmaster quickly finds that their dedicated pages may not rank for other search terms as thier dedicated "blue-black widgetvillers page is being returned by google for a search by another user for "blue widgots"
In all, i believe google has introduced this in order to try to prevent webmasters being able to optimise sites for multipal keywords in an attempt to push up adwords purchase and at the same time give the search user less precise results in the hope they will turn more to sponsored adverts where another webmaster may be bidding for the term "Blue Widgots" but using the "Blue-black Widgetvillers" title on their listing.
IMO stemming doesnt work or produce quailty serps match to the keyword string, its only any good for plurals - outside of this google is trying to run before it can walk and thats why some search results are plain garbage.
To quote Google:-
"Google uses Stemming technology. Thus, when appropriate, it will search not only for your search terms, but also for words that are similar to some or all of those terms. If you search for pet lemur dietary needs, Google will also search for pet lemur diet needs, and other related variations of your terms. Any variants of your terms that were searched for will be highlighted in the snippet of text accompanying each result.
[google.com...]
Problem is they havent stopped at mild variants!
[edited by: engine at 2:55 pm (utc) on July 4, 2006]
[edit reason] added link to the stemming section [/edit]
Still, in some niches, I think spam is a much bigger problem than poor relevancy due to stemming. And I also think that the semantics portions of the algo are continuing to improve. It's a work in progress, you know? Here's a case where I am very willing to use that "Dissatisfied" link at the bottom of a SERP. I think intelligent feedback on this issue can only help Google improve.
Im at a point now where i think google has lost all relevence where it continues to drive search using semantics.
It feels like Google picks the worst page on your site that may have a word variation on it and lists that one in the serps rather than your dedicated page about the subject!.
Ive also noticed that google gives weight to a page with just a link on it about the subject matter - i think they have this dial turned far to high!
As a regular user of google i have to say i hate the serps results now its a real mix up with next to no relevancy - i just dont get it!