Forum Moderators: open
Is this a side-effect of improved dynamic spidering, or have the web developers for the sites wised-up (en masse)?
Random forum posts and PDF's sometimes show up in the top 10 for words that are pricey at overture ($5+).
My hope is that this is just a temporary situation before the full update occurs.
Above me:
3 .gov sites that will be of no interest to searchers for this phrase.
A Stanford university paper that might be of some value to a small minority, but not really what they will be looking for.
A .pdf which won't be of real interest
Snapping at my heals, more .gov sites and 2 Amazon book sites.
Not only will the book sites be of no interest to anyone using this phrase, but one is for a year 2000 guide, just had a look and there are more up to date versions of the same book. hmmmmm.
Example:
Keyword: Widgets
Message board URL: www.ukscoobydoo.co.uk/scoobydoodiving/forum/messages/1191.html
Then you click on the link and see:
Subject: 'Scooby Doo'
re: Go to my 'widgets' site and check out all the 'widgets' {link..link..etc.}
And 2 of these boards are totally different, but both have comments and links from the SAME person.
In the friggin top 10? What? Googleguy, please let us know when there will be some light shed on this. Thank you.
SeventiesMartin, as I listed in my post above, 3 of the 10 results on page 1 were unrelated .GOV sites. That, combined with some other worthless results, show Google is having big problems with its algo changes.
Perhaps what we need is a better outlet for our frustrations? Webmaster World doesn't permit posting specific URLs. Googleguy may or may not be listening. Using Google's official spam report can be like a one-way black hole.
How about a independant 3rd party creating a new Google spam report website (www.google-spam.com is available). It appears just like Google's spam report, except that the results are posted for public viewing.
Fill in a spam report to Google. Send a duplicate report to Google-spam.com. Get it off your chest. Spammers will be embarrassed to see their domains listed. Google itself is free to peruse the site. Feature the top 10 examples of spam and irrelevant results. The possibilities are endless!
Silly idea? Probably. But this is the sort of desperate idea that come to mind after Google keeps pulling the rug from under your feet.
How about a independent 3rd party creating a new Google spam report website (www.google-spam.com is available). It appears just like Google's spam report, except that the results are posted for public viewing.
Good idea. But save yourself some grief and make it google-spam.org instead.
A small-time blogger had the scroogle.com domain, and got threatening emails from Google lawyer Rose Hagan complaining that "scroogle" was "phonetically similar" to "Google." I immediately registered scroogle.org and offered it to the beleaguered blogger for free if he wanted it.
The point is, I think you're safer with a dot-org than a dot-com in cases like this. It also helps to be clearly operating as a nonaffiliated nonprofit.
There's no doubt that something very unusual has happened. But I suspect the 'Amazoogle' theory is probably wrong.
(By the way, I proposed the word Amazoogle, and was shot down in flames - but it's just a term that springs to mind when I look at the SERPS!)
What we may be seeing is an inflated version of what we used to call freshbot.
New crawl => new site that is initially high in the listings (this actually makes sense, if it is a good site folks need to see it for a few days so that it can be linked to!). Then it drops to find its proper place.
The Amazon sites etc. are more worrying - the affiliate system is so huge that without a tweak they'll ruin the serps.
In fact I have done a report today for the Googlers to consider - note consider (Google might find the pages relevant to the search) - just making them aware of problems/unfair ranking.
However, the problems being discussed regarding PDF,Docs and Amazon pages is not a spam issue but an algo issue.
Cant say I have noticed Amazon,PDF or Doc problems myself ( unless the term is mentioned here ;) )but I have been working to much and surfing to little - wonder if anyone from Amazon has noticed this thread and if action was taken I am sure they would feel it was unfair - have they broken Google Spam guidelines?
Well done!
See you all in cyberspace!
Here we are talking about a site with nearly 8 million pages (allinurl:) dominating the SERPS in most search areas (and it would seem with results that are mainly not relevant) to the detriment of the user experience.
If we can't discuss that then there's not much point in a forum.
Also, the pdf's, and docs that are being talked about are not being mentioned by site name, just document type.
"If we can't discuss that then there's not much point in a forum." - I disagree, WebmasterWorld does well even with this TOS strictly enforced. I agree it's annoying sometimes, but it's there for a reason, and not just to protect spammer sites....
When a major company, or group of companies, can come into the Google index and absolutely dominate thousands of markets, that's a problem. That's a problem for the end user, that's a problem for webmasters and ultimately that's a problem for Google, as joe user will simply switch search engines in search of relevent results.
I won't go as far as to say that Google's days are numbered. However, there is a lot of negative energy swirling around about Google right now. Not just from webmasters but end users as well.
I work in the high tech industry. Just a short time ago you were considered a dinosaur if you didn't use Google as your SE. That is no longer the case. Now you are "quaint", and in danger of being "lame" if you use Google. That's not a good sign.
If Google can't regain some semblance of stability and relevence they have a tough time ahead.
Also, I would never advocate using someone's domain name here. Unfortunately the companies in question here are so big that they are literally household names. Shall I say "That really big online book company that also has web stores and other products as well"? Or perhaps "That other big SE that Microsoft owns"?
In many areas the SERPS aren't up to the quality that we know they can be at the moment.
This forum is our only way to discuss what we see. It is also a way we can give feedback as GG reads this forum. GG takes note of what's mentioned here, and I'm sure google get to know of problems sooner than they might have done because of Webmasterworld.
In cases like this where extemely large sites are completely dominating search results we can't discuss and give that feedback without mentioning names.
The solution is simple:
Buy the full version of Acrobat and upload all your pages into this format - and convert to Word too. Stick your site-map in it. Excel won't do any harm either!
Publish your site in as many languages as possible; become a reseller of everything you can think of. Become an Amazon associate.
But most cunning of all - write a BOOK about your site, and publish it, and ensure that Amazon sell it.
First, to compare Ink to Google is almost funny. The actual results by Ink aren't awful, but hardly compare to Google. The reason? Ink has nothing in their database! I did a search on pure search for various domain names, to see how many pages were listed in their index. Google has virtually all of mine (over 2000), but only 160 in Pure Search. A friends website is almost identical. Even larger sites, though, suffer the same fate. I checked a number of major catalog merchants domains and found the same thing - only a small fraction of their pages are listed in the Ink database. Thus, Ink has no where near the index that Google does, which limits it's usefulness.
Second, for many searches, there is nothing wrong with Amazon appearing. Many of the SE results are dead on for what Amazon has. While it isn't perfect, for sure (I too have seen some Amazon results come up that leave you scratching your head), for the most part, Amazon generally deserves the rankings it gets.
Third, completely agree about Kelkoo and Netster. They are listed everywhere in the SERPS and add absolutely nothing to the quality of the SERPS.
Fourth, another big question which is very justified, why are all these other search engine results appearing in Google Listings. I mean, this is probably the biggest spam of all - having other sites somehow (I have no idea how) manipulate Ms. Googlebot into ranking their site for virtually everykind of keyword combination available. And, as previously posted, most of these pages lead to 404's by the time they appear in Google's index.
Fifth, while I'm not a fan of DealTime, it's a popular site and it works well enough for what it does. I can understand why it is ranked as highly as it is. However, is there really a need to rank all the "flavors" of dealtime?
Sixth, while I've seen more PDF and DOC files in the SE's (which can be irritating, especially DOC files). However, many of these PDF's HAVE been on target - frequently deserving to be ranked where they are.
Ultimately, Google is always going to battle spam. The majority of the "old spam" sites seem to have been flushed from their system, with some newer sites taking their place. As always, Google will have to tweak their algo to get rid of these sites to keep their rankings ok.
Right now, from where I sit, I see two big sources of SE clutter that Google needs to guard against. First, they need to quit listing other Search Engine Results. Why search the results of another search engine? I went to Google to find something - not to be handed off to another Search Engine.
Secondly, with the rise of Google AdSense, has also arisen quite a crop of absolutely useless sites designed for one thing and one thing only - sell Google AdSense clicks. These sites are pure spam (consist of nothing but links to nearly carbon copy pages, a Google AdSense Ad, and a very long, plainly visible thing of keyword text), but, because they don't do anything "dirty" (no hidden text, doorway pages, etc...), they get picked up by Google and often rank quite highly.
Anyways, just my thoughts.
Jim
This is totally untrue, and its bizarre to even see it mentioned.
The problems with the serps are a LACK of pagerank. The serps are ALL about anchor text.
Do an allinanchor search for a term. Do a regular search for a term. The results are normally nearly identical. Jeez, its plain as day if you spend five seconds checking.
This is all about Google's obsession with anchor text and its huge overraction to the selling of pagerank by drastically slashing the value of pagerank in the algorithm.
Can this explain the Amazon deluge?
I agree with several of your points. However, what is at question here is the future of Google search results. If Amazon can dominate the serps in this way, and I can't agree that the pages I've seen deserve to be there, then what's next? Do other huge companies simply list a billion items from a database and domintate as well? What happens when 5 or more big companies, all selling the same things, dominate the top 10 or 15 listings on Google?
I think that companies like Amazon should be listed in the serps. I would expect a search like "online book dealers" to have Amazon in the top 10, perhaps #1, instead Amazon isn't even there.
However, I would not expect a search term like "chiltons repair manual" to have Amazon in the #1 position, as they are now. I would expect the chiltons company to be above Amazon in the serps on their own product.
The bottom line is that no one wants to see a bizillion Amazon pages in the serps.
Using the search term you provided, I agree Amazon probably shouldn't be the number 1 slot - but it should most definitely be in the top 10, if not number 2. After all, the search term you used is for a "book" - a title of a book, in fact. And, in regards to books, Amazon is likely to be number 1 for many reasons.
And the reason I suspect Chiltons own website is down at number 5 (behind Yahoo Auctions and a Auto Gear Site) is due to so-so SE technique and a lack of back-links. Some more backlinks and removing the frameset would probably help increase the rankings of Chiltons Site.
Jim
[google.com...]
Ill pay $1,000 to anybody can tell me how Amazon gets a #7 and #8 ranking on the word "car parts" ... it doesnt even mention the word.
This is a VERY hard word to get ranked for, all other sites in the top 10 have over 60-200 links.
Im not picking on amazon but look at the page.
I assume you're referring to me, and I'm sorry but three of those were the broadest search terms possible with 1.7 million, 2.9 million and 4.5 million results. Data storage is by no means a technical search. Of those three, two pertained to my site. The other two did not pertain to my site, and they were picked at random. PDFs have never appeared in the top 30 results for the two that I monitor closely. If that is the case, I could put up 20 - 30 page detailed pdfs all day long to compete. As I've mentioned before, I feel that a pdf takes away from the users experience unless it is a very technical search, otherwise I'd provide more for my users. But even if you take all of that out of the picture, I'm talking about pdfs/docs that have little relevant content.
I appreciate you looking into it, and if you'd like for me to explain further or send more examples then I will.
One final note, I've searched in areas where I do have pdfs, and I've found them near the top of the results. They don't deserve to be there either.