Forum Moderators: Robert Charlton & goodroi
Google Press Release [izurl.com]
Answer highlighting helps you get to information more quickly by seeking out and bolding the likely answer to your question right in search results.
The search engines have introduced new tags in the past, like the canonical tag. It's time webmasters introduce a few tags themselves and demand that search engines obey them. How about a tag like this:
<meta name="use" content"only-fair">
or something like that.
What google does is like steeling the apples in the garden of your neigbhours and selling them as your own.
So all of you know: Google reads Webmaster World. They assign people to read these threads. They copy items and distribute it in an internal Google newsletter. So, Google, you're reading this. Yes, you, Sergey, Larry, and Eric. What you're doing is deeply wrong. It's "legal", but it's wrong
That doesn't mean our forum is intended to be a way to get a message to Google. This forum is for webmasters to share professional discussion and analysis - it's here so we can help each other understand what's going on.
This particular thread itself is quite interesting in that regard. There are viewpoints shared in many directions about Highlighted Answers and Rich Snippets. As I read through the discussion, I found myself thinking about ideas I hadn't taken in before. For example, TheMadScientist's question about whether this move negatively affects advertising income.
try "the state or height of building an empire"
To add to this problem, apparently the answers are based from Google Squared. Google squared is based on popular answers, not correct answers.
[edited by: Seb7 at 9:16 pm (utc) on Jan. 24, 2010]
You can't take the most important parts and publish them as your own... If google is indeed finding the most valuable piece of information on a website, and publishing it without explicit permission, this indeed is a violation of copyright law.
what we've been talking about here, is google finding the answer to a question on your page and then presenting it to the searcher, so they don't have to visit your site.
but they've already gone way beyond that, months ago.
because what happens if they can't find the answer in a useable snippet? you would think that would defeat them, and they would present the user with the best alternative (...like a normal snippet, highlighting the words they searched for).
but what they are doing now is this: they have started writing their OWN snippets. they don't just lift a segment of text from your page anymore, they lift bits and pieces from all over the place and then re-write them into a user-friendly snippet of their own authorship -- which is more likely to provide the answer to the searcher -- and then print it under your listing.
if you do a search for a UK premier league football club, for example, then you will see what i mean. (the site i'm seeing is for the premier league.) they provide the date and score of the last game, and the next fixture -- none of which appears on the page in that format when you click through.
google realises that when people search for the club, what they usually want is the results and fixtures, so they have said 'to hell with the text on the page... we'll just write our own thing, and who cares whether it actually appears on their page or not'.
that is a blatant grab for the traffic, which would otherwise have gone to the site.
I better keep my "answers" long and descriptive, not short and sweet... it looks like quick, specific answers are what GORG is looking for.
Frankly, I don't think we should be in this situation. An improved and more granular robots.txt would take care of this. I've heard about a European proposal for a more "in-depth" successor to robots.txt, which would specify how the data is used. I think if we all demand that Google adopt it, they'll have no choice but to comply. The way it is right now, nobody is asking so they feel they can do whatever.
My own suggestion would be a single line at the top of robots.txt that points to another file (eg bots.php) and ALL bots are prohibited below that line.
The file bots.php (whatever) format would be based on typical Linux INI files (eg mysql, php) - headed segments for each bot/function (it would take a bit of working out).
Within such a structure, optionally dynamically changeable per SE/bot, pretty much everything could be covered for ALL bots that has been discussed in this and other threads. It could include formalised copyright text, type of content, instructions to use/not use specific SE features (eg stuff google has been trying to push, as mentioned in other threads), "use this snippet only" text - even site title and meta description.
Creating such a file for each web site would be relatively easy - there are enough people around who could create GPL software to create and manage the file.
The most difficult problem would be getting the SEs to accept the new system, but if enough robots.txt files said "go away or read and obey this" then maybe it could be adopted.
Oh, and I want paying! :) This should be managed by some kind of consortium with no axe to grind, although SE personnel should be included on the team.
[edited by: tedster at 12:18 am (utc) on Jan. 25, 2010]
Google is treading on shaky ground. These actions may be a violation of copyright law.
In other words, you're not allowed to find the most important piece of information and republish it. You can't take the most important parts and publish them as your own.
If the snippet shown is factually incorrect in a major way and someone who accepts it as true is harmed by the information in any way, how does google stand in law?
Here's an excerpt:
This document proposes extensions to the Robots META Tags format to express a content owner’s policy for allowing or denying crawlers access to and use of their online content. These extensions do not replace the existing Robots META Tags format, but enable unambiguous expression of permissions, both unqualified and qualified by a range of restrictions(1), and outright prohibitions as to what a crawler and associated automated follow-on processes may or may not do with the resource in which the expression of these policies is embedded...
(Emphasis mine)
I think that's exactly what we're talking about - what "follow-on" processes should look like and what control we have over them.
[edited by: tedster at 2:14 am (utc) on Jan. 25, 2010]
[edit reason] added a clickable link [/edit]
But if people don't visit sites then they won't click ads. I am not sure of the wisdom of this.
They WILL click on Adwords ads (not on your own ads), that's the point. Why share revenue when they can get 100%?
Do you realize that the snippet text of 10 results comes to 310+ words? It's like a full page of text! EVEN IF I don't find everything under one snippet text, I still have 9 other that total 270+ words! And I'm sure their algorithms are designed to show as unique and useful text under each snippet as possible so that the reader could scrape as much relevant information as possible without leaving google.com. Why would I want to click on any website when all I need to do is to read snipped text of a few selected sites that are on one page and then click on Adwords ads when needed?
I must say it's pretty genius, but it sucks to be a webmaster or content provider these days...
I must say it's pretty genius, but it sucks to be a webmaster or content provider these days...
Exactly. It's such an oxymoron it's not even funny...
It's really cool and I can't stand it at the same time!
I think the best advice has been given by a few here: Find a way to generate traffic without the Search Engines and you'll be better off in the long-run. Otherwise, you're at their mercy and IMO it's not a good spot to be in.
Since Google is attributing the source of the facts and not claiming to have vetted the information they shouldn't have any liability.
If they are attributing the source as the webpage above where their self compiled snippet of information appears and that website does not actually include this information where does that leave them?
its not a lot different to the whole google news issue. when google prints the headline and the gist of the story, its makes murdoch mad because he's losing traffic. and now they're doing more or less the same thing to us
Spot-on.
Google has now declared war on webmasters. They seem to forget, though, that the search engine eco-system only works when there is a triple-win situation for all participants (end-consumer, search engine, publisher). As they now want to take away a substantial part of the benefit for the publisher, I fail to see how they will be able to survive.
As I said in a previous post, I am inching towards blocking Google bot every single day. And I bet I am not the only one.
The best thing I see coming out of it is the possible decimation of MFA and 'thin affiliates' which serve no real purpose except to the owner.
This actually sounds like a good thing. I'm getting tired myself of landing on garbage sites when looking for something. If your opinion also includes the scraper type sites that folks complain about ranking above them while using their content, then another agreement from me.
If the internet had more informational or true creative and journalistic folks displaying the majority of the content, then it will be easier to protect that content through copyright awareness and explicit spidering rules. The thin affiliate webmasters don't have the same interest as real content creators in protecting and managing copyright I would imagine, and would only help the se's continue to trample copyright.
If your opinion also includes the scraper type sites that folks complain about ranking above them while using their content, then another agreement from me.
Yes, absolutely.
If the internet had more informational or true creative and journalistic folks displaying the majority of the content
Yeah, there's a whole rant I could go on here, but will refrain, except to say, if people built better websites IMO there would be much less complaining by the webmasters... Google, FaceBook and Twitter are 3 of the most visited sites on the Internet and I don't recall them running any TV ads, and Twitter may not be, but FB and G are mainly word-of-mouth sites people find useful... I think (hope) quite a bit of success moving forward will be determined by whether you can actually build a useful website, not out-scrape someone then display your affiliate links and AdWords better.
This actually sounds like a good thing. I'm getting tired myself of landing on garbage sites when looking for something. If your opinion also includes the scraper type sites that folks complain about ranking above them while using their content, then another agreement from me
you can't have a subjective view on "garbage" just cause you don't like something you can't just say ohok im cool with this cause it benefits me. Shady people will always find a way to do what they do.. SPAM is still rolling around the internet even with all the safeguards going up.
I think parked domains with adsense on them are total garbage, but godaddy and google don't.
[edited by: J_RaD at 7:00 pm (utc) on Jan. 25, 2010]