Forum Moderators: Robert Charlton & goodroi
Increase base index size by 15%. [project codename "Indexing"] The base search index is our main index for serving search results and every query that comes into Google is matched against this index. This change increases the number of documents served by that index by 15%. *Note: We’re constantly tuning the size of our different indexes and changes may not always appear in these blog posts.
New index tier. [launch codename "cantina", project codename "Indexing"] We keep our index in “tiers” where different documents are indexed at different rates depending on how relevant they are likely to be to users. This month we introduced an additional indexing tier to support continued comprehensiveness in search results.
Keyword stuffing classifier improvement. [project codename "Spam"] We have classifiers designed to detect when a website is keyword stuffing. This change made the keyword stuffing classifier better.
No freshness boost for low quality sites. [launch codename “NoRot”, project codename “Freshness”] We’ve modified a classifier we use to promote fresh content to exclude sites identified as particularly low-quality.
More authoritative results. We’ve tweaked a signal we use to surface more authoritative content.
Fewer autocomplete predictions leading to low-quality results. [launch codename "Queens5", project codename "Autocomplete"] We’ve rolled out a change designed to show fewer autocomplete predictions leading to low-quality results.
More efficient generation of alternative titles. [launch codename "HalfMarathon"] We use a variety of signals to generate titles in search results. This change makes the process more efficient, saving tremendous CPU resources without degrading quality.
More concise and/or informative titles. [launch codename "kebmo"] We look at a number of factors when deciding what to show for the title of a search result. This change means you’ll find more informative titles and/or more concise titles with the same information.
Better query interpretation. This launch helps us better interpret the likely intention of your search query as suggested by your last few searches.
Anchors bug fix. [launch codename "Organochloride", project codename "Anchors"] This change fixed a bug related to our handling of anchors.
More domain diversity. [launch codename "Horde", project codename "Domain Crowding"] Sometimes search returns too many results from the same domain. This change helps surface content from a more diverse set of domains.
[edited by: Robert_Charlton at 12:19 am (utc) on May 8, 2012]
[edit reason] added quote box and fixed title [/edit]
Better query interpretation. This launch helps us better interpret the likely intention of your search query as suggested by your last few searches.How do I stop my last few searches from being used in this manner? Is being logged out enough? Google needs to clarify this as it has privacy concerns attached.
MLB search feature. [launch codename "BallFour", project codename "Live Results"] As the MLB season began, we rolled out a new MLB search feature. Try searching for [sf giants score] or [mlb scores].The #1 result is ESPN which shows scores right in the serps and Google's new display of scores is apparently coming from ESPN. I think ESPN might have something to say about their data being scraped like that...
MLB search feature... data being scraped like that...
[edited by: Robert_Charlton at 7:32 pm (utc) on May 7, 2012]
No freshness boost for low quality sites. [launch codename “NoRot”, project codename “Freshness”] We’ve modified a classifier we use to promote fresh content to exclude sites identified as particularly low-quality.
More authoritative results. We’ve tweaked a signal we use to surface more authoritative content.
More authoritative results. We’ve tweaked a signal we use to surface more authoritative content.
I now only use Bing because of this crap - I got sick and tired of Wikipedia being #1 for 99.99% of all search terms. I know Bing shows Wikipedia as well, but nowhere near as much as Google!
How much of a failure for a search engine is the fact that Wikipedia is the #1 result for pretty much everything?
How much of a failure for a search engine is the fact that Wikipedia is the #1 result for pretty much everything?
It depends.
Say, if Wiki was in the top 5 results, it achieved an average 50% CTR. That would strongly suggest most searchers preferred Wiki to RandomSite.com.
but I know from casual shoulder-surfing that most people go straight to Wiki, regardless of its position. Ergo, Google is returning the most popular destination in the top spot.
atlrus wrote:
How much of a failure for a search engine is the fact that Wikipedia is the #1 result for pretty much everything?
People generally go to a search engine to find alternatives.
[edited by: tedster at 7:22 am (utc) on May 9, 2012]
I don't think this is true. In my experience with hundreds of clients since long before the days of Google, they go to a search engine to find an answer. Not alternatives. An answer. And generally they're happy with the first one or two they find. WE ARE NOT NORMAL USERS. Normal people don't search like we do.
Normal users also don't resent Wikipedia because it ranks above them. ;)
jonathanleger: bookmark google verbatim search, you will like the results much better.
When I search for "flat tire" the Wikipedia entry tells me what it is, what could happen if I drive on it, and differences between car tires and bicycle tires. It also includes prevention information, assorted methods of repair, and an odd paragraph about the dangers of changing a flat tire. #2 result is an authority site's page on how to change a flat tire. That looks pretty darn good to me.
Assuming many people search for just "flat tire" in the first place, which is a big assumption
When you use bing.com/search?q=flat+tire you would basicly get the same results, just some images added.
In the absence of any clue about user intent, Google served up a variety of possibilities - "flat tire" doesn't mean anything by itself.