Msg#: 3829149 posted 12:23 am on Jan 19, 2009 (gmt 0)
Does anyone know why the top results for mainkeyword*, mainkeyword**, mainkeyword*** and main-keyword differ from the normal mainkeyword search? What are causing the differences in the top 10?
I am trying to determine if my site has been manually or semi-automatically dropped to around pages 3 to 4 for its main keyword. If I try those special searches my site ranks around 20 positions higher in top 15 along with the other major sites.
I'm just curious. There is little competition (~200,000 results), my site is optimized for it (not over-optimized) and starting on page 2 the results are mostly from outdates news, blog tags and directories. I think it's realistic to expect to rank somewhere on top of page 2, right where I see it with the special *** searches.
It's a popular "money" keyword so maybe I need to achieve a certain level of Trust Rank before I can get an unfiltered position for it? The domain is 6 months old so it's still relatively new in Google's eyes. All traffic now is from 2 or 3 word searches where rankings are okay.
Msg#: 3829149 posted 7:58 pm on Jan 19, 2009 (gmt 0)
Those special searches include a * which is a wild card or pattern matching character. If you notice which words are bold in the results for those searches, you'll see that the rankings are no longer being calculated for just the single main keyword but now also include other words.
I'm happy that you asked this question, because there may be some value in noticing WHICH extra words are included and rank near the top.
If I were to take a guess, I'd say that the data collected by Google's phrase-based indexing [webmasterworld.com] is at work here - words that have a higher frequency of co-occurence may be more highly ranked on a wild card phrase. That's just a hypothesis at the moment, but it's one I'm going to experiment with.
Msg#: 3829149 posted 8:33 pm on Jan 19, 2009 (gmt 0)
I recently found a (very) small set of data released by Google two years ago to the "Linguistic Data Consortium" at the University of Pennsylvania. For anyone looking for a more concrete example of what phrase-based indexing measures, this is such a thing:
Msg#: 3829149 posted 9:01 pm on Jan 19, 2009 (gmt 0)
I'm not sure the highlighting is all that revealing here.
If you search widget*, the highlighting function will highlight that and the next word (or next punctuation character, the highlighting is very imprecise). Each asterisk will highlight an additional word.
Msg#: 3829149 posted 9:17 pm on Jan 19, 2009 (gmt 0)
I believe the phrase-esque matching is because the main function of the asterisk is for unknown words - e.g. if you're not sure what might come in the middle of something ([Google * News] or [Google news is a *]). It's not quite phrase matching though, as you'll get different results if you enclose the query in quotes.
What's also interesting is that the inclusion of a space (or lack of) changes results too - [google*] is not the same as [google *].
I don't think it's to do with related words though - it seems to me that a query for an unknown words is pretty tricky relevancy-wise, in a similar way to ultra generics like [0..9999999] or [site:com] - so there's some kind of fallback to trust or authority or some such thing ;)