Yes! Hallelujah! As someone who has used underscores in page names since day 1 (Before Google Era) and who has never bitten the bullet and changed all the URLs on my site, I'm hoping to see a slight bump.
Having said that, I've had the impression for some time now that Google already understands this on some levels. Same with run-together words (like WebmasterWorld). I'm not sure if it's related to anchor text or word density or what, but it's a feeling I get.
I saw that and wondered what difference it might make, they did include file names in their allinanchor calculations but only for a limited period... .
That's a shame, as underlined URLs with spaces or with underscores look much the same such that you cannot tell which one it really is.
That's a good reason to avoid both.
I think this is only big news if it is accompanied by the announcement that words in the filename are significant in the ranking algorithm.
This will be good for me, as I've been using underscores in the filenames for my book reviews since 1992 (pre-web) and have never changed that (though I probably should have).
OTOH, my reviews mostly rank first for title searches anyway, so any effect will be limited.
And now if they can use actual good signals to rank web pages then we might be onto something - oh I don't know something like looking at the content of the page vs links.
Oh sod it time to get my keyword rich urls into action!
|oh I don't know something like looking at the content of the page vs links. |
I don't know, isn't that what yahoo and msn do? Look where that left them...
g1smd, I agree
Koan, MSN and Yahoo are totally different from each other and Google - they all weight the hundreds of factors differently, however the term "google bombing" meaning to get lots of inbound links with a particular phrase to rank the destination web page that can not feature any of those terms may back up my point. That was just my half hearted joke in that respect. With regard to MSN & Yahoo looking at on page factors more, they have to because at the moment they are unable to create a deep enough and fresh enough index to be able to develop good link maps. That is the reason they are behind in the short term.
[edited by: tedster at 12:28 am (utc) on July 26, 2007]
|OTOH, my reviews mostly rank first for title searches anyway |
Are you Oprah? ;)
It's good news for general Webmasters, bad for those on the technical side though. I hope I can override this new behaviour "by_quoting_the_terms" so I can still find server variable names and such...
|And now if they can use actual good signals to rank web pages then we might be onto something - oh I don't know something like looking at the content of the page vs links. |
They already are (and have been for a long time).
europeforvisitors, that was meant to be ironic - I assume you are either (a) joking (b) don't know that I really know how Google ranks pages (c) can't spot irony.
(My above post in itself is light hearted - and ironically, ironic!).
[edited by: Swanson at 1:12 am (utc) on July 26, 2007]
No matter what, search results will always be slanted by those who abuse the system. Unless it is a way off topic where people don't spam the results, IMHO you never get good results.
This is great for my current site.
I may still switch to dashes with my next site though, just to avoid the underscore vs. space issue with underlined links. Any other reason to use dashes?
Go with dashes in my opinion, they have always been recognised as spaces by all search engines not just google and there is no confusion.
Underscores are not universally recognised as spaces by all search engines - only Google has come out and said this is in effect a new development. Use A-Z, 0-9 and hyphens just like a domain name - it is safe, and also gives you maximum exposure.
[edited by: Swanson at 1:22 am (utc) on July 26, 2007]
Stick with dashes.
I thought they counted all this time (2000).
Now I'll get downgraded for keyword over optimization.
That's my vote. There are other search engines besides Google and they are still trying to get the indexing of HTML under control, URI strings are next. And now you expect them (the other search engines) to follow everything Google is doing? Wishful thinking eh? :)
I was lucky enough to attend this same conference (WordCamp 2007,) and just as an FYI––while Matt did indeed say that underscores would soon be accepted as word separators, he also prefaced this entire segment of his presentation (which was on White Hat SEO) by stating that dashes were the best method for those who wanted to do well in Google's search.
I just thought that should be clarified, as it wasn't at all obvious from reading the article referenced above.
Honestly I'd be happy if file names didn't matter. It's content that matters to viewers .. not what you name your page or your choice of directory structure.
Google engineering is weak if it needs breaks of any kind to figure out URLs. If the page title and content don't explain it, the page sucks.
Multiple hyphens and long urls etc. are butt ugly. G should whack every url longer than c. three words, w/ or without hyphens!
Nice that WW doesn't have all the spammy urls like all the ugly blogs. If you want 15 words in your url, why stop there and not go for 100?
|Honestly I'd be happy if file names didn't matter. It's content that matters to viewers .. not what you name your page or your choice of directory structure. |
I think filenames do have a role to play in the relevance game. If I write an article with the filename "one-eyed-cats.htm," the odds are pretty good that the article is about one-eyed cats. That doesn't mean it deserves the #1 SERP position for "one-eyed cats," but it ought to be listed somewhere in the results for a search on that keyphrase.
|I think filenames do have a role to play in the relevance game. If I write an article with the filename "one-eyed-cats.htm," the odds are pretty good that the article is about one-eyed cats. That doesn't mean it deserves the #1 SERP position for "one-eyed cats," but it ought to be listed somewhere in the results for a search on that keyphrase. |
I agree but I don't think there should be that much difference in "one-eyed-cats.htm" and "one_eyed_cats.htm". Both articles are obviously should be about one eyed cats. I've always used underscores and will continue to use them. I don't see this as a big deal.
When I first understood that underscores were not being seen as word separators, I began experimenting. On one site I changed all the underscore urls over to hyphens, and we had a months-long rankings dip and then a recovery period. I cannot say that the fully recovered rankings were any better than before I switched to hyphens.
On another site, we continually build biography pages for important figures in that profession. Armed with my previous experimnet, I did not go back and change the legacy urls from first_last.htm, but I did begin to name new bios as first-last.htm. Again, there was no stand-out winner here, and it's now several years down the road. Both types of pages are still performing very well, and sometimes an underscore page ranks above the professional's own dedicated website. There are several hundreds of these bio pages on the site right now, so I'd say I'm looking at a significant amount of data.
My old-school way of thinking about the Google algo was like a scorecard. My black-box model was as if Google totalled up points for this and points for that -- and the most points would win. But I don't think the algo works that way any more. Google is getting very "neural" these days, working towards AI and fuzzy logic -- sometimes too fuzzy, perhaps.
A better analogy for keyword-in-url might be how well a site's signals let Google focus their lens, their "relevance lens". In this approach, I see keywords in the url just as a kind of reinforcing factor. They can confirm the sharpness of the focus, but I don't see those keywords as creating an independent plus. Instead, they are one potential reinforcement for what the rest of the algo determines about relevance.
It's even possible that a keyword-in-url that is off-topic for a long tail search might work against some rankings that the page used to get. It might be telling the algo, in essence, "I cannot confirm that relevance score from my angle - back it off just a bit."
I think you forgot an important aspect though.
A few million natural links to normal sites with copy pasted URLs as their anchor text... with /whatever_keywords_described_the_page(.html) for which they might get some additional recognition from the algo.
And thus stop ranking for "whatever_words_described_the_page" in exchange for the words and phrases themselves.
Keywords in URLs can account for some(times the majority) of your IBL anchor text so don't dismiss the idea just yet.
Although underscore is not yet a word separator, I just checked.
Also, there's no guarantee that Google would treat underscores any differently as it does now when it encounters them as text, when it encounters them as anchor text, and when it sees that they're in fact a copy pasted URL.
[edited by: Miamacs at 11:26 am (utc) on July 26, 2007]
|I agree but I don't think there should be that much difference in "one-eyed-cats.htm" and "one_eyed_cats.htm". |
I don't think there's much difference in practical terms. (I've got some underscored_filename_pages that rank #1 for extremely competitive keyphrases, so I'm inclined to believe that other factors--or the combination of other factors--has been more important than the word separator in the filenames.)
This seems to weigh heavily on MSN.com and suspectGoogle has been doing this for some time.
I agree with Essex_boy.
the hyphen works with MSN pretty well
Underscores sometimes become obscured in hyperlinks. Definitely a usability issue. In that instance, it might be a good idea to take that into consideration and setup something to catch those typos. You know, this...
as opposed to this...
| This 60 message thread spans 2 pages: 60 (  2 ) > > |