Forum Moderators: phranque
But there are some problems with just naively spitting out a page that has a link to every other page on your website.
Ran into these (and other) problems? What do you do? How do you break your sitemap up? How do you link to a broken-up sitemap (e.g., all pages point to the root sitemap page, or each page points to the most relevant sitemap sub-page, or...?)
I wouldn't worry for this:SEs searches keyword density,so they index page contents not links.Also,If you have a long link list,SEs anyway scan all of them,and search the related ones.
So there is any problem,but if you wish to be more secure,you just could break them up in sub-pages in a three structure like any existing directory.
I wouldn't worry for this
Sorry, I wasn't clear. These weren't hypothetical concerns. MSN/Yahoo! regularly rank my sitemap.html higher than, say, "article.html" for the title of "article.html" (which, of course, appears in sitemap.html. Not all the time, of course, but often enough to be irritated about the wasted traffic (people are unlikely to search a large sitemap page to find the exact keyword they were looking for).
Maybe someone has some SEO tips for discouraging this kind of mis-ranking of the sitemap page.
you just could break them up in sub-pages
I wasn't clear again. Yes, I implicitly listed that as a solution in the original post. What I'm looking for is a sampling of how other people who've run into these same problems have actually dealt with them (and the actual link structure they chose). Sounds like you haven't run into these problems.
Also,If you have a long link list,SEs anyway scan all of them,and search the related ones.
While it's true that Google has expanded their previous maximum page size that GoogleBot will scan, you seem to be saying you think that there is no maximum at all. My experience is to the contrary. See, for example, [webmasterworld.com ] wherein GoogleGuy cites the hard limit (at that time) of 100KB per page. The limit is larger now. I'm certain it's not infinite.
Dumb Engines Overvalue Your Sitemap Page
Have you tried <meta name="robots" content="noindex,follow">?
Sitemap Page Gets Too Big
Google offers an alternative way to submit large sitemaps.
[google.com...]
I don't know how to turn that into specific ideas about how to stop it from mis-ranking your site map. I can't remember it ever landing me on a site map page, but it often takes me to a site's home page, which is usually what I want.
[webmasterworld.com...]
Why do I want the sitemap indexed? Because it's a valuable source of keyword research once it attains a certain number and variety of keywords on the page. People put together keywords X and Y and land on the sitemap page because it refers to an article about X and also to an article about Y. I see that in my logs and realize "Oh, I could get some extra traffic by writing an article about "X Y".
noindex,follow
Hmmm, Google isn't a problem in this regard, and I guess I thought that was a Google-only construct, but I see now that it allegedly isn't. Of course, I don't want to use it as suggested here, since I do want the sitemap indexed and followed :-).
But, I guess I could use the rel="nofollow" link attribute on all the anchors that point to the sitemap page (except one). I suspect that would convince Yahoo! and MSN that it should not rank higher than a "normal" page for anything. However, I wonder if it would also significantly decrease the number of serendiptious hits on unanticipated keyword combinations that my sitemap currently gets.
Might be worth an experiment.