Forum Moderators: open
I started doing searches on the site map's links and it came up in Google under most of the links terms. In other words, if there was a link to a page on our site and the link text was "Widget Repair" the site map was found when I did a search on "Widget Repair".
However, it didn't seem like Google got all the way through the site map file. In other words, only about two-thirds of the link texts result in my finding the site map when I do a search in Google. That's about 400 search terms of around 600. The last 200 search terms don't result in Google finding the site map.
Is it possible that the site map is too big? I had seen somebody say that a site map should be broken up into linked chunks of 50K files. My site map is about 148K and strangely Google indicates that it's about 101K.
It's like it just ignored the last 47K in the file. Maybe it got tired or bored.
Does anyone have any experience with this? Do I need to break up my site map? Does Google think it's a link farm or a bunch of doorway pages or something? I do use a common template and some of the pages are quite similar (while still being uniquely valuable to the visitor).
I guess in the end I will find out what Google thought of the site map when the next index is up, but just thought somebody might have some ideas.
Thanks for letting me know that. What exactly does it mean?
Does it mean Google drinks 101K worth of data and then goes on and will never get to the other 49K of data (but still indexing the links in the first 101K)?
Or does it mean Google drinks 101K worth of data, leaves, but eventually will come back to finish the job (but still indexing the links in the first 101K)?
Or does it mean Google realizes the files is > than 101K and moves on and never ever indexes any of the links in the first 101K?
I meant that from a design and indexing perspective. The leaner your code, the more you get indexed. Just think, would you rather have 10 pages of 101k get indexed or 20 pages of 50.5k? I'll take the latter. Even so, I might even go a step further and trim them down to 25.25k and end up with 30 pages being indexed. The smaller the better.