Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

What's the best way to check what URLs aren't indexed from a sitemap?

         

DiscoStu

11:13 pm on Dec 1, 2009 (gmt 0)

10+ Year Member



Is there a good tool/way to check which of your URLs from a sitemap are not in the index?

Sorry if this is the wrong place to post this, but I still haven't figured out where to post seo questions about Google other than here - seems where most people are posting them

tedster

4:14 am on Dec 3, 2009 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



We don't discuss specific SEO tools (see Google Forum Charter [webmasterworld.com]), but I do appreciate the challenge you face, especially if there are many URLs in the Sitemap. This is an area where I would "roll my own".

If you've got analytics of any kind, you should be able to get a list of URLs that are getting Google search traffic. Remove those URLs from your sitemap list and what remains is a good list to start your improvement campaign from. I would suggest that even if a page is in the index, if it's not getting any traffic then it's still very much worth a look.

Also, I've never seen a site of any significant size that is 100% indexed, so that's not a goal I even consider.

ogletree

10:10 am on Dec 7, 2009 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Put a unique code on groups of 999 pages. Write a program that will scrape Google for each of the unique codes. Then process the data and compare to your sitemap. Since this is not something you would need to run very often scrape very slow.