Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

URLs in Webmaster Tools and not site:

         

Tonearm

3:06 pm on Sep 11, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Very old URLs have suddenly appeared in Webmaster Tools as 404s. Their sudden appearance may have to do with the fact that I started returning proper 404s about a month ago, but the interesting part is these URLs were NOT in site:www.example.com.

tedster

1:13 am on Sep 12, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Sounds like Google dug back into their back office for historical URL records. They will do that from time to time, just to be sure that an old URL hasn't been brought back. However, URLs where Google gets a 404 response should not appear in the publicly available site: query - glad to hear that they don't.

Tonearm

4:27 pm on Sep 12, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



But Google wasn't getting a 404 until recently and the pages were not visible with site:www.example.com before that change.

tedster

5:05 pm on Sep 12, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Right - but Google stores WAY more information than we can access on the public side. Their history and age data patent application [webmasterworld.com] and others make that very clear. They have a long memory about your domain's history.

So they had the urls from somewhere or other, stored along with whatever HTTP server response they were getting - and from time to time they were re-checking them. When they recently got a 404 response, they let you know in GWT.

Tonearm

3:19 pm on Sep 13, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



OK, I was under the impression that all HTTP 200 pages were displayed with site:www.example.com if there are less than 1000. This really muddies the duplicate content waters.

g1smd

7:02 pm on Sep 13, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



If Google is telling you the URLs are 404, then they already know the pages are gone. That is good. There is nothing to worry about.

They will keep on checking to see they remain gone. That, too, isn't a problem.