Forum Moderators: open
Perhaps wrong forum, if so, I apologize.
I'm just curious how come Google is still crawling and trying to reach pages I deleted from the site 8-10 months ago. It's getting 404's, but still coming back after a week or so.
Dan
Welcome to Webmasterworld [webmasterworld.com]
Are there still backlinks "somewhere" pointing to those pages?
Shak
No, AFAIK, there's no links anywhere to the pages in question. They were mock-ups for a design job I had last year, and I kept them on my own server for 2-3 months.
I really don't care, just wondering how many times Googlebot need a 404 until it understand that the pages are gone...
BTW: Actually, I've been here since August 2001, but not that frequent. Then I lost pass/log, so had to create a new identity.
Cheers,
Dan
If it's your own custom 404 page made with, for example, PHP you will need to put this header in yourself like header('HTTP/1.X 404 Not Found").
(My 404s say "HTTP/1.x 404 Page is buh-bye") :)
Peter
I think you might be on to something. I *think* <blush> it's Red Hat and Apache (host is <snip>). I do have a custom made 404, in plain html (but suffix .shtml) since I need a link to my site's home page.
Should I add that snippet in the head of that page? And what about the custom made 400, 401, 403 and 500?
D
[edited by: NFFC at 1:14 pm (utc) on May 24, 2003]
[edit reason] URL snipped [/edit]
<quote>
you will need to put this header in yourself
<quote />
Isn't it better I put in in the page? (Sorry, just couldn't resist) :-)
D
404-Not Found means the file was not found for unspecified reasons, but this condition is not necessarily permanent.
410-Gone means it's really, really gone, and the condition is permanent.
Ref: RFC 2616 HTTP/1.1 [w3.org] Hypertext Transfer Protocol/1.1
HTH,
Jim
404 Not Found
Not Found. The requested URL was not found on
this server. Apache Server at mysite.com.
www. mysite.com/page.html
But point is, my pages are in htm format not html.
I have DMOZ link and link in Google directory. Loot of fresh pages… but nothing…
I’m tired.
Please help!
Newman, Sad hearing. But please don't hijack a thread! Start your own!
D
> But how do I tell the server to bring up a 401 instead of 404?
That depends on what server you are hosted on.
For Apache, you declare the page, resource, or directory name (or a wildcard), and tell the server to respond with 410-Gone. For example, using mod_rewrite:
RewriteRule ^defunct\.html$ - [G]
RewriteRule ^removed-directory - [G]
RewriteRule ^discounts/discount-(.*-)+widgets\.html$ [G]
RewriteRule \.mp3$ - [G]
If you wish to serve a custom 410 error page, you can declare that in the same way you declare a custom 404 error page:
ErrorDocument 410 /custom410.html
HTH,
Jim