Forum Moderators: DixonJones

Message Too Old, No Replies

Strangly high number of 404 in statistics

problem with high count of 404 errors

         

Grregorr

6:00 am on Jul 12, 2003 (gmt 0)

10+ Year Member



Some time ago I redesigned my site completely. Both before and after the redesign I had a customized file for 404 error. This 404 file is featured in site statistics generated by Webalizer just as the other files. Since the redesign I got dramatic increase of 404 erros. Recently I looked at the logs and to my suprise I found that it DOES NOT not seem to be caused by requests to files from my old site. It seems that the 404 file is loaded at the same time as graphics used for home page. The hits sequence may look like this (I edited it slightly, but the vital time info is real):

host - - [:08:30:43 +0200] "GET / HTTP/1.1" 200 10075
host - - [:08:30:44 +0200] "GET /gfx/sq.gif HTTP/1.1" 304 -
host - - [:08:30:44 +0200] "GET /gfx/1x1.gif HTTP/1.1" 304 -
host - - [:08:30:44 +0200] "GET /gfx/bgsz.gif HTTP/1.1" 302 219
host - - [:08:30:44 +0200] "GET /gfx/bgle.gif HTTP/1.1" 304 -
host - - [:08:30:44 +0200] "GET /404.html HTTP/1.1" 200 1854

I checked the home page and it seems to be perfectly OK. The site runs on an Apache/Linux server.

jpjones

7:23 am on Jul 12, 2003 (gmt 0)

10+ Year Member



Hi Grregorr,
Welcome to WebmasterWorld [webmasterworld.com].

I assume the 404 request is coming from the same IP as the rest of the graphics?

What seems to be happening is that SOMEWHERE on your html page, you have a request for a graphic, a css, a javascript (or maybe something else) file which does not exist on your website filesystem. Because of the way you have set the server configuration to handle 404's, if the server cannot find a file of ANY sort (be it graphic or other), it's returning the standard 404.html error page.

Go through your pages with a fine-tooth comb. If you're doing any image-preloading in your JavaScript files-check these out too.

If the IP address for the 404 isn't the same as the other files, then you could be a victim of someone else's site using your imagery - but this is unlikely if you're just getting 404's at the same time as your web site is being visited.

Hope thats of some help,
JP

Grregorr

7:52 am on Jul 12, 2003 (gmt 0)

10+ Year Member



Yes the request always comes from the same IP. As far as wrong request from the home page are concerned, that is the first thing I checked, but did not find anything. I do not use JS at all, home page is about 10kB in size and I think I checked it pretty thoroughly.

Another interesting phenomen (perhaps it is in some way connected, but maybe not). Search engines do not see my new home page or any other page on the site. The only page they see is the 404.html file. It is surprising because the traffic is normal and visitors seem to use the site as usual as well.