Forum Moderators: open
Example URL:
[laterooms.com...]
When submitted to Altavista at :
[add-url.altavista.com...]
we get
You have entered the following URL:
[laterooms.com...]
We tried to fetch this page but received an Error 406. The usual causes of an Error 406 are a mistyped URL or the page could not be retrieved from the Web site you entered. This page will be removed from the AltaVista index.
Any ideas??
Regards, Paul Walsh
This subject came up yesterday.
[webmasterworld.com...]
Error 406 means the server has a page which does not match the requested content headers (or something like that). Our php3 content works fine...
Paul
HTTP/1.1 404 Not Found
Date: Wed, 25 Apr 2001 17:21:36 GMT
Server: Apache/1.3.12 (Unix) PHP/3.0.16 mod_ssl/2.6.2 OpenSSL/0.9.5a
Connection: close
Content-Type: text/html; charset=iso-8859-1
<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
<HTML><HEAD>
<TITLE>404 Not Found</TITLE>
</HEAD><BODY>
<H1>Not Found</H1>
The requested URL /en/country1.html was not found on this server.<P>
<HR>
<ADDRESS>Apache/1.3.12 Server at wim.nation-net.com Port 80</ADDRESS>
</BODY></HTML>
So, their spider is going to have the same difficulty as me. What you'll need to do, my guess, is reconfigure your htaccess file...I don't know apache, though.
Try this: (it gives 200 OK at hypermart but 406 in Altavista)
[uk.laterooms.com...]
Paul
This could also explain why we can't get the same html(php3) pages into Google - all the php3 pages in the site are there but no sign of any of the handler generated ones ...
All I can guess is that the AV/Google agents are requested something like a charset or something which a normal html page would satisfy but which Apache is forgetting to add on to the ForceType handler
versions.
Paul
(we use 1.1 - virtual hosts)
Paul
HTTP/1.1 404 Not Found
Date: Wed, 25 Apr 2001 18:34:45 GMT
Server: Apache/1.3.12 (Unix) PHP/3.0.16 mod_ssl/2.6.2 OpenSSL/0.9.5a
Connection: close
Content-Type: text/html; charset=iso-8859-1
<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
<HTML><HEAD>
<TITLE>404 Not Found</TITLE>
</HEAD><BODY>
<H1>Not Found</H1>
The requested URL /en/region106.html was not found on this server.<P>
<HR>
<ADDRESS>Apache/1.3.12 Server at wim.nation-net.com Port 80</ADDRESS>
</BODY></HTML>
Notice in the top, the header information it says http 1.1.
Is what I get for the same url using an lwp script.
Here is what I got using a simple "get" perl client:
from the command line:
perl wget.pl www.uk.laterooms.com /en/region106.html
the resulting output:
HTTP/1.1 404 Not Found
Date: Wed, 25 Apr 2001 19:29:12 GMT
Server: Apache/1.3.12 (Unix) PHP/3.0.16 mod_ssl/2.6.2 OpenSSL/0.9.5a
Connection: close
Content-Type: text/html; charset=iso-8859-1
<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
<HTML><HEAD>
<TITLE>404 Not Found</TITLE>
</HEAD><BODY>
<H1>Not Found</H1>
The requested URL /en/region106.html was not found on this server.<P>
<HR>
<ADDRESS>Apache/1.3.12 Server at wim.nation-net.com Port 80</ADDRESS>
</BODY></HTML>
telnet www.uk.lateroms.com 80
>GET /en/region106.html HTTP/1.1
404 Not Found etc.
telnet www.uk.lateroms.com 80
>GET /en/region106.html HTTP/1.1
>Host: www.uk.laterooms.com
200 OK etc.
Do not use the ForceType x-httpd-php3 directive to alias a directory to a script - I have found that the following Accept headers are rejected with an Error 406
Mime Type:
Only
Accept */*
and
Accept application/x-httpd.php3 works
Many browsers send this but not custom bots
Accept text/html and text/* DO NOT !!
Also only Accept-Language: en works - no others!
Instead I am now using:
Alias /en/ /htdocs/laterooms/en.php3/
etc.
within the VirtualHost container
Seems to be working fine - no more Error 406s from Scooter and hopefully Googlebot will be happy too.
Paul