Example:
...
Connection:Keep-Alive
Content-type:text/html
...
instead of
...
Connection: Keep-Alive
Content-type: text/html
...
Since I programmed my robot to check the content type by looking for "content-type: text/html" (with a space) and to ignore everything else, the site couldn't be crawled.
I modified the code and now it works. But I wonder whether it's normal for a search engine robot to also accept, read, and index sites that return wrong or non-RFC-conformant HTTP headers. Any insights?
<edited>cloumn -> colon, thanks andreas ;)</edited>
[edited by: Yidaki at 6:39 pm (utc) on Feb. 6, 2003]
Each header field consists of a name followed by a colon (":") and the field value. [...] The field value MAY be preceded by any amount of LWS, though a single SP is preferred.
[faqs.org...] - 4.2 Message Headers
Andreas
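
For what it's worth, a check that follows the quoted rule (any amount of whitespace after the colon, including none) could look roughly like the sketch below. It is only an illustration in Python, not the robot's actual code: the function names parse_header_line and is_html are made up here, and it assumes the response headers have already been split into separate lines (folded continuation lines are not handled).

```python
def parse_header_line(line):
    """Split one raw header line into (lowercased name, trimmed value).

    Hypothetical helper for illustration only.
    """
    name, sep, value = line.partition(":")
    if not sep:
        # No colon at all, e.g. the status line "HTTP/1.1 200 OK".
        raise ValueError("not a header line: %r" % line)
    # Field names are case-insensitive; whitespace before the value is optional.
    return name.strip().lower(), value.strip(" \t\r\n")


def is_html(header_lines):
    """Return True if the first Content-Type header names text/html."""
    for line in header_lines:
        try:
            name, value = parse_header_line(line)
        except ValueError:
            continue  # skip anything that is not "name: value"
        if name == "content-type":
            # Drop parameters such as "; charset=..." before comparing.
            media_type = value.split(";", 1)[0].strip().lower()
            return media_type == "text/html"
    return False


# Both spellings from the first post are treated the same way.
print(is_html(["Connection:Keep-Alive", "Content-type:text/html"]))
print(is_html(["Connection: Keep-Alive", "Content-type: text/html"]))
```

Splitting on the first colon and trimming afterwards means the robot no longer depends on how many spaces the server happens to send, which is the part the original string match got wrong.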