Forum Moderators: phranque

Message Too Old, No Replies

URLScan Problem with Google URL Requests

IIS question - URLscan rejecting google URL requests

         

globalseo

5:45 pm on May 1, 2003 (gmt 0)

10+ Year Member



In looking at the URLScan logs on some of the webservers where it is deployed we are finding that urls from Google's crawler are getting blocked because they seem to be in a incorrect format. The url requested seems to be
/content/article/2/1700_50030,http://mydomain.com/content/article/2/1700_50030'

Trying to determine if it is a problem on our side or why Google is requesting urls in this format or where they are getting these urls from. No specific help from Google as of yet.

A sample from URLscan log is given below. Basically it shows that raw url is a relative url followed by a comma and the complete url [domain substituted]. The correct request would have resulted in only the relative url in the url scan log Raw url field

[04-30-2003 - 00:05:24] Client at 64.68.80.155: URL contains extension '.com', which is disallowed. Request will be rejected. Site Instance='3', Raw
URL='/content/article/2/1700_50030,http://mydomain.com/content/article/2/1700_50030'

[04-30-2003 - 00:05:37] Client at 64.68.80.149: URL contains extension '.com', which is disallowed. Request will be rejected. Site Instance='3', Raw
URL='/content/article/12/1687_50188,http://mydomain.com/content/article/12/1687_50188'

Thanks for the help!

DaveAtIFG

3:33 pm on May 3, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Any of you IIS users seen this problem?