Forum Moderators: open

Message Too Old, No Replies

IBM Crawl_Application

198.81.209.19

         

WebGuerrilla

6:27 pm on Nov 15, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



198.81.209.19 - - [14/Nov/2002:11:05:55 +0400] "GET [mydomain.com...] HTTP/1.1" 200 - "-" "Crawl_Application"

This on is interesting because I found it in a log of a site that uses an IP delivery system. This bot grabbed about 50 pages from this particular site.

However, this IP is not on the list, so requesting any single page from the site will return the content, but that content will not contain any links to the other pages.

That would suggest that this bot is crawling from a list of urls collected from other search engines, rather than following links in a normal fashion.

carfac

12:54 am on Nov 16, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Thanks WebGuerrilla
Added this to the ban list!

dave