And that was only the beginning.
After that, they all started crawling my site, on average half a dozen pages per minute, probably continuing well past midnight, which is where my current log file ends. Looks like each one of those 16 robots fetched every single HTML document from my site separately.
I wonder if they will now rank my site cumulatively in exchange? ;)
Yesterday (2/Feb/02), 12 individual FAST servers (66.77.73.*) made 12 visits to one of my little sites, and each of them crawled nearly half of the entire site. Is this normal behaviour for FAST, or should I add a rule to robots.txt?
Pg: Req: %: Date: ISP
272: 272: 6.97%: 2/Feb/02 04:51: 66.77.73.73
272: 272: 6.97%: 2/Feb/02 10:14: 66.77.73.74
272: 272: 6.97%: 2/Feb/02 07:32: 66.77.73.75
38: 38: 0.74%: 2/Feb/02 12:01: 66.77.73.76
272: 272: 6.97%: 2/Feb/02 04:35: 66.77.73.77
282: 282: 7.21%: 2/Feb/02 13:08: 66.77.73.78
270: 270: 6.92%: 2/Feb/02 03:41: 66.77.73.80
272: 272: 6.97%: 2/Feb/02 03:33: 66.77.73.81
270: 270: 6.92%: 2/Feb/02 04:25: 66.77.73.82
283: 283: 7.05%: 2/Feb/02 05:40: 66.77.73.83
38: 38: 0.74%: 2/Feb/02 07:12: 66.77.73.84
272: 272: 6.97%: 2/Feb/02 06:28: 66.77.73.85
272: 272: 6.97%: 2/Feb/02 04:33: 66.77.73.86
272: 272: 6.97%: 2/Feb/02 04:16: 66.77.73.87
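On the robots.txt question: a rule there is matched against the crawler's user-agent name, not against IP addresses like 66.77.73.*. The sketch below uses Python's standard `urllib.robotparser` to check what a candidate rule set would block before putting it live. The user-agent token `FAST-WebCrawler` and the `example.com` URL are assumptions for illustration; check your own logs for the exact name the robot sends.

```python
from urllib.robotparser import RobotFileParser

# Candidate robots.txt: shut out one named crawler, allow everyone else.
# "FAST-WebCrawler" is an assumed user-agent token, not confirmed by the logs above.
ROBOTS_TXT = """\
User-agent: FAST-WebCrawler
Disallow: /

User-agent: *
Disallow:
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "http://example.com/page.html"  # placeholder URL
    for agent in ("FAST-WebCrawler", "SomeOtherBot"):
        print(agent, is_allowed(ROBOTS_TXT, agent, page))
```

Keep in mind that robots.txt is only honoured by well-behaved robots, and that blocking a crawler entirely also removes the site from that engine's index.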
One of our engineers was testing the performance of a new feature using a
test-cluster of 16 crawler machines. Unfortunately, he forgot to configure
the crawlers to divide the work between them, so they all acted independently
and effectively all crawlers were crawling all webservers. We got some
feedback on this quite quickly and the configuration error was then
corrected.
Sorry for the inconvenience and thanks for the feedback!
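For readers wondering what "dividing the work" between crawler machines can look like, here is a minimal sketch of one common approach: hash each hostname so that every site is owned by exactly one crawler. This is an illustration only, not a description of how FAST's crawlers are actually configured; the function names and the sample hostnames are hypothetical.

```python
import hashlib


def assign_crawler(hostname, num_crawlers):
    """Map a hostname to one crawler index in range(num_crawlers).

    A stable hash (SHA-1 here) gives the same answer on every machine,
    so each crawler can decide locally whether a host belongs to it.
    """
    digest = hashlib.sha1(hostname.lower().encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_crawlers


def partition(hostnames, num_crawlers):
    """Split hostnames into num_crawlers disjoint buckets."""
    buckets = [[] for _ in range(num_crawlers)]
    for host in hostnames:
        buckets[assign_crawler(host, num_crawlers)].append(host)
    return buckets


if __name__ == "__main__":
    # Hypothetical hostnames, split across a 16-machine cluster.
    hosts = ["example.com", "example.org", "example.net", "www.example.com"]
    for index, bucket in enumerate(partition(hosts, 16)):
        if bucket:
            print(index, bucket)
```

Without a step like this, every machine treats the full host list as its own, which matches the behaviour seen in the logs above.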