Lehigh University poking around here as well - a few pages here and there - no images.
IP: 128.180.121.221
UA: Mozilla/4.0 (compatible; MSIE 5.5; Windows 98)
*Note* Accept_Language is blank
Not once have I seen these "pokes" from Uni computing centers come to any good. They often come back full-on and rip entire sites - usually looking for keywords, density, etc.
We've already banned several IP ranges belonging to other university computing centers.
For us it's better to err on the side of caution:
RewriteCond %{REMOTE_ADDR} ^128\.180\.
GG
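For anyone who wants to see what that RewriteCond pattern would catch before banning anything, here is a minimal Python sketch for checking addresses pulled from a log. The names `matches_pattern` and `in_network` are made up for illustration. The only thing it relies on is that a pattern anchored on `128.180.` covers the same well-formed IPv4 addresses as the 128.180.0.0/16 block; it says nothing about who actually owns every address in that block.

```python
import ipaddress
import re

# Same pattern as the RewriteCond in the post: any address beginning with "128.180."
BLOCK_PATTERN = re.compile(r"^128\.180\.")

# The equivalent CIDR block for well-formed IPv4 addresses
BLOCK_NETWORK = ipaddress.ip_network("128.180.0.0/16")


def matches_pattern(remote_addr: str) -> bool:
    """Mimic the mod_rewrite regex test against REMOTE_ADDR."""
    return BLOCK_PATTERN.search(remote_addr) is not None


def in_network(remote_addr: str) -> bool:
    """Check membership in the /16 using the ipaddress module."""
    return ipaddress.ip_address(remote_addr) in BLOCK_NETWORK


if __name__ == "__main__":
    for addr in ("128.180.121.221", "128.180.121.222", "68.236.42.186"):
        print(addr, matches_pattern(addr), in_network(addr))
```

The escaped dots matter: without them, `.` matches any character, so the pattern could also match unintended addresses.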
Not once have I seen these "pokes" from Uni computing centers come to any good
gordon,
I guess there are exceptions to everything :)
I get some incoming traffic from Cornell; however, I have links going to their Making of America section.
I also get some occasional standard traffic from Rutgers. (Have some folks there I email with.)
The University of Kentucky as well.
All, without crawls.
On the other hand, I've had many universities come in on crawls.
Thanks to both of you for the Lehigh insight. I added them in before they started a crawl.
Don
wume2.cse.lehigh.edu - - [25/Mar/2005:09:45:10 +0200] "GET /directory/file.html HTTP/1.1" 200 21190 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
128.180.121.222
wume2.cse.lehigh.edu
Lehigh University
183 Computing Center, Building 8B
Bethlehem
PA
18015
United States
Why does it say it's Googlebot though? That's a bit suspicious...
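One way to test a Googlebot claim like this is the reverse-then-forward DNS check that Google documents for verifying its crawler: look up the hostname for the IP, confirm it sits under googlebot.com or google.com, then resolve that hostname and confirm it maps back to the same IP. Below is a rough Python sketch of that idea, not a definitive implementation. The function names (`is_real_googlebot`, `reverse_lookup`, `forward_lookup`) are hypothetical, and the resolvers are passed in as parameters so the logic can be tried without touching the network.

```python
import socket
from typing import Callable, List

# Domains Google documents for genuine Googlebot reverse DNS names
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")


def reverse_lookup(ip: str) -> str:
    """Return the primary hostname for an IP (PTR lookup)."""
    return socket.gethostbyaddr(ip)[0]


def forward_lookup(host: str) -> List[str]:
    """Return the IPv4 addresses a hostname resolves to."""
    return socket.gethostbyname_ex(host)[2]


def is_real_googlebot(
    ip: str,
    reverse: Callable[[str], str] = reverse_lookup,
    forward: Callable[[str], List[str]] = forward_lookup,
) -> bool:
    """True only if the reverse name is Google's and resolves back to ip."""
    try:
        host = reverse(ip).lower().rstrip(".")
    except OSError:
        # No PTR record or lookup failure: treat as unverified
        return False
    if not host.endswith(GOOGLE_SUFFIXES):
        return False
    try:
        # The forward check guards against a spoofed PTR record
        return ip in forward(host)
    except OSError:
        return False
```

For the request above, the reverse name is wume2.cse.lehigh.edu, which is not under either Google domain, so the check would fail at the suffix test without needing the forward lookup.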
After digging through the log some more I found another one also posing as Googlebot:
pool-68-236-42-186.phil.east.verizon.net - - [22/Mar/2005:17:56:18 +0200] "GET /directory/file.html HTTP/1.1" 200 21069 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
68.236.42.186
Verizon Internet Services
1880 Campus Commons Dr
Reston
VA
20191
United States
Both only requested one file.
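To hunt for more of these, a small script can scan the access log for requests whose user agent claims Googlebot while the logged hostname is not under a Google domain. This is only a sketch under a few assumptions: the log is in Apache "combined" format with hostnames already resolved (as in the wume2.cse.lehigh.edu entry), and the names `LOG_LINE` and `suspicious_googlebots` are invented here. A logged hostname comes from reverse DNS alone, so a hit is a lead to investigate, not proof.

```python
import re
from typing import Iterable, List, Tuple

# Assumed layout: Apache "combined" log format with the hostname in the first field
LOG_LINE = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) (?P<size>\S+) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"$'
)

GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")


def suspicious_googlebots(lines: Iterable[str]) -> List[Tuple[str, str]]:
    """Return (host, request) pairs where the UA says Googlebot but the host is not Google's."""
    hits = []
    for line in lines:
        m = LOG_LINE.match(line.strip())
        if not m:
            continue  # skip lines that are not in the assumed format
        claims_googlebot = "googlebot" in m.group("agent").lower()
        from_google = m.group("host").lower().endswith(GOOGLE_SUFFIXES)
        if claims_googlebot and not from_google:
            hits.append((m.group("host"), m.group("request")))
    return hits


if __name__ == "__main__":
    import sys

    # Usage: python script.py < access.log
    for host, request in suspicious_googlebots(sys.stdin):
        print(host, request)
```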