Forum Moderators: open
I recommend keeping an eye on it -- prior experience with that school's Web Mining students indicates that their numerous projects [informatics.indiana.edu] may not be as well-behaved as they initially present. E.g.:
Spring, 2006
1. Usage Statistics of Robots Exclusion Standard
[Direct link not included because it's to a blog and links to blogs violate TOS.]
"Crawl the URLs in the robots.txt. This would violate the robot exclusion standards but..."
("But"? I don't think so.)