Forum Moderators: open
NetRange: 140.233.0.0 - 140.233.255.255
CIDR: 140.233.0.0/16
NetName: MIDDLEBURY
NetHandle: NET-140-233-0-0-1
Parent: NET-140-0-0-0-0
NetType: Direct Assignment
NameServer: LION.MIDDLEBURY.EDU
NameServer: CATAMOUNT.MIDDLEBURY.EDU
Comment:
RegDate: 1990-05-21
Updated: 2000-07-26
TechHandle: HM101-ARIN
TechName: McCausland, Howie
TechPhone: +1-802-443-5754
TechEmail: howie@middlebury.edu
[nitle.org...]
The bot you refer to (NITLE Blog Spider/0.01) is part of a weblog census being run by the National Institute for Technology and Liberal
Education (NITLE). We're a non-profit consortium of liberal arts colleges, funded by the Andrew Mellon Foundation. Middlebury is one of our member institutions, and since we happen to operate on their campus network, the crawl will appear to originate from the middlebury.edu domain.The purpose of the blog census is twofold.
First, we're trying to identify and catalog as many active weblogs as possible across all languages, and make this data publically available (on the site at [blogcensus.net,...] which should go live this weekend). There are very few accurate statistics on weblogs available right now, particularly concerning non-English language communitties.
Second, we want to use this data as a test collection for our own work on search algorithms and information retrieval. This work is described in some detail at [nitle.org...] Since weblogs are a live collection with about a million documents, they make a good data set for learning to scale our algorithms.
I have tried to make sure our crawler respects the usual robots.txt exclusion rules, and does not hit any sites too hard. If I've made any programming errors that are causing the crawler to behave badly, please contact me at [my] email address mceglows@middlebury.edu, and I will work to rectify the problem.
Similarly, if you have any further questions about the blog crawl, don't hesitate to ask. I expect the blogcensus.net site to be up by Sunday afternoon [June 8, 2003].
Maciej Ceg owski (Mr.)
Lead Developer
Center for Educational Technology
Middlebury, VT 05753