Page is a not externally linkable
lucy24 - 3:28 pm on Jan 26, 2013 (gmt 0)
And now, returning to this thread's original theme:
206.117.3.2 - - [25/Jan/2013:23:33:26 -0800] "GET /robots.txt HTTP/1.1" 200 657 "-" "LWNutch/Nutch-1.4 (another scientific bot - we accept your robots.txt! )"
206.117.3.2 - - [25/Jan/2013:23:33:26 -0800] "GET /robots.txt HTTP/1.1" 200 657 "-" "LWNutch/Nutch-1.4 (another scientific bot - we accept your robots.txt! )"
206.117.3.2 - - [25/Jan/2013:23:33:26 -0800] "GET /fun/lions.html HTTP/1.1" 200 2466 "-" "LWNutch/Nutch-1.4 (another scientific bot - we accept your robots.txt! )"
For a given definition of "scientific", anyway.
(For those who care: 206.117 is an outfit called Los Nettos. I don't know them, but when contact addresses begin with 'hostmaster@' you can safely draw conclusions ;) I do know there is some reason why I don't globally block UAs containing 'Nutch'; I just don't remember what that reason is.)