Page is a not externally linkable
- Search Engines
-- Search Engine Spider and User Agent Identification
---- UNTRUSTED in Nokia User Agent


lucy24 - 3:28 pm on Jan 26, 2013 (gmt 0)


And now, returning to this thread's original theme:

206.117.3.2 - - [25/Jan/2013:23:33:26 -0800] "GET /robots.txt HTTP/1.1" 200 657 "-" "LWNutch/Nutch-1.4 (another scientific bot - we accept your robots.txt! )"
206.117.3.2 - - [25/Jan/2013:23:33:26 -0800] "GET /robots.txt HTTP/1.1" 200 657 "-" "LWNutch/Nutch-1.4 (another scientific bot - we accept your robots.txt! )"
206.117.3.2 - - [25/Jan/2013:23:33:26 -0800] "GET /fun/lions.html HTTP/1.1" 200 2466 "-" "LWNutch/Nutch-1.4 (another scientific bot - we accept your robots.txt! )"


For a given definition of "scientific", anyway.

(For those who care: 206.117 is an outfit called Los Nettos. I don't know them, but when contact addresses begin with 'hostmaster@' you can safely draw conclusions ;) I do know there is some reason why I don't globally block UAs containing 'Nutch'; I just don't remember what that reason is.)


Thread source:: http://www.webmasterworld.com/search_engine_spiders/4440895.htm
Brought to you by WebmasterWorld: http://www.webmasterworld.com