Forum Moderators: open
Looking at yesterdays logs I noticed Altavista is grabbing css files.
trek13.sv.av.com - - [16/Jan/2002:03:20:29 -0500] "GET /css/a.css HTTP/1.0" 200 1227 "-" "Scooter-W3.1.2"
trek13.sv.av.com - - [16/Jan/2002:14:46:00 -0500] "GET /css/b.css HTTP/1.0" 200 994 "-" "Scooter-W3.1.2"
trek13.sv.av.com - - [16/Jan/2002:16:16:32 -0500] "GET /css/c.css HTTP/1.0" 200 1863 "-" "Scooter-W3.1.2"
trek13.sv.av.com - - [16/Jan/2002:18:51:18 -0500] "GET /css/a.css HTTP/1.0" 200 1227 "-" "Scooter-W3.1.2"
For the first time today a server we manage went down from memory overuse, the second the server came back online a pile of connections from
trek28.sv.av.com:44285
trek28.sv.av.com:44360
etc started coming in,
so I guess they havent perfected the whole idea of not requesting every page from a server at the same moment in time.
64.152.75.52 - - [30/Jan/2002:08:24:01 -0500] "GET /robots.txt HTTP/1.0" 200 1308 "-" "Scooter-W3.1.2"
64.152.75.52 - - [30/Jan/2002:08:24:01 -0500] "GET /directory-name/stylesheets/global.css HTTP/1.0" 200 939 "-" "Scooter-W3.1.2"
Alta has been very active on another site, going a few directories deep and grabbing graphics and useless HTML pages displaying full backgrounds. Just that site, not the same activity on others.
the one thing that bugs me though, is that despite the sheer amount of spidering from scooter, altavista doesn't update it's index ....