| 12:43 pm on Jun 24, 2003 (gmt 0)|
You didn't mention any specifics.....
| 12:48 pm on Jun 24, 2003 (gmt 0)|
It's Scooter/3.3, coming from buildrack38.sv.av
| 12:56 pm on Jun 24, 2003 (gmt 0)|
I assume you mean the AltaVista crawler called Scooter. If you wish, you can block it using robots.txt.
It depends on whether you want to be listed in AltaVista.
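For anyone wanting to try the robots.txt route, here is a rough sketch of what blocking Scooter might look like, checked with Python's standard `urllib.robotparser`. The robots.txt contents, the `is_allowed` helper and the example.com URLs are made up for illustration. This only shows how a well-behaved parser reads the rules, not what Scooter itself actually does.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: shut Scooter out of everything, let every other bot in.
ROBOTS_TXT = """\
User-agent: Scooter
Disallow: /

User-agent: *
Disallow:
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if these robots.txt rules let user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    print(is_allowed("Scooter/3.3", "http://example.com/index.html"))
    print(is_allowed("Googlebot/2.1", "http://example.com/index.html"))
```

The file has to be served from the root of the site as /robots.txt. To keep the listing but fence off only part of the site, swap `Disallow: /` for a narrower path.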
| 4:30 am on Jun 25, 2003 (gmt 0)|
I didn't ask it to... What a pain in the nas. They could send an e-mail asking us, instead of having their little pain-in-the-nas creature sniffing my system, and the damn thing opened a socket too! Little piece of crap! Someone needs to tell AV to sort themselves out.
| 5:37 am on Jun 25, 2003 (gmt 0)|
I don't think it would really be practical for AV to send an e-mail to every site they intend to crawl.
The whole point of the crawler is to index web sites so that AV users can search them later. If you're complaining about a legitimate search engine spider, then you really can't be planning for any real traffic. You aren't going to get real traffic without search engine listings, and you aren't going to get those without being crawled.
| 5:50 am on Jun 25, 2003 (gmt 0)|
Effectively, it's an e-commerce site for locals, simply because shipping the products overseas isn't viable; we basically import the goods and sell them locally. Also, I don't mind, say, the crawler looking once, taking the URL and buggering off. That's fine by me, but hogging the system for days is another thing...
| 8:42 pm on Jun 25, 2003 (gmt 0)|
That's what robots.txt is for.
Those of us who have free bandwidth want the search engines to come every day and take all they want, so that they have the most complete and up-to-date info on our sites.
| 12:10 am on Jun 26, 2003 (gmt 0)|
Scooter/3.3 appears to be ignoring robots.txt, as I pointed out in this thread:
BTW, has anyone else noticed this, and what do you plan to do about it?
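One way to check this on your own site is to compare what Scooter actually requested against your robots.txt. Below is a minimal sketch, assuming you have already pulled (user agent, path) pairs out of your access log. The `find_violations` helper and the sample data are hypothetical, and log parsing is left out because formats differ between servers.

```python
from urllib.robotparser import RobotFileParser


def find_violations(requests, robots_txt, agent_token="Scooter"):
    """List paths fetched by the named crawler that robots.txt disallows.

    requests is an iterable of (user_agent, path) pairs (an assumed shape;
    extract them from your own log format).
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    violations = []
    for user_agent, path in requests:
        if agent_token.lower() not in user_agent.lower():
            continue  # some other client, not the crawler we are auditing
        if path == "/robots.txt":
            continue  # fetching robots.txt itself is expected behaviour
        if not parser.can_fetch(user_agent, path):
            violations.append(path)
    return violations


if __name__ == "__main__":
    # Made-up rules and requests, for illustration only.
    rules = "User-agent: Scooter\nDisallow: /private/\n"
    sample = [
        ("Scooter/3.3", "/robots.txt"),
        ("Scooter/3.3", "/index.html"),
        ("Scooter/3.3", "/private/a.html"),
        ("Googlebot/2.1", "/private/a.html"),
    ]
    print(find_violations(sample, rules))
```

Keep in mind that a crawler may be working from a cached copy of robots.txt, so hits shortly after you change the file are not necessarily proof that it is being ignored.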