Forum Moderators: open

Message Too Old, No Replies

Sosospider

Update on this spider

         

Dijkgraaf

11:25 pm on Sep 16, 2010 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Since this thread is too old
[webmasterworld.com...]

Checks Robots.txt: Yes
Ignores Robots.txt: No, but it's a bit slow to read and start obeying it.

Behavior: It had been hitting my main page and a JavaScript file a few times a day, when it started going a bit mad on the 31st of August and requested the main page 99 times in a day.
On the 1st of September it behaved more normally (4 request) but on the 2nd 99 requests.
At this point I had enough and updated robots.txt to ban it.
3rd of September another 94 hits (and didn't read robots.txt all day).
4th of September, read robots.txt, and then proceeded to get the main page another 84 times.
5th of September, didn't visit, and ever since has just requested robots.txt and gone away.

Staffa

8:57 pm on Sep 17, 2010 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



On one of my sites I'm seeing the same thing happening. Although I did not record the first date it seems to be well before the end of August when it started.

The bot reads the robots.txt (in which it is disallowed) several times a day and promptly ignores it while in the meantime coming to crawl some 50-80 times a day.

By now I have probably seen their whole IP range pass by - though the range was banned before it started to go wild.

enigma1

2:23 pm on Sep 20, 2010 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Yes same here numerous attempts daily since the beginning of sep, to read the home page from the 124 range. So far the attempts are spaced few seconds apart at least.

blend27

3:12 pm on Sep 30, 2010 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Extremly obusive spider, comming from:

114.80.93.*
124.115.4.*
124.115.0.*

Does not even request ROBOTS.TXT

Requested home page 184 times in 17 hours.

Staffa

2:41 pm on Oct 1, 2010 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



114.80.93.nn also visits using these UAs

Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322;TencentTraveler)
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)

124.115.5.nn and 124.15.1.n also uses the above UAs.

58.61.164.nn with the above UAs. 42 visits in 12 days

Over the last 12 days one site got hit 934 times, sometimes up to 70 times in one hour. During that time 4 requests for robots.txt were made and promptly ignored.

This is not So-So anymore, this is downright obnoxious.