It hit my servers for several thousand pages. It looks to me like it is crawling excite listings.
This is them -> Capital Network, 8th/F Chian Resources Buiding, No.8 Jianguomenbei Avenue, Beijing,China [uk.gsmbox.com]
Am I the only one who finds it interesting that a the government of China is spidering web sites? Okay, so it isn't the Gov. of China outright, but it is a company partially owned by them.
they must be looking for that unforthcoming "apology"
Thanks for the info littleman. They hit our site again and grab just about everything. I also wonder what they might be looking for.
220.127.116.11 which is a Gateway IP???
The original bot from China is now using two IPs.
And there are these. 166 is open on port 80, and running a win32 machine with apache.
inetnum 18.104.22.168 - 22.214.171.124
descr CHINANET Beijing province network
descr Data Communication Division
descr China Telecom
Ah, but thats like saying the British Government is spidering you if BT decide to start indexing pages:)
It is a bit strange though, why would they bother?
here's another one that hit me about a week back:
I've been wondering what's going on, they are very, very aggressive in getting at my stuff. It's not *that* interesting...oh well. Any takes on what to do about this one? I alread made my decision on these, but I'm curious what others are going to do. I figured let em have it, they'll probably laugh anyway.
They hit me too. About 20 requests per minute (one every 2-3 seconds). they also got robots.txt, which they seemed to honor (no deep crawl).
Atm. I'll let them crawl. they are, afaik just another spider. And I might eventually show up in a chinese portal/search engine.