Forum Moderators: open

Message Too Old, No Replies

DOD Web Scans

Department of Defense Robots

         

hbarker

5:24 pm on Apr 20, 2002 (gmt 0)



I run a military related website, the Korean War Project. Korean language email bots have harvested our email, so four days ago I put a trap on our Home page and other pages. Within hours we caught several DOD sites going for our robots.txt and our invisible Bot Trap endless loop pages. It's not a spoof, DOD has confirmed we are being hit by Signal Intelligence OSI computers, and has already admitted this is not legal. From prior Bot Trap logs, we found they would repeatedly go straight to our Membership.html page and pass a +Fetch+API+Request

What can they get from these scans? m.html is the Bot Trap URL.

2002-04-18 09:15:30 <snip> Server <snip> Get /html/membership.html - 200 9684 HTTP/1.0 Mozilla/4.0 ompatible,+MSIE+5.5,+Windows+NT+5.0)+Fetch+API+Request - -

2002-04-18 09:16:02 <snip> Server <snip> Get /html/m.html - 200 9684 HTTP/1.0 Mozilla/4.0+(compatible,+MSIE+5.5,+Windows+NT+5.0)+Fetch+API+Request - -

and

2002-04-19 02:20:01 <snip> - SERVER <snip> GET /html/units/m.html - 200 10916 HTTP/1.0 Mozilla/4.0+(compatible,+MSIE=4.01,Windows+NT,+MS+Search+4.0+Robot)+Microsoft Webtrends XXXX

Some of the requests vary with:

+Microsoft+Scheduled+Cache+Content+Download+Service

(edited by: Brett_Tabke)

Brett_Tabke

6:21 pm on Apr 20, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



Who knows. Various govts around the world, all have programs in place to scan the web. It's nothing new. Stuff like this has occured for many years.

wilderness

9:52 pm on Apr 20, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



You can deny the entrie country with these blocks
[apnic.net...]

or
if you have limited visitors from the remainder of Asia (which BTW includes Austraila and NZ)

deny from 210.
deny from 211.
you might add in these also
61.248.0.0 - 61.255.255.255
61.96.0.0 - 61.111.255.255

Everyman

10:52 pm on Apr 20, 2002 (gmt 0)



wilderness: The original poster was complaining about U.S. *.mil requests for robots.txt, not Korean domains. But when Brett snipped the domains, the post became confusing.

hbarker: I don't think you have anything to worry about. I examined my logs. My website has information that may be of interest to Pentagon types on occasion; I had over 3,200 GETs in the last 20 days from *.mil domains. Of these, I had about 20 requests for my robots.txt. But looking at each of these, they are not spider-related at all. I can tell from the requests for GIFs, Java applets, and CGI searches whether it's a spider. These weren't spiders.

My best guess is that some bored Pentagon types are surfing, and some of them took a class in information warfare, and the teacher mentioned that by looking at the robots.txt you can see which directories are forbidden to spiders. This information is useful because it shows you the layout of the site to some extent, and clues you into which directories may be especially interesting (to info-warfare types).

Harmless fun, I suspect.