Forum Moderators: open

Message Too Old, No Replies

Same-second hits from nec-labs using Java and -- Ken

If at first you don't succeed, do a switcheroo.

         

Pfui

3:51 am on Jul 1, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



UA: Java/1.5.0_04
UA: Ken
HOST: svext.nec-labs.com
IP: 138.15.10.10

LOG:

svext.nec-labs.com - - [30/Jun/2006:19:24:22 -0700]
"GET /robots.txt HTTP/1.1" 403 815 "-" "Java/1.5.0_04"

svext.nec-labs.com - - [30/Jun/2006:19:24:22 -0700]
"GET /dir1/file1.html HTTP/1.1" 302 223 "-" "Ken"

(Java is always 403'd. I've yet to figure out how to block no-dot UAs.)

NOTES:

From dnsstuff.com (excerpted):

IP address: 138.15.10.10
Reverse DNS: svext.nec-labs.com
Reverse DNS authenticity: [Verified]

NetRange: 138.15.0.0 - 138.15.255.255
CIDR: 138.15.0.0/16
NetName: NEC-LABORATORIES-AMERICA-INC

FYI:

NEC Laboratories America [nec-labs.com]

New bot Java/1.5.0_06 grabs all pages
[webmasterworld.com...]

Java/1.5.0_06 Spider Sighting, and Questions
[webmasterworld.com...]

incrediBILL

6:52 pm on Jul 1, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Also see:
[webmasterworld.com...]

Pfui

9:10 pm on Jul 1, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Thanks, Bill! I do make a point of searching WW for info before posting -- both to avoid dupes and because I may have posted something already and my neurons need refreshing:) I even keep the Google-WW search box up top.

But there's no Ken [google.com] bot info, and only one post for "nec-labs [google.com]" and it's totally unrelated. And re the other two, new GSA threads, the most 'current' GSA [google.com] posts are March, 2006, then 2003, and 2001. Yet you pulled up a thread from two days ago.

(Actually, I rarely find any of our current posts lately. It's been bugging me but not enough to file an official bug report. Yet. And just a few months ago, it felt like G cruised through here almost hourly! Probably outrageously bandwidth-costly but so convenient and extremely useful. Oh, well. I guess I'll just eyeball this (and each) forum's post titles back a bunch of pages.)

incrediBILL

10:34 pm on Jul 1, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Didn't imply you didn't check, and you have even more info as this one is kind of elusive IMO.

Just thought you might find their link to PlanetLabs experiments amusing.

Pfui

12:10 am on Jul 2, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Thanks, Bill, and no prob re checking. (I love looking up stuff; I just hate not finding stuff I know it's on WW. Somewhere.)

And the PlanetLabs info was definitely amusing. And interesting. And reinforcing of my decision years ago to err on the 403 side of darn near everything from anything-dot-planet-dot-anywhere, including:

Mozilla.*PlanetWeb

.planet.com
.speed.planet.nl
.theplanet.com
.earth.theplanet.net
.reverse.theplanet.net
.ipplanet.net
.planetarabia.com

(Alas,

SetEnvIfNoCase Remote_Host "planet" no_way
probably is a bit extreme:)

Remember how the very worst of the Host bunch was/is .reverse.theplanet.com? Gack. If I so much as see 'planet' in my logs, I start to twitch.

(Note to Self: Add .planet-lab.org)

incrediBILL

3:20 am on Jul 2, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



If I so much as see 'planet' in my logs, I start to twitch

Can you upload an MPEG of that?

We'd all like to see the Planet Twitch, could be a huge success on MTV