homepage Welcome to WebmasterWorld Guest from 54.237.125.89
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member

Visit PubCon.com
Home / Forums Index / Search Engines / Search Engine Spider and User Agent Identification
Forum Library, Charter, Moderators: Ocean10000 & incrediBILL

Search Engine Spider and User Agent Identification Forum

    
Very annoying robot
msgraph




msg:396110
 12:57 pm on Dec 6, 2000 (gmt 0)

Is anyone else out there getting hammered by 200 variations of robots from petersnews.com? Too many to list but the UA's look like...

user135.petersnews.com
user136.petersnews.com
user137.petersnews.com

216.200.195.53-216.200.195.60

They all come from Dave's Engine. They have similar UA's like...

user135.ip3000.com
user136.ip3000.com
user137.ip3000.com

It's really annoying to see some small engine like this one, go out and spider like crazy without showing any end results. Actually they don't even offer searches yet.

I have been pretty lenient in the past since I support anyone who is trying to start something on their own. Now I am really thinking about banning their IP and UA from the server. Does anyone know what these guys plan on doing?

 

Machiavelli




msg:396111
 2:01 pm on Dec 6, 2000 (gmt 0)

Ah, now Davesengine, there is a thing. How many different sites are just different forms of this same thing? My list is:

Davesengine
Internet Portal 3000
Image Lock
Bit Shark

...anything I've missed?

mglick




msg:396112
 8:31 pm on Jan 30, 2001 (gmt 0)

My site started getting hammered by it as well. Anyone else? I'm not too thrilled about giving up bandwidth to a site engine that doesn't really exist.

Froggyman




msg:396113
 8:52 pm on Jan 30, 2001 (gmt 0)

Dave's Engine [208.185.131.220]

I dug into this site a bit and I must say the pages are not very professional looking. Looks like a high school student put it together.

BoneHeadicus




msg:396114
 10:24 pm on Jan 30, 2001 (gmt 0)

Daves Engines' Office [anywho.com]
This is the address from the bottom of their web site. PMB stands for Personal Mail Box...ie. MailBoxes etc.

Dave's Engine
PMB #74
350 Bay Street
San Francisco, CA 94133
415 772-9198

Does anyone know the area there????

BoneHeadicus




msg:396115
 10:37 pm on Jan 30, 2001 (gmt 0)

I dont know for sure....

Registrant:
Dave Rand (BUNGI-DOM)
15864 Highland Drive
San Jose, CA 95127-1743 US

Domain Name: BUNGI.COM

Administrative Contact, Technical Contact, Billing Contact:
Rand, David (DR223) dlr@BUNGI.COM
15864 Highland Drive
San Jose, CA 95127-1743
+1 408 254-7111

Record last updated on 12-Jan-2001.
Record expires on 07-Feb-2003.
Record created on 05-Feb-1990.
Database last updated on 30-Jan-2001 14:59:08 EST.

Froggyman




msg:396116
 10:53 pm on Jan 30, 2001 (gmt 0)

I'd put my money on this one [networksolutions.com]. Who say's his name has to be Dave?

"Dave" first aquired the following domains in May 2000

davesengine.com
davesengine.net
davesengine.org
petersnews.com
petersnews.net
petersnews.org

These domains will expire in May 2001 which begs the question- Why register a serious search engine site for only one year?

skirril




msg:396117
 11:15 pm on Jan 30, 2001 (gmt 0)

Good question.

I was hit once by userxxx.ip3000.com, but never saw them again (That was like July 2000).

Apart from copyright infringing activities, is there any reason I'd hide info on my robot?

Also if I want to do a search engine would I not tell my robot to ocme back regularly, to check for expired links. The internet is a very changing medium. Spiodering the web once and then living on that data just dones't cut it.

BoneHeadicus




msg:396118
 11:45 pm on Jan 30, 2001 (gmt 0)

How about this one????

organization-name WILDBRAIN.COM Wildbrain (WILDBRAIN-DOM)
organization-postal 2650 18th Street
organization-postal San Francisco, CA 94110
organization-country-code US
handle NETBLK-ABOVE-WILDBRAIN
network-name NET-ABOVE-WILDBRAIN
class-ip-network 216.200.195.128
class-ip-network-block 216.200.195.191
ip-network 216.200.195.0/24
admin-contact Garza, Michael
admin-contact MG13815
admin-contact michaelg@WILDBRAIN.COM
admin-contact 415-553-8000 (FAX) 415-553-8009
updated 24-Apr-2000

Froggyman




msg:396119
 12:46 am on Jan 31, 2001 (gmt 0)

I'm sure Dave will be around here shortly. Maybe he'll be kind enough to post some info about his spiders when he visits.

I too, have been hit hard by these petersnews.com spiders but that was 8 months ago and like skirril says, spidering the web once and then living on the data just doesn't cut it.

PeteU




msg:396120
 1:05 am on Jan 31, 2001 (gmt 0)

Pretty nasty spider trap [bungi.com...]
on Dave's emailing domain

Air




msg:396121
 3:11 am on Jan 31, 2001 (gmt 0)

Nasty indeed! Guess if you're Dave you need this kind of protection?

Froggyman




msg:396122
 3:47 am on Jan 31, 2001 (gmt 0)

I found this [dotcomeon.com].

BoneHeadicus




msg:396123
 4:50 am on Jan 31, 2001 (gmt 0)

Thats a good read froggyman.

Dave if you're reading this tonight, please come forward. We promise to treat you fairly and although we may blacklist you and block your IP, we still respect you.

You've been slurping files for no particular reason for many months now and quite frankly we all want to see this dark chapter in internet history come to an end.

Listen to your conscience Dave. Do the right thing and go on home.

Go home and unplug that OC 12 you're so visciously abusing with that contraption you call an engine. Stop your haphazard misuse of innocent citizens and their servers. Face the reality Dave. Look in the mirror and ask yourself, why?

Why have I wasted all this time and offended so many people? What have I got to show for all of this mayhem in the log files of all those trusting, patient webmasters who have been waiting to see DavesEngine finally emerge?

Face reality Dave...it isn't going to happen. Go home.

Go home and do something useful like burning your name into the workbench with your soldering iron. Or hook up the oscilloscope to the stereo and crank up Tom Petty's "Don't Come Around Here No More" and watch the sqiggly lines dance to the beat. Find something else to do with your time.

And if you're really serious about starting an engine Dave, I know somebody who might be interested in selling their Mickey Mouse operation that didn't quite work out either...

Machiavelli




msg:396124
 10:22 am on Jan 31, 2001 (gmt 0)

Or, of course, he might just be having a laugh.

lak12




msg:396125
 7:26 am on Mar 7, 2001 (gmt 0)

Hi everybody!
Well, I have about 100 visits a day on almost every domain I mange (sometimes the number is over 500 hits) which makes it totally worthless.
At first I gave it a green light, but now it's in my black list and until I see the Daves' robot working as a search engine - this robot gonna get some simple 5 bite long page.

han solo




msg:396126
 2:23 pm on Mar 7, 2001 (gmt 0)

To add my frustrations, and to list a few ips I got hit with this morning. This time, the User agent is generic, so I banned them by ip.

208.185.131.219 Mozilla/3.01C-PBWF (Unix; I)

through

208.185.131.222

I haven't checked all of my logs, so I'm not betting these are the only ones. If you find any others, that behaved this way, please post away. I'm tired of this guy wasting my bandwidth, too.

Cheers,

Han solo

lak12




msg:396127
 4:16 pm on Mar 7, 2001 (gmt 0)

my guess is that davesengine got the picture and trying to scam fellow webmasters. IP backtrace shows porky.davesengine.com for 208.185.131.219 and so. I am starting to ban them big time by blocking the whole set of IPs etc.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Search Engines / Search Engine Spider and User Agent Identification
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved