As for spider IPs, you can check Brett's extensive list here:
http://www.searchengineworld.com/spiders/spider_ips.htm
Their site is at: [whizbang.com...]
From their "About Us" file on site:
WhizBang! Labs has developed software that builds application-specific databases by automatically finding and extracting user-defined content from an unlimited number of Web pages located anywhere on the internet. The company's proprietary software:
Crawls the Web, searching for and identifying new domains
Classifies pages in each domain, identifying those that contain the user defined target data
Captures the target data, extracting it from the pages it has found and classified, whether that target data is embedded in the text or stored behind forms
Compiles the extracted data, storing it in a relational database where it can then be searched, sorted, filtered, and otherwise manipulated with traditional RDBMS tools, either directly or through a public or private portal
Seems they're selling their software.
The spider's IP does not resolve to a hostname. It is, however, in an entirely different range from that of www.whizbang.com (216.160.248.170).
As this spider does not serve any search engine, we categorize it as "DC" (decloaking hazard). This means that if you are cloaking your site, you should not feed it cloaked or phantom pages.
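For anyone who wants to repeat the range check on their own visitors, here is a minimal Python sketch. It compares the spider IP from the log excerpt in this thread (216.250.143.106) against the www.whizbang.com address quoted above. The helper names `same_network` and `reverse_name` are made up for illustration, and treating "same range" as "same /16 block" is only an assumption; pick whatever prefix length matches how the provider's addresses are actually allocated.

```python
import ipaddress
import socket


def same_network(ip_a: str, ip_b: str, prefix_len: int = 16) -> bool:
    """Return True if ip_b lies in the /prefix_len block that contains ip_a."""
    # strict=False lets us pass a host address and have the host bits masked off.
    net_a = ipaddress.ip_network(f"{ip_a}/{prefix_len}", strict=False)
    return ipaddress.ip_address(ip_b) in net_a


def reverse_name(ip: str):
    """Return the reverse-DNS hostname for ip, or None if it does not resolve."""
    try:
        return socket.gethostbyaddr(ip)[0]
    except OSError:  # covers socket.herror / socket.gaierror
        return None


SPIDER_IP = "216.250.143.106"  # WhizBang! Lab visitor in the log excerpt
SITE_IP = "216.160.248.170"    # www.whizbang.com, as quoted in this post

# Assumption: a /16 comparison is a reasonable stand-in for "same range".
print(same_network(SPIDER_IP, SITE_IP))
```

Calling `reverse_name(SPIDER_IP)` needs a working resolver, so its result depends on when and where you run it; a `None` just means there is no reverse record, as reported above.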
Hope this helps.
ROBOT 2000-03-11, 20:56 -- 204.162.96.124 -- 204.162.96.124 --
/dir2/page-2.html -- -- InfoSeek Sidewinder/0.9
ROBOT 2000-03-11, 20:56 -- 204.162.96.124 -- 204.162.96.124 --
/dir2/page-2a.html -- -- InfoSeek Sidewinder/0.9
ROBOT 2000-03-11, 20:56 -- 204.162.96.124 -- 204.162.96.124 --
/dir2/page-2b.html -- -- InfoSeek Sidewinder/0.9
ROBOT 2000-03-11, 20:56 -- 204.162.96.124 -- 204.162.96.124 --
/dir2/page-2c.html -- -- InfoSeek Sidewinder/0.9
ROBOT 2000-03-11, 20:56 -- 204.162.96.124 -- 204.162.96.124 --
/dir2/page-2d.html -- -- InfoSeek Sidewinder/0.9
ROBOT 2000-03-11, 20:57 -- 204.162.96.124 -- 204.162.96.124 --
/dir2/page-2e.html -- -- InfoSeek Sidewinder/0.9
USER 2000-03-12, 06:55 -- 216.250.143.106 -- 216.250.143.106 --
/dir2/page-2a.html -- -- WhizBang! Lab
USER 2000-03-12, 06:56 -- 216.250.143.106 -- 216.250.143.106 --
/dir2/page-2b.html -- -- WhizBang! Lab
USER 2000-03-12, 06:56 -- 216.250.143.106 -- 216.250.143.106 --
/dir2/page-2e.html -- -- WhizBang! Lab
USER 2000-03-12, 06:56 -- 216.250.143.106 -- 216.250.143.106 --
/dir2/page-2ba.html -- -- WhizBang! Lab
USER 2000-03-12, 06:56 -- 216.250.143.106 -- 216.250.143.106 --
/dir2/page-2.html -- -- WhizBang! Lab
USER 2000-03-12, 06:56 -- 216.250.143.106 -- 216.250.143.106 --
/dir2/page-2b.html -- -- WhizBang! Lab
USER 2000-03-12, 06:56 -- 216.250.143.106 -- 216.250.143.106 --
/dir2/page-2c.html -- -- WhizBang! Lab
USER 2000-03-12, 06:57 -- 216.250.143.106 -- 216.250.143.106 --
/dir2/page-2d.html -- -- WhizBang! Lab
And here's what he has to say about it:
"From its quick following of Infoseek Sidewinder (..) it sure looks like a checking bot. It is unlikely that WhizBang could have found these pages any other way. They are only days old and have never been visited by anyone else. They were submitted by e-mail to Infoseek. The only link to these pages is a hallway page (which, because of Infoseek's recent policy of spidering only the home page, has a link on the home page), which was also submitted only to Infoseek. (Had I known that IS would spider from e-mail so quickly, I wouldn't have bothered with the home page - hallway link.) It does not appear that WhizBang visited either the home page (index.htm) or the hallway page (which it could only have gotten from Infoseek anyway if it never hit the home page). I will reexamine the logs going back a day or two and see if I missed a WhizBang visit at an earlier date where it might have picked up this link. At the moment, though, it looks VERY suspicious. Good thing Infoseek never updates their index - I think this site is screwed."
I'm inclined to agree that it looks rather fishy - anyone got similar logs to check out this issue?
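If you want to run the same check against your own logs, something along these lines might help. It is only a sketch: it assumes your log uses the same two-line layout as the excerpt above (type, date, time, IP, then path and user agent), and the names `parse_log` and `followed_pages` are invented for this example. The sample entries are copied from the excerpt in this thread.

```python
import re
from datetime import datetime

# One entry spans two lines in the pasted log, so \s+ is allowed to cross the break.
ENTRY = re.compile(
    r"(ROBOT|USER)\s+(\d{4}-\d{2}-\d{2}), (\d{2}:\d{2}) -- (\S+) -- \S+ --\s+"
    r"(\S+) -- -- (.+)"
)

SAMPLE = """
ROBOT 2000-03-11, 20:56 -- 204.162.96.124 -- 204.162.96.124 --
/dir2/page-2.html -- -- InfoSeek Sidewinder/0.9
ROBOT 2000-03-11, 20:56 -- 204.162.96.124 -- 204.162.96.124 --
/dir2/page-2a.html -- -- InfoSeek Sidewinder/0.9
USER 2000-03-12, 06:55 -- 216.250.143.106 -- 216.250.143.106 --
/dir2/page-2a.html -- -- WhizBang! Lab
USER 2000-03-12, 06:56 -- 216.250.143.106 -- 216.250.143.106 --
/dir2/page-2ba.html -- -- WhizBang! Lab
USER 2000-03-12, 06:56 -- 216.250.143.106 -- 216.250.143.106 --
/dir2/page-2.html -- -- WhizBang! Lab
"""


def parse_log(text):
    """Return a list of (timestamp, ip, path, user_agent) tuples."""
    visits = []
    for _kind, day, clock, ip, path, agent in ENTRY.findall(text):
        when = datetime.strptime(f"{day} {clock}", "%Y-%m-%d %H:%M")
        visits.append((when, ip, path, agent.strip()))
    return visits


def followed_pages(visits, leader, follower):
    """Map each path the follower fetched after the leader to the shortest gap."""
    first_leader = {}
    for when, _ip, path, agent in visits:
        if agent.startswith(leader):
            first_leader[path] = min(when, first_leader.get(path, when))
    gaps = {}
    for when, _ip, path, agent in visits:
        if agent.startswith(follower) and path in first_leader:
            gap = when - first_leader[path]
            # Ignore visits that came before the leader ever saw the page.
            if gap.total_seconds() >= 0 and (path not in gaps or gap < gaps[path]):
                gaps[path] = gap
    return gaps


for path, gap in followed_pages(parse_log(SAMPLE), "InfoSeek Sidewinder", "WhizBang").items():
    print(path, gap)
```

Pages that only the second agent requested (like page-2ba.html in the sample) are left out of the result. A short, consistent gap on pages nobody else has visited is suggestive, but it is not proof that one bot is fed by the other.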
Mike Mackin, in a similar thread, noted that the spider is used by Flipdog.com, who claim to collate job information (see [flipdog.com...]), though this doesn't seem a terribly efficient way to find jobs!
Whiz sells software which can be configured.
[whizbanglabs.com...]
Contacting them is a waste of your time and theirs unless you want to purchase their software.
What you need to do is have a hit man eliminate the person using the software
or
just add the IP to your .htaccess ;-)
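For the less drastic option, here is a small Python sketch that prints the lines you would paste into .htaccess for the WhizBang IP seen in the logs. The function name `deny_rules` is made up. Which syntax you need is an assumption about your server: the `Require` form is for Apache 2.4, the `Order`/`Deny` form is for older Apache versions, and your host has to allow these directives in .htaccess at all.

```python
import ipaddress


def deny_rules(ips, apache_24=True):
    """Render .htaccess lines that refuse requests from the given addresses."""
    # ip_address() raises ValueError on a mistyped address, so typos never reach the file.
    checked = [str(ipaddress.ip_address(ip)) for ip in ips]
    if apache_24:
        body = ["    Require all granted"]
        body += [f"    Require not ip {ip}" for ip in checked]
        return "\n".join(["<RequireAll>", *body, "</RequireAll>"])
    # Older access-control syntax (Apache 2.2 and earlier).
    lines = ["Order Allow,Deny", "Allow from all"]
    lines += [f"Deny from {ip}" for ip in checked]
    return "\n".join(lines)


# 216.250.143.106 is the WhizBang! Lab address from the log excerpt.
print(deny_rules(["216.250.143.106"]))
```

Keep in mind that this blocks only the one address; a spider that comes back from another IP will get through until you add that one too.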