Flunky has been crawling my site at the rate of about one page per hour. Never seen or heard of it before... picks up my robots.txt, and also looks at my php pages.
Anyone know anything about it?
My searches didn't reveal anything ...
From ggrot -
I saw it too - just today. No insight on it though. The site it spidered isn't very high profile, so it is probably bigger than some individual on a cable modem.
66.28.20
% This is the RIPE Whois server.
% The objects are in RPSL format.
% Please visit [ripe.net...] for more information.
% Rights restricted by copyright.
% See [ripe.net...]
inetnum: 0.0.0.0 - 255.255.255.255
netname: IANA-BLK
descr: The whole IPv4 address space
country: NL
admin-c: IANA1-RIPE
tech-c: IANA1-RIPE
status: ALLOCATED UNSPECIFIED
To me that reads as -- "ripe doesn't know".
So I tried the US equiv.
[arin.net...]
Cogent Communications (NETBLK-COGENT-NB-0000)
1015 31st Street, NW
Washington, DC 20007
US
Netname: COGENT-NB-0000
Netblock: 66.28.0.0 - 66.28.127.255
Maintainer: COGC
Coordinator:
Cogent Communications (ZC108-ARIN) dns@cogentco.com
+1-877-875-4311
Domain System inverse mapping provided by:
AUTH1.DNS.COGENTCO.COM 66.28.0.14
AUTH2.DNS.COGENTCO.COM 66.28.0.30
ADDRESSES WITHIN THIS BLOCK ARE NON-PORTABLE
Record last updated on 22-Aug-2001.
Database last updated on 17-Sep-2001 23:12:41 EDT.
So what are Cogent up to?
(I think Cogent are some military communications company related to Nortel ... talks about optical on the website???)
[edited -- wrong ip submitted to arin -- * slaps forehead *]
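For anyone who wants to test addresses from their own logs against that ARIN netblock, here is a minimal sketch using Python's standard `ipaddress` module. The range is copied from the record above; the helper name `in_netblock` is just something made up for illustration.

```python
import ipaddress

# Netblock copied from the ARIN record quoted above.
first = ipaddress.ip_address("66.28.0.0")
last = ipaddress.ip_address("66.28.127.255")

# Collapse the start/end range into CIDR form (this range is a single /17).
cidrs = list(ipaddress.summarize_address_range(first, last))

def in_netblock(addr):
    """Return True if addr falls inside the netblock above (hypothetical helper)."""
    ip = ipaddress.ip_address(addr)
    return any(ip in net for net in cidrs)

print(cidrs)
print(in_netblock("66.28.20.194"))
```

Note that this only tells you whether an address sits in that one Cogent block; it says nothing about who is actually using the address.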
From metacarta.com:
Company Profile
MetaCarta maps information to the physical world, increasing the value of an organization's knowledge by leveraging existing data. In a world of global competition, understanding complex relationships determines success. MetaCarta digests massive amounts of information in a novel way, creating a cohesive picture to accelerate decision making.
blahhblahhblahh...
If you do a traceroute on any of the .12* ips the last hop is 66.28.20.194, which has the same type of authentication message -> "Pleas contact jrf with questions."
It's fairly low impact (on my site), so I think I'll hold off on banning it just in case it turns out to be something cool.
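If anyone does decide to shut it out later, a robots.txt rule is the gentlest option. Here is a rough sketch that uses Python's standard `urllib.robotparser` to check how such a rule would be read. It assumes the bot's user-agent token really is "Flunky" (nobody in this thread has confirmed the exact string) and that the bot honors robots.txt at all, which is also unverified. The page path is made up.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; "Flunky" as the user-agent token is an assumption.
rules = [
    "User-agent: Flunky",
    "Disallow: /",
]

rp = RobotFileParser()
rp.parse(rules)

# A compliant parser should refuse the named bot and allow everyone else.
flunky_allowed = rp.can_fetch("Flunky", "/page.php")
other_allowed = rp.can_fetch("Googlebot", "/page.php")
print(flunky_allowed, other_allowed)
```

A bot that ignores robots.txt would need a server-level block instead.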
MetaCarta maps information to the physical world, increasing the value of an organization's knowledge by leveraging existing data. In a world of global competition, understanding complex relationships determines success. MetaCarta digests massive amounts of information in a novel way, creating a cohesive picture to accelerate decision making.
Translated:
We steal your content and sell it?
RRC (crawler_admin@bigfoot.com)
This means nothing to me (I guess we could mail them :-). It only hit three pages, so that's pretty small beer for now.
66.28.68.234
66.28.68.235
66.28.68.236
66.28.68.237
66.28.250.171
66.28.250.172
66.28.250.173
66.28.250.174
Some substantiation comes from the last two hops on a traceroute:
19 65 ms 70 ms 71 ms metacarta.demarc.cogentco.com
20 70 ms 66 ms 68 ms 66.28.68.237
As reported in the New York Times [nytimes.com] last January, MetaCarta is keen on getting a spook contract from the feds for Internet snooping and mapping.
Contrary to some of the posts above, I don't see any relation between MetaCarta and bigfoot.com. The former is in Massachusetts, the latter in the Philippines. While MetaCarta has a website that says almost nothing, if you look under "jobs" you will see that they want a sales engineer with a security clearance to work out of Washington DC (where cogentco.com is located).
I suspect that they got their spook contract. Does anyone have any other IP numbers for MetaCarta crawlers?
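For anyone who wants to check their own logs against the eight addresses listed above, here is a minimal sketch. It assumes a common-log-style file where the client IP is the first whitespace-separated field; the two sample lines are invented for illustration, not taken from anyone's real logs.

```python
import ipaddress

# Addresses reported earlier in this thread.
REPORTED = {
    ipaddress.ip_address(a)
    for a in (
        "66.28.68.234", "66.28.68.235", "66.28.68.236", "66.28.68.237",
        "66.28.250.171", "66.28.250.172", "66.28.250.173", "66.28.250.174",
    )
}

def crawler_hits(log_lines):
    """Yield log lines whose first field is one of the reported IPs."""
    for line in log_lines:
        fields = line.split(maxsplit=1)
        if not fields:
            continue  # skip blank lines
        try:
            ip = ipaddress.ip_address(fields[0])
        except ValueError:
            continue  # first field is a hostname or garbage, not an IP
        if ip in REPORTED:
            yield line

# Invented sample lines; swap in open("access_log") for a real file.
sample = [
    '66.28.68.237 - - [17/Sep/2001:23:12:41 -0400] "GET /robots.txt HTTP/1.0" 200 68',
    '192.0.2.10 - - [17/Sep/2001:23:13:02 -0400] "GET /index.php HTTP/1.0" 200 5120',
]
hits = list(crawler_hits(sample))
print(len(hits))
```

If your server logs hostnames instead of IPs, you would have to match on the hostname (or resolve it) rather than parse the first field as an address.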