[gigablast.com...]
Real time spidering on the AddUrl page.
[gigablast.com...]
Feels like: Infoseek
Tastes like: Peppermint (minty fresh ;)
Looks like: squeaky clean.
Relevance like: who cares. It's a new SE with instant AddUrl - done.
I have a few questions:
1. Have you figured out a user agent for your spider and if so, what will it be?
2. Robots.txt- will your spider obey it?
3. (Just for my amusement) What is your policy on cloaking?
Key_Master
1. Have you figured out a user agent for your spider and if so, what will it be?
it will be gigabot.
2. Robots.txt- will your spider obey it?
yes, except top level pages.
it obeys the meta disallow tags, too.
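For anyone who wants to test their robots.txt against that user agent before the spider shows up, here is a minimal sketch using Python's standard urllib.robotparser. The only thing taken from this thread is the name "gigabot"; the rules, the example.com URLs and the allowed() helper are made up for illustration, and the sketch does not model the "except top level pages" behaviour mentioned above.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: keeps "gigabot" out of /private/ and
# leaves everything open to all other crawlers.
ROBOTS_TXT = """\
User-agent: gigabot
Disallow: /private/

User-agent: *
Disallow:
"""


def allowed(user_agent, url, robots_txt=ROBOTS_TXT):
    """Return True if robots_txt lets user_agent fetch url."""
    parser = RobotFileParser()
    # parse() takes the file as a list of lines, so no network fetch is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("gigabot", "otherbot"):
        for path in ("/index.html", "/private/page.html"):
            url = "http://www.example.com" + path
            print(agent, path, allowed(agent, url))
```

This only tells you what a well-behaved crawler should do with those rules. Whether a given spider actually honours them is something you can only confirm from your own logs.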
3. (Just for my amusement) What is your policy on cloaking?
if it's used to abuse the search engine and mislead searchers, then my policy is to ban that site. I index the meta keyword/description tags so use those instead.
Thanks for the thrill. I agree, it reminds me of the old Infoseek rush. Those were the days.
If you wanted to stir up a bunch of web'sters, you've certainly done that. Good luck on your venture.
I searched for
place link <here> for googlebot
and it sorta went a bit off track. Check it out
[gigablast.com...]
First, littleman's UA and IP - I can't find it anywhere in the last week.
Compare this to the link in Brett's first post. If that's really a cached page, who is 'Guest from 208.254.87.133', if not the spider?
I can't tell if these pages are cached or not. I submitted a site about 8:00 am this morning, and the cached page shows an update 2 1/2 hours later. No sign of any spider in my logs, AT ALL.
I submitted 5 sites, they're all in, all saying today's date as having been spidered, but NO evidence that anything actually came around? The only spider that's been around lately is whizbang labs.
I'm glad the sites are in, and the SE looks promising. If I look at the data I have, it looks as if the sites were pre-spidered, and the cache (mirror) is showing the current page. I guess I could change a page and look at the cached copy to verify this. Has anybody seen evidence of a crawler? Or are my logs wrong?
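If anyone wants to run the same check on their own logs, here is a rough sketch that scans access log lines for a user-agent substring or a specific IP. It assumes the common "combined" log format; the sample lines, the 192.0.2.x addresses, the dates and the find_crawler_hits() name are all invented for illustration. Only the "gigabot" name and the 208.254.87.133 address come from this thread, and nothing here says that address belongs to the spider.

```python
import re

# Assumed layout: Apache-style "combined" log format.
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) \S+ '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"$'
)

# Made-up sample lines, not real log data.
SAMPLE_LINES = [
    '192.0.2.10 - - [01/Jan/2002:08:00:00 +0000] "GET /index.html HTTP/1.0" 200 1234 "-" "Mozilla/4.0 (compatible)"',
    '192.0.2.20 - - [01/Jan/2002:10:30:00 +0000] "GET /index.html HTTP/1.0" 200 1234 "-" "gigabot/1.0"',
    '208.254.87.133 - - [01/Jan/2002:10:31:00 +0000] "GET /page.html HTTP/1.0" 200 987 "-" "ExampleAgent"',
]


def find_crawler_hits(lines, agent_substring=None, ip=None):
    """Return parsed entries whose user agent contains agent_substring
    (case-insensitive) or whose client address equals ip."""
    hits = []
    for line in lines:
        match = LOG_LINE.match(line.strip())
        if match is None:
            continue  # skip lines that do not fit the assumed format
        agent_ok = (agent_substring is not None
                    and agent_substring.lower() in match["agent"].lower())
        ip_ok = ip is not None and match["ip"] == ip
        if agent_ok or ip_ok:
            hits.append(match.groupdict())
    return hits


if __name__ == "__main__":
    for hit in find_crawler_hits(SAMPLE_LINES, "gigabot", "208.254.87.133"):
        print(hit["time"], hit["ip"], hit["request"], hit["agent"])
```

An empty result for both the agent and the IP over the submission window would support the pre-spidered theory, or mean the log format doesn't match the pattern, so it is worth checking that ordinary visitor lines parse first.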