Forum Moderators: open
-.-.216.239.46.43.crawl2.googlebot.com - - [07/Feb/2003:09:57:28 +0700] "
-.-.216.239.46.184.crawl7.googlebot.com - - [07/Feb/2003:09:57:37 +0700] .
-.-.216.239.46.88.crawl4.googlebot.com - - [07/Feb/2003:09:57:46 +0700]
-.-.216.239.46.48.crawl2.googlebot.com - - [07/Feb/2003:09:58:10 +0700]
-.-.216.239.46.27.crawl1.googlebot.com - - [07/Feb/2003:09:58:11 +0700]
-.-.216.239.46.140.crawl5.googlebot.com - - [07/Feb/2003:09:58:11 +0700]
If it shows up as "Googlebot/2.1 (+http://www.googlebot.com/bot.html)" instead of 'crawl2.googlebot.com', does it make any difference?
No, that's just a difference in the way different web servers log requests.
crawlXX.googlebot.com is the host name
Googlebot/2.1 (+http://www.googlebot.com/bot.html) is the user agent name.
Some logs won't log the host name, just the ip address. Some logs won't log the user agent name. Some logs will log both.
coolshop -
Record number of pages crawled per day for me
on the 6th (tens of thousands), still going
strong (although not as many pages) on the 7th,
still crawling.
I see that you have no post count. Evidently an administrator decided that this thread does not count. But I hope that you will join discussions in other threads too.
Pegasus
you will be surprised to see how many members have thousands of pages. Others have less than hundred. The diversity in this community is incredible and one of its strengths.
I'm wondering if the sites are penalized for some reason. I'm still getting surfer SE traffic from last month's crawl and the feb update, but the fact that the crawler isn't here yet is worrying me.
Anyone else still not seen the bot yet?
.any help will be greately appreciated
These hits were coming each 30 seconds.
PR3 page at the moment- really hope i see higher PR around 5ish.. which is where im grouped in the crawl que it seems.
/pray
It's a mission to find the most appropriate deepcrawl thread to post our comments on, and all the extra threads push the other valuable topics down the page toward oblivion.
Yep, got's about 60 links there, about every message board index plus a few other pages. They have been listed for atleast four days. Instead of dumping them at the next fresh update, it just keeps updating them. So if your very lucky, it treats you like your listed in the main database! (Might be because the site is listed in ODP.)
Last updated according to Google: Feb 10, 2003, and Feb 11, 2003.
As of about 2:00 AM this morning, it's deepcrawled atleast 1166 pages.
[edited by: Marcia at 1:16 am (utc) on Feb. 14, 2003]
[edit reason] removed link, no specifics please [/edit]
For example, this week I had 7091 pages, last week 6806 pages, and the week before 1518 pages requested by the deepcrawler (216.239.*).
The freshbot has been a lot quieter recently, but normally visits my site most days. This week 544 pages crawled by the freshbot (64.68.*), last week 380 requests, and the week before 1337 requests.