Forum Moderators: open

Message Too Old, No Replies

Deep crawl bot out?......strange?

crawlxx - not crawlerxx

         

kstprod

3:11 am on Nov 13, 2002 (gmt 0)

10+ Year Member



If I am right, then I am getting deep crawled for some reason. Any ideas? FreshBot spidered me this morning, but I was just suprised to see that the Deep Crawl Bot has also spidered me tonight.

I was told that...

crawlerXX.googlebot.com = FreshBot
crawlXX.googlebot.com = Deep Crawl

Both bots have spidered me today, so I supposed one of them has to be the Deep Crawl, right? Why would this be happening already, isn't this a bit early?

Thanks!

Karen

Slade

3:30 am on Nov 13, 2002 (gmt 0)

10+ Year Member



Most of my sites have already had their deep crawl this period.

Having said that, it does take a while to spider all 3 billion pages...

kstprod

3:35 am on Nov 13, 2002 (gmt 0)

10+ Year Member



Slade,

I already have too.....this is the second one for this month....I was deep crawled the 4th and 5th too. Any reason to be deep crawled twice?

Karen

Slade

4:10 am on Nov 13, 2002 (gmt 0)

10+ Year Member



Is the bot picking up the same pages she got before?

kstprod

10:55 am on Nov 13, 2002 (gmt 0)

10+ Year Member



Slade,

Yes, she picked up all the pages from the deep crawl starting the 4th, and then some. She has also picked up pages that I have created only a couple of days ago. Puzzles me.

Karen

Grumpus

11:28 am on Nov 13, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I've been being crawled continuously all month. Haven't checked to see if there are duplicate pages, but as far as I can tell, it's all the same crawl on my site...

G.

h_b_k

11:42 am on Nov 13, 2002 (gmt 0)

10+ Year Member



I have noticed this fact too.

my site (toolbar pr 5) has been full deep crawled today for the second time this month.

the same I have seen in october.

annej

5:43 pm on Nov 13, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I'm not getting
crawlerXX.googlebot.com
crawlXX.googlebot.com

Instead I appear to have visits from
crawler11.googlebot.com
crawler12.googlebot.com
crawler13.googlebot.com

I'm trying to figure out if I have been deep crawled as Google is still links old discontinued pages when I search.

Anne

cyndyb109

6:00 pm on Nov 13, 2002 (gmt 0)

10+ Year Member



Certain pages on our site have been re-spidered every week, some have not been spidered since August. Any comments on why Google in not picking up these pages? I can't form any pattern. Some of those spidered as late as yesterday have links from the index page and some do not. Some are product pages and some are content pages. Content was re-freshed on some very recently and yet they were not spidered. Content that has stayed the same on other pages for over 2 months was spidered. What am I missing?

Cyndy

troels nybo nielsen

6:10 pm on Nov 13, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Welcome to WW, Cyndy

> I can't form any pattern.

Not even Google is perfect.

optisoft

6:34 pm on Nov 13, 2002 (gmt 0)

10+ Year Member



In the last two days -

crawl4.googlebot.com
crawl5.googlebot.com
crawl7.googlebot.com
crawl1.googlebot.com

- has been to my site, deep crawling.

The bot is out and about ;)

mahlon

7:44 pm on Nov 13, 2002 (gmt 0)

10+ Year Member



it does take a while to spider all 3 billion pages...

Wow you have a big website there Slade! :o

Just kidding!

I have seen both crawlers here!

cyndyb109

6:46 pm on Nov 14, 2002 (gmt 0)

10+ Year Member



Thanks for the welcome:

I'm getting
crawler10.googlebot.com
crawler12.googlebot.com picking up almost the same pages every day for the last three. no deep crawls

Since we are an e-commerce site and part of it is password protected with robots txt, how can this affect what gets crawled that is not protected?

annej

4:46 am on Nov 15, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



How do you tell if it is a deep crawl?

Anne

andreasfriedrich

4:51 am on Nov 15, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



/crawl\d{1,2}.googlebot.com/  # deep crawler 
/crawler\d{1,2}.googlebot.com/ # fresh bot

Andreas

Hoople

4:58 am on Nov 15, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



googlebot.com (216.239.46.xxx) - Spider/Robot
12 Nov -- 21:09:37 -- 00:00 -- /robots.txt
12 Nov -- 21:09:37 -- -- /

Nothing since then. Deep crawl was Oct 30/Nov 2