Is Freshbot now Deepbot?

The line is getting drawn ever thinner

         

trillianjedi

4:18 pm on May 22, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I've seen several postings about this now in the last few days, although this is my first actual experience of it.

I'm being hit very hard by Google's freshbot at the moment, and it's going deep too. At first glance at what is currently going on with the little guys, I had to check and double check that the IPs were 64.... (they are).

Its behaviour, in terms of hard hitting and depth of crawl (it's going through the entire site), is more like that of the old deepbot.

In fact, it's identical behaviour to deepbot the last time it crawled this site back in April.

I'm interested in hearing from others who are seeing the same.

TJ

ncsuk

10:29 am on May 28, 2003 (gmt 0)

10+ Year Member



My site gets crawled about 3 or 4 times a day. I have also noticed that FB seems to do some sort of directory browsing, because it picked up a ton of pages I have in a test directory called "test1" on my server, to which there are 0 links.

Kind of a problem really.

vincevincevince

10:38 am on May 28, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



ncsuk....
just a guess, but maybe you followed a link from one of your test pages which ended up in someone else's referrer logs, which Google crawled?

x_m

10:53 am on May 28, 2003 (gmt 0)

10+ Year Member



I don't see what this indicates...freshbot has always been about finding new links and new pages.

I agree. I have been playing with freshbot for a few months, and it seems to pick up everything that is linked from a page the bot considers important, regardless of whether the linked page is old or brand new. I can't see any evidence of deepbot-like behaviour (i.e. spidering low-PR pages on new domains).

XM

vik_c

7:17 am on May 29, 2003 (gmt 0)

10+ Year Member



Maybe the guys at G-plex just reassigned IPs. So the IP addresses (64..) we've attributed to Freshbot now belong to Deepbot.

trillianjedi

12:09 pm on May 29, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Remember guys that GG only stated that he was "glad that people noticed freshie behaving like deepbot" and not that this was going to be the new way from now on.

What was concluded in this thread (from evidence left in site logs) is that, for this particular update, freshbot was crawling the April deepbot data and not the sites direct.

I think it was generally agreed that this was a means of getting the rolled-back index (possibly March or Feb.) that was the basis for Dominic reasonably up to date. In other words, they freshbotted in the April deepcrawl data.

We noticed, GG responded to say "glad you noticed".

There is nothing to indicate at the moment that this is the new way for the future. Only speculation (by myself included).

Fun though speculation is, it's also quite dangerous.

TJ

g1smd

12:24 am on May 30, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Hmm, with the 23 May 2003 fresh tags, my #1 and #2 sites suddenly dropped to #64 and #6. Both are index.html pages that have been top of the rankings for nearly 2 months (occasionally dropping 1 or 2 places with another fresh site temporarily placed higher), and were new sites first published late March. I had just changed the #2 page to have two outgoing links (the new one to another new site) instead of only one to the, as was, #1 page. At the same time the #1 page dropped from PR6 to PR4. I removed the link and yesterday these pages bounced back to #14 and #1 with 27 May 2003 fresh tags. I don't know if the link had anything to do with it or whether it was a Google burp; but this was the first time in ages the sites changed position by more than 2 places.

However, I notice that the date format for the fresh tags on -sj and -fi and others is now the old US format of May 27, 2003 instead. I'm in the UK. Another subtle change. These always had 27 May 2003 format dates before.

bether2

12:36 am on May 30, 2003 (gmt 0)

10+ Year Member



So, we are now saying that we are not going to see deepbot anymore, i.e. we will only see freshbots, which means the monthly update is not going to take place again...

darkroom,

GoogleGuy has said in the following thread that we should expect "at least another update of the form where the crawl/index cycle finishes and then data centers are updated in the traditional dance."

[webmasterworld.com...]
(search this thread for the two references to "traditional update")

Beth

mfishy

1:50 am on May 30, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



bether2,

I am glad you pointed this out again.

It seems many keep wondering if we are now in some sort of perpetual update process while GoogleGuy plainly stated that we should expect at least one more traditional update (made it sound like many more as the system is new).
It's almost as though every other post should have this dropped in.

IMHO, we will see a huge change in SERPs after the next update, when we see some recent backlink data.

Critter

11:38 pm on Jun 1, 2003 (gmt 0)

10+ Year Member



Deepbot (from freshbot IPs) is definitely in my site now, because it's getting pages that have never been fetched by Google before.

Before this time the Googlebot (fresh I guess) was simply retrieving pages that had been crawled in April.

So...this leads me to believe that the deepcrawl has started (at least for me), although it may have started some time ago with higher PR sites.
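For anyone who wants to run the same check on their own logs, here is a rough sketch of the comparison. It assumes Apache "combined" format access logs and a user agent containing "Googlebot"; the function names, the "64." prefix default, and the sample lines are all made up for illustration and are not taken from anyone's real logs.

```python
import re

# Start of an Apache "combined" log line (assumed format; adjust for your server).
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]+\] "(?:GET|HEAD) (?P<path>\S+)[^"]*" '
    r'\d{3} \S+ "[^"]*" "(?P<agent>[^"]*)"'
)

def googlebot_paths(lines, ip_prefix=""):
    """Return the set of paths fetched by a Googlebot user agent
    from IPs starting with ip_prefix (empty prefix = any IP)."""
    paths = set()
    for line in lines:
        m = LOG_LINE.match(line)
        if m and m.group("ip").startswith(ip_prefix) and "Googlebot" in m.group("agent"):
            paths.add(m.group("path"))
    return paths

def newly_crawled(earlier_lines, recent_lines, ip_prefix="64."):
    """Paths fetched recently from ip_prefix that Googlebot never
    fetched (from any IP) in the earlier log."""
    before = googlebot_paths(earlier_lines)
    after = googlebot_paths(recent_lines, ip_prefix=ip_prefix)
    return sorted(after - before)

# Made-up sample lines, purely for illustration.
UA = '"Googlebot/2.1 (+http://www.googlebot.com/bot.html)"'
earlier = [
    '216.239.46.5 - - [10/Apr/2003:04:00:00 +0000] "GET /index.html HTTP/1.0" 200 512 "-" ' + UA,
    '216.239.46.5 - - [10/Apr/2003:04:00:05 +0000] "GET /old.html HTTP/1.0" 200 900 "-" ' + UA,
]
recent = [
    '64.68.82.1 - - [01/Jun/2003:23:10:00 +0000] "GET /index.html HTTP/1.0" 200 512 "-" ' + UA,
    '64.68.82.1 - - [01/Jun/2003:23:10:04 +0000] "GET /new.html HTTP/1.0" 200 700 "-" ' + UA,
    '10.0.0.1 - - [01/Jun/2003:23:11:00 +0000] "GET /secret.html HTTP/1.0" 200 100 "-" "Mozilla/4.0"',
]
print(newly_crawled(earlier, recent))
```

Feed it the lines from an April log and a current log. Anything it returns was requested from a 64.* address now but never by Googlebot back then. It only tells you what was fetched, not which bot Google thinks it is.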

TJ, have any comments on your logs?

Peter

Stefan

11:53 pm on Jun 1, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



it's getting pages that have never been gotten by Google before

I've been seeing this from freshie too, the last couple of days, but it still seems pretty slack, i.e. not finding all of the new pages and not anywhere near as busy as a deepcrawl. Maybe it will be a slow fresh deepcrawl and find them all soon, I hope :-)

Clark

12:03 am on Jun 2, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Some new pages, but that doesn't mean it's deepcrawl-style. To my memory, freshbot typically used to catch some new pages too. So maybe it's just freshbot's functionality being brought back.

hotice_2002

1:50 am on Jun 2, 2003 (gmt 0)

10+ Year Member



Yes, the freshbot (64.68.XX.XX) has grabbed hundreds of my new webpages. I believe this kind of behaviour should be called "deep crawling"!
lol

catch2948

3:35 am on Jun 2, 2003 (gmt 0)

10+ Year Member



Not only is freshbot following deeper than most times before, it is also following too ...

Strange though ... On websites where freshbot is picking up some new links that I started acquiring, the websites that the links are on are showing up under keyword searches that my links are targeted toward, rather than my site which the links are pointing to ... This should mean that the deep crawl has not yet happened?

I'm wondering if there is now going to be some sort of "double crawl" per index cycle ...

trillianjedi

10:31 am on Jun 2, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hiya Critter.

Yes, this actually started for us on Friday, although we didn't notice that she was picking up stuff that was not in April deepcrawl data until Saturday.

I didn't post because, to be completely honest, I'm a little bored with google now and will just wait until it's settled.

I can confirm your findings though, although whether or not it's anything out of the usual for freshbot is a little less clear. She's been known to go very deep in the past.

TJ

Stefan

10:57 am on Jun 2, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



steveb, we're just desperate for a deepcrawl and a new index... anything looks good right now. You're right, though, freshie was the first to pick my site up a couple of weeks after it went online last year.
This 211-message thread spans 15 pages.