homepage Welcome to WebmasterWorld Guest from 54.205.7.136
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / Google / Google News Archive
Forum Library, Charter, Moderator: open

Google News Archive Forum

This 96 message thread spans 4 pages: 96 ( [1] 2 3 4 > >     
Google hits!
kosar




msg:149122
 2:25 pm on Oct 25, 2004 (gmt 0)

60,000 google hits so far this month never have seen anything like this. Anyone else having this experience?

 

Brett_Tabke




msg:149123
 1:06 pm on Oct 31, 2004 (gmt 0)

referrals? Or GoogleBot visits?

Neither is uncommon.

webnewton




msg:149124
 10:06 am on Nov 2, 2004 (gmt 0)

O Man you're being hit badly! :>) Take it easy its common.

ThomasB




msg:149125
 5:54 pm on Nov 2, 2004 (gmt 0)

You are not alone with the enormous amounts of requests.

[webmasterworld.com...]

Brett_Tabke




msg:149126
 5:58 pm on Nov 2, 2004 (gmt 0)

We get that per day here.

Critter




msg:149127
 6:24 pm on Nov 2, 2004 (gmt 0)

Yep, Google sends around 30K per day to me, and expecting a *large* increase in traffic once the new links kick in.

jnmconsulting




msg:149128
 7:12 pm on Nov 2, 2004 (gmt 0)

I'm getting the same thing, this is the second in the last 30 days.

siteseo




msg:149129
 7:33 pm on Nov 2, 2004 (gmt 0)

We got hit so hard by GBot today that all our stores are offline - inadvertent Denial Of Service

kosar




msg:149130
 7:34 pm on Nov 2, 2004 (gmt 0)

i have never been hit so hard, why all of the action?

itisgene




msg:149131
 9:13 pm on Nov 2, 2004 (gmt 0)

One of my sites had more Gbot hits yesterday than the whole month of October.

PhraSEOlogy




msg:149132
 10:38 pm on Nov 2, 2004 (gmt 0)

Googlebot crawling like crazy here. 2 gig in less than 2 days. COOL! Maybe googlebot will be able to keep up with AJ and MSN who have been outcrawling (and referring more) than Google in the past few months.

dickbaker




msg:149133
 10:58 pm on Nov 2, 2004 (gmt 0)

My site has been in the sandbox, or whatever you prefer to call it, since I submitted in June. G was only coming around a couple hundred times a month. My site is 1400-1500 pages. In the last couple of days, it has visited 1501 times, and has now indexed 885 pages.

Now to work on more incoming links!

jnmconsulting




msg:149134
 11:18 pm on Nov 2, 2004 (gmt 0)

This might be somewhat off topic, but I was not able to find any information while performing a preliminary search of the forums.

How does Google acctually index a site, I understand there are multiple bots that perform diff tasks, find links, index the page...etc. Is there any documentation on this. what is the order of occurance by each bot?

pipster2004




msg:149135
 11:26 am on Nov 3, 2004 (gmt 0)

The darn thing is doing "in" my site :-
looking through the stats I get blasts of activity
up to 20 requests a second!
This is hurting too much!
I like google...but this is silly!

I keep having to ban and unban that IP!

Mozilla/5.0+(compatible;+Googlebot/2.1;++http://www.google.com/bot.html)

on IP 66.249.66.51

GoogleGuy




msg:149136
 6:47 pm on Nov 3, 2004 (gmt 0)

pipster2004, the crawl team is looking into it. We don't want to crawl so hard that you have to take action like that.

Essex_boy




msg:149137
 7:09 pm on Nov 3, 2004 (gmt 0)

One thing ive noticed, when something big is in the offing Googleguy pops up.

Is it me? Am I paranoid? Or has anyone else noticed this?

WebFusion




msg:149138
 7:45 pm on Nov 3, 2004 (gmt 0)

[BEGIN COMPLETE SPECULATION]

Personally, I think that google has finally developed a system that has overcome the space limitations of their previous version, and have now begun a full crawl using a newly developed crawler (that attempts to evaluate the speed/capacity of a site's server on the fly for maximum indesing speed) in earnest to rebuild their entire index from the ground up.

I think in the next 3-6 weeks there will be both a MAJOR update, as well as an release by google saying "now searching XXX billion and/or trillion pages".

[END SPECULATION]

StupidScript




msg:149139
 7:51 pm on Nov 3, 2004 (gmt 0)

GoogleGuy,

Is there an imposter Googlebot roaming around?
What's with the Mozilla user-agent?
Why isn't it requesting my "robots.txt" files, anymore?

Critter




msg:149140
 7:57 pm on Nov 3, 2004 (gmt 0)

200 pages per second crawled...

Not even breathing hard :)

iblaine




msg:149141
 8:12 pm on Nov 3, 2004 (gmt 0)

3 million pages crawled...this morning. However this is spread across several servers on several IPs so it's not painful, just surprising.

RFranzen




msg:149142
 8:49 pm on Nov 3, 2004 (gmt 0)

I concur with webfusion's speculation. Something's up. An additional point is MSN may soon graduate its techpreview. IMO they wouldn't want to face comparative reviews showing MSN's 25 billion (or whatever) pages to Google's 4 billion. ... at least not without a way to compete in the numbers game.

-- Rich

creative craig




msg:149143
 8:54 pm on Nov 3, 2004 (gmt 0)

I have had a new site crawled and now ranking in under two weeks now - visiting daily at the moment.

A few of my older sites have been crawled deeply as well!

mark1615




msg:149144
 9:08 pm on Nov 3, 2004 (gmt 0)

Here is a quick point and question:

One of our sites was getting hit hard and fast by G. It seemed to build throughout October. Then on 11/1 it all but stopped. 11/1 G requested about 3% of what it was on 10/31. Ok, now the embarassing part - I made a small (really small) change to the index page on 10/31 that cause it not to validate. Could the lack of the code validating inhibit G and the other bots?

communitynews




msg:149145
 9:20 pm on Nov 3, 2004 (gmt 0)

We have 250,000 pages on one of our sites but only see 6 or so pages per second at peak from GoogleBot. I'm thinking that our database is a limiting factor and that if I put in more RAM it could handle closer to the 200 pages per second reported here. Two questions: 1.)Do you think GoogleBot figures out how fast the machine is and backs off if it can't handle it? and 2.)if the machine was faster would GoogleBot get more pages (Google only reports between 100,000 and 125,000 with site:domain.com for the site in question)?

communitynews




msg:149146
 9:23 pm on Nov 3, 2004 (gmt 0)

Oh, one more question. Does anyone know if speed of the machine is factored in to Page Rank?

BigJay




msg:149147
 9:52 pm on Nov 3, 2004 (gmt 0)

66K pages this week.

based on Googleguy's comment we can call this a HARD CRAWL.

rlkanter




msg:149148
 10:02 pm on Nov 3, 2004 (gmt 0)

24k first two days of month on one site compared to 7k all last month....

phaze




msg:149149
 11:29 pm on Nov 3, 2004 (gmt 0)

There's been some interesting speculation over the last month on why GoogleBot has so much energy. No one seems to have caught on. It's not 'panic crawling' or somesuch nonsense. Google have simply upgraded their infrastructure thanks to a cash injection.

I run a search engine, and besides your various algo's, the two most important things are the size of your index, and its freshness. If I were google, that's the first thing I'd throw money at if I had spare cash and was worried about Microsoft and Yahoo on my heels. I'd upgrade my crawler farm and add massive capacity to the servers that carry my indices. Then I'd crawl as hard and as deep as possible.

And I'd make sure I have a small team lurking on the boards checking if webmasters start squeaking about bandwidth and load - as above. I may even contact the owner of the board, and call in a favor to bump the 'Google hits' discussion to the home page. ;)

m.

GodLikeLotus




msg:149150
 11:38 pm on Nov 3, 2004 (gmt 0)

phaze

I would bet that Google is already doing everything you just mentioned and much much more.

GLL

killroy




msg:149151
 11:51 pm on Nov 3, 2004 (gmt 0)

Google basically frooze my server for a while, I saw up to 50 scripts running simultaneously all hit from the same googlebot IP. In the meantime it spiders 10s of 1000s of pages but only a couple of 100 urls not previously spidered. So I'm not even gonna get new pages indexed for my hassle :(

SN

This 96 message thread spans 4 pages: 96 ( [1] 2 3 4 > >
Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Google / Google News Archive
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved