Forum Moderators: open

Message Too Old, No Replies

Googlebot and how it works

How much does Googlebot look at?

         

Cheery Ragdoll

2:48 am on Oct 31, 2003 (gmt 0)

10+ Year Member



When Googlebot is crawling a page, does it crawl the entire page or just up to certain amount of bytes?

If it does stop crawling after a certain amount of bytes (say for example 40,000Kb) if I have an image at the top of the page that is 40,000KB, will Googlebot stop at the end of the image and not pay any attention to the HTML text with keywords below?

Jesse_Smith

5:57 am on Oct 31, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



It stops at 100K. 100K of the HTLP file. Anything after that it doesn't see.

DerekH

7:25 am on Oct 31, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Cheery_Ragdoll wrote
if I have an image at the top of the page that is 40,000KB, will Googlebot stop at the end of the image and not pay any attention to the HTML text with keywords below?

The image isn't *in* the HTML, only a reference to it is in the HTML, so only about 100 bytes are associated with the image...

 

Jesse_Smith wrote

It stops at 100K. 100K of the HTLP file. Anything after that it doesn't see.

I think there has been anecdotal evidence that links are extracted from the whole page, but only the first 101K is cached... Anyone confirm?

DerekH

Jesse_Smith

3:22 pm on Oct 31, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I've had link files that were WAY bigger than 100K, and only the links in the first 100K got crawled. If it's not in the first 100K, Goolge won't see it.