Forum Moderators: open

Message Too Old, No Replies

Google Bot

         

Raven_X

3:10 pm on Aug 13, 2003 (gmt 0)

10+ Year Member



Hello

I noticed that when Google Bot visits my index page it always reads to 51.47 KB when my index page is a lot bigger than that. And it never gets to the end of the page. Can anyone explain to me how exactly does it read or to be more specific if I was to look into HTML properties of index how would I calculate where exactly does it stop.

Thank you

AthlonInside

3:39 pm on Aug 13, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



1 character = 1 byte.

1 kilobyte = 1024 bytes = 1024 character

51.47 kilobytes = 51.47 x 1024 = 52,706 bytes = 52,705 characters.

Write a php script to read the index file for 52,705 characters and that's the point.

Hmm, do I made any mistakes? :)

dougmcc1

3:56 pm on Aug 13, 2003 (gmt 0)

10+ Year Member



You might have an error in your code at that point which is causing Googlebot to not index beyond that point.

Raven_X

3:57 pm on Aug 13, 2003 (gmt 0)

10+ Year Member



Can you give me easy directions on how to write PHP script? I'm not really good at technical specs

Chndru

3:58 pm on Aug 13, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Will viewing a cached version of the page show where the Gbot stopped?

Raven_X

3:58 pm on Aug 13, 2003 (gmt 0)

10+ Year Member



dougmcc1
Very interesting though. What kind of common errors are they that prevent Google Bot from going trough them?

dougmcc1

4:07 pm on Aug 13, 2003 (gmt 0)

10+ Year Member



Sorry, I don't have specifics as to exactly what coding errors might stump Google.

Raven_X

4:09 pm on Aug 13, 2003 (gmt 0)

10+ Year Member



Do you think Flash buttons would have something to do with it? If so is there a script that can make Bot by pass them and go trough the page?

dougmcc1

6:25 pm on Aug 13, 2003 (gmt 0)

10+ Year Member




Do you think Flash buttons would have something to do with it?

Not unless the code is wrong or something.


If so is there a script that can make Bot by pass them and go trough the page?

If you think that's what's causing the problem, you can do something like this:
<script language="JavaScript" src="flashscript.js"></script>

And then put the Flash embed code into the script like this:
document.write("<embed flash movie code>");

Would you mind posting your code?

Raven_X

8:10 pm on Aug 13, 2003 (gmt 0)

10+ Year Member



<script type="text/javascript" language="JavaScript1.2">
<!--
beginSTM("yfatchr","static","0","0","none","false","false","310","20","0","250","","blank.gif");
beginSTMB("auto","0","0","vertically","blank.gif","0","0","0","1","

Middle.................

endSTMB();
endSTMB();
endSTM();
//-->
</script>

dougmcc1

11:11 pm on Aug 13, 2003 (gmt 0)

10+ Year Member



I thought you said Googlebot wasn't indexing your page after the Flash buttons? If that's the case then you should have an <object> and/or <embed> tag, not JavaScript.

But if that is the code that Googlebot isn't indexing after then what I would do is move this code into an external JavaScript:


<!--
beginSTM("yfatchr","static","0","0","none","false","false","310","20","0","250","","blank.gif");
beginSTMB("auto","0","0","vertically","blank.gif","0","0","0","1","
Middle.................

endSTMB();
endSTMB();
endSTM();
//-->

Then I would save that script in a separate folder called 'scripts' and reference it after the </HTML> tag if possible, like this:


</html>
<script language="JavaScript" src="scripts/externaljsfile.js"></script>

Because I don't think Googlebot indexes anything after the </HTML> tag - though I could be wrong.

And finally I would use the robots.txt file to disallow Googlebot from the 'scripts' folder.

That way there is no way that Googlebot can trip over your JavaScript code.

g1smd

11:23 pm on Aug 13, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



It is not good coding to continue after the </html> tag.

You'll need a type="text/javascript" attribute on that <script> tag too.

dougmcc1

11:40 pm on Aug 13, 2003 (gmt 0)

10+ Year Member



Ah thanks for the correction g1smd.

Yeah it is bad coding. But how will Googlebot know if it doesn't go past the </HTML> tag?

Although I suppose there are other things to consider such as HTML validation and how different browser's will handle it.

Anyways, after looking at your site Raven_X, it wouldn't be possible to move that script anywhere anyways because it's your navigation. If you move the code, you move your navigation. Just move it to an external JavaScript, place it in a separate folder and disallow that folder from the SE's and you should be fine.