Google and message boards

How does Google determine whether to catalogue posts on message boards?

         

gibbergibber

12:40 pm on Jul 31, 2003 (gmt 0)

10+ Year Member



Hi,

I've found it increasingly fruitful to integrate message boards into my website, so instead of just publishing an article as a webpage I might post it to my site's message board system and link to this post from my site's main page.

My question is, how does Google decide whether or not to catalogue posts from message boards?

I can understand that the bots would want to avoid trawling through every script-based link ad infinitum. However, assuming there's a direct link to a message board post from an already catalogued HTML page, how likely is Google to index the post?

I've noticed that it's quite willing to index the indices of message boards, for example:

[mydomainname...]

but it hasn't so far (after several updates which managed the rest of my site) indexed any individual threads I've linked to.

I'm also trying out a PHP board system, and was wondering if this would be treated differently to a Perl CGI one.

-gibbergibber

roddy

2:17 pm on Jul 31, 2003 (gmt 0)

10+ Year Member



I don't know about CGI ones. However, I do know that PHP boards (specifically phpBB) can be indexed by Google - mine is.

The vanilla version used session IDs in its URLs, which Google avoids. However, after adding a mod that recognises Googlebot and serves it a SID-free board, everything's gone swimmingly.
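A minimal sketch of what such a mod does, written in Python rather than phpBB's own PHP. It assumes the session ID travels as a `sid=` query parameter holding a hex string; the function names and the crawler list are made up for illustration and are not the actual mod's code.

```python
import re

# Hypothetical list of crawler markers to look for in the User-Agent header.
CRAWLER_MARKERS = ("googlebot",)

# Assumed form of the session parameter: "sid=" followed by hex digits.
SID_PARAM = re.compile(r"([?&])sid=[0-9a-fA-F]+(&|$)")


def is_crawler(user_agent):
    """Return True if the User-Agent contains a known crawler marker."""
    ua = (user_agent or "").lower()
    return any(marker in ua for marker in CRAWLER_MARKERS)


def strip_sid(url):
    """Remove the sid parameter from a URL and keep the other parameters."""
    def repl(match):
        # If another parameter follows the sid, keep the leading "?" or "&".
        return match.group(1) if match.group(2) == "&" else ""
    return SID_PARAM.sub(repl, url)


def link_for(url, user_agent):
    """Return the SID-free link for crawlers and the unchanged link otherwise."""
    return strip_sid(url) if is_crawler(user_agent) else url
```

Ordinary visitors keep their session links, so logins without cookies still work; only requests identified as a crawler get the cleaned URLs.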

It seems quite happy to index every daft little page - I've seen Googlebot simultaneously read 25 identical search pages, having followed the links from the actual threads. About 390 pages went in the first time round, and I'm expecting a similar number to reach the index soon - there were about 30 Googlebot 'guests' online on my forum for most of Thursday.

Roddy

gibbergibber

2:46 pm on Jul 31, 2003 (gmt 0)

10+ Year Member



Anyone care to speculate why CGI posts aren't indexed but PHP ones are?

There's good content on message boards (okay, so it's in there with 99.9% rubbish sometimes!). It's a shame Google won't let you search it the way it lets you search Usenet through Google Groups.

Google Guy, any chance of a Google tab devoted to message boards? (hopeful look)

-gibbergibber

ruserious

3:28 pm on Jul 31, 2003 (gmt 0)

10+ Year Member



There is no distinction between CGI and PHP. Remember that PHP can even run as a CGI program. Also, Google has no way to know how a served document is stored or generated on the server; it can only guess whether it's static, dynamic, etc., and IMHO it does not even do that.

You have to be careful when "crafting URLs". The consensus is that you should try to have no more than two parameters in the URL; with more parameters, the links are likely to be ignored. You should also avoid using "id=" as a parameter name (because Googlebot is wary of session IDs). You should also be careful not to serve session IDs to all users. Remember that most dynamic applications use cookies by default and fall back to session IDs in the URL only when cookies are disabled. Since Googlebot does not accept cookies, it will get a session ID, even if you as a user (with cookies enabled) never see one.
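A rough way to check a board's links against these rules of thumb, sketched in Python. The two-parameter limit and the "id" check come from the paragraph above and are forum consensus, not a documented Google rule; the list of session-parameter names and the function name are assumptions for illustration.

```python
from urllib.parse import urlsplit, parse_qsl

MAX_PARAMS = 2  # rule of thumb from the thread, not an official limit

# Assumed names that commonly carry a session ID in forum software.
SESSION_NAMES = ("sid", "s", "phpsessid", "sessionid")


def crawl_warnings(url):
    """Return a list of warnings for a URL, based on the rules of thumb."""
    params = parse_qsl(urlsplit(url).query, keep_blank_values=True)
    names = [name.lower() for name, _ in params]
    warnings = []
    if len(params) > MAX_PARAMS:
        warnings.append("more than %d parameters" % MAX_PARAMS)
    if "id" in names:
        warnings.append("parameter named 'id'")
    if any(name in SESSION_NAMES for name in names):
        warnings.append("looks like a session id")
    return warnings
```

To see what Googlebot would get, run it over the links in a page fetched with cookies disabled, since that is when the session parameter shows up.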

You can get any forum software spidered; it's just a matter of knowing what to do and how to do it. You should also think carefully about which parts you want indexed and which you do not (use robots meta tags). It's an important decision.
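A minimal sketch of the meta-tag idea in Python, assuming the board software knows what type of page it is rendering. The page-type names and the `robots_meta` helper are made up for illustration; only the `robots` meta tag itself is a standard mechanism.

```python
# Hypothetical page types that should appear in the index.
INDEXABLE = {"index", "forum", "thread"}


def robots_meta(page_type):
    """Return the robots meta tag to place in the page's <head>."""
    if page_type in INDEXABLE:
        content = "index,follow"
    else:
        # Keep utility pages (search, profiles, login) out of the index,
        # but still let the crawler follow their links.
        content = "noindex,follow"
    return '<meta name="robots" content="%s">' % content
```

For example, `robots_meta("search")` yields a `noindex,follow` tag, which keeps duplicate search-result pages out of the index without blocking the crawl.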

antoshka

8:28 pm on Jul 31, 2003 (gmt 0)



I heard that Google will spider your dynamic pages only if your PR is high enough. Is it true?

[edited by: rcjordan at 8:34 pm (utc) on July 31, 2003]
[edit reason] sorry, no references to your site, please. [/edit]

edit_g

8:32 pm on Jul 31, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Can you get rid of your URL, please, antoshka? This is the second post of yours that I've read where you have a sig. It is really irking me.
No sigs: [webmasterworld.com...] (check number 13).

GrinninGordon

12:58 am on Aug 1, 2003 (gmt 0)



gibbergibber

Please! CGI posts aren't indexed! Do a search for wwwboard/messages

geckofuel

1:35 am on Aug 1, 2003 (gmt 0)

10+ Year Member



gibbergibber,
Hi. I and many others who use UBB, for example, can sympathize with you. The average UBB board is not indexed. There are some hacks out there though to make your CGI bulletin board look like static html pages.
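The idea behind those hacks can be sketched as a two-way mapping between the script URL and a path that looks like a static file. This Python sketch is not taken from any UBB hack; the `/thread-N.html` scheme and the `/cgi-bin/board.cgi` script path are invented for illustration. In practice the translation back to the script is usually done by the web server's rewrite rules.

```python
import re

# Hypothetical static-looking scheme: /thread-42.html or /thread-42-3.html
STATIC_THREAD = re.compile(r"^/thread-(\d+)(?:-(\d+))?\.html$")


def to_static(thread_id, page=1):
    """Build the static-looking path to use in links the crawler sees."""
    if page == 1:
        return "/thread-%d.html" % thread_id
    return "/thread-%d-%d.html" % (thread_id, page)


def to_dynamic(path):
    """Translate a static-looking path back to the script URL, or None."""
    match = STATIC_THREAD.match(path)
    if match is None:
        return None
    thread_id = match.group(1)
    page = match.group(2) or "1"
    # Hypothetical script location and parameter names.
    return "/cgi-bin/board.cgi?thread=%s&page=%s" % (thread_id, page)
```

The crawler only ever sees paths without a query string, so the parameter-count and session-ID concerns do not arise for those links.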

cayleyv

6:03 pm on Aug 1, 2003 (gmt 0)

10+ Year Member



I have a forum set up with phpBB and it is not indexed. All external links pass through a header redirect, just like here at WebmasterWorld. However, the content is not indexed. I saw Googlebot trawl through it a few months ago and never return. I have seen other phpBB boards indexed, and my conclusion is that the cause is the nature of my board, which is about search engine optimization.

My hypothesis is that Googlebot discriminates between boards whose members have something to gain online (SEO) and Joe the plumber's board, where people don't use signatures for search engine benefit.

Nicke

9:04 am on Aug 2, 2003 (gmt 0)

10+ Year Member



I liked WWWBoard in the old days, always indexed by search engines.

Nowadays I am using vBulletin. Google should theoretically be able to index the main page, the main category pages and the threads, but not the following page(s) of a thread, since those URLs use more than two parameters to define some settings.

Actually, Google has only indexed the main category pages at my forum and none of the threads. Any good hacks for vBulletin? The main page of my forum is PR5.

petertdavis

11:45 am on Aug 2, 2003 (gmt 0)

10+ Year Member



Nicke, which version of vBulletin are you using? I have a small forum that I recently upgraded to vBulletin 3.0. It went from having about ten pages indexed prior to the upgrade to having 2000+ pages indexed after the upgrade. It's got PR5, if that matters.

Jesse_Smith

11:45 pm on Aug 2, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



::My question is, how does Google decide whether or not to catalogue posts from message boards?

Does it have session ID in the URL?

Yes= You don't get listed.
No= You do get listed.

With vBulletin boards I went almost two years with the Google Death Penalty. Then I took the session ID out of the URL in January, and right now there are 15,600 URLs indexed.

:::i heard that Google will spider your dynamic pages, only if your PR is high enough. is it true?

Nope. I've got a second vBulletin site, a new one, and before it even registered any PR or backlinks, Google was indexing its threads and boards.

Jenstar

11:52 pm on Aug 2, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



cayleyv - phpBB boards are indexable by Google - head on over to phpbb.com and go to the forums. There is a huge thread there about how to allow Googlebot to index your site so the threads appear in the SERPs.

Nicke

1:17 pm on Aug 4, 2003 (gmt 0)

10+ Year Member



How do you turn off session IDs in vBulletin to make it more search-engine friendly?

I am currently using 2.2.9, and I know 2.3.0 has been out for a while. Do you use the beta of 3.0?