Freshbot and how to qualify?

Why some sites just don't get the fresh bot!

         

lasko

11:25 am on Apr 13, 2003 (gmt 0)

10+ Year Member



I have many sites in Google and most qualify for the fresh bot; however, two new sites that each have over 80 backlinks and
75 pages in Google's database cannot attract the freshbot.

The index pages change often, both in images and in size, but to no avail.

Google is showing one of my pages as it was before it was optimised. That version is 6 weeks old. New fresh dates are now appearing on all my web sites except this one.

Some websites have fewer backlinks and fewer pages but get the freshbot nearly every day.

Any thoughts on how best to qualify for the fresh bot, or is it again pot luck?

le_gber

2:19 pm on Apr 13, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hi,

I think you can 'qualify' for freshbots IF:

  • your site has a high PR (I think 6)
  • your site is linked to by a site with high PR
  • some changes in content occur quite often

    Anybody else?

    leo

    takagi

    2:55 pm on Apr 13, 2003 (gmt 0)

    WebmasterWorld Senior Member 10+ Year Member



    I agree with most of Leo's remarks. However, a new site (i.e. no pages in the index so far and therefore a gray PageRank) will also be visited by freshbot if a link to this site is found. In general, having a higher PR (starting from PR5?) means more pages will be visited by freshbot, especially if changes were found by freshbot or deepbot during previous visits. One last remark: a dynamic page could be tricky if you only update the database but not the php/asp file.

    BGumble

    3:36 pm on Apr 13, 2003 (gmt 0)

    10+ Year Member



    Hi takagi :)

    >>a dynamic page could be tricky if you only update the database but not the php/asp file.

    Why do you say that, please? If the database has changed, the data has changed, and the returned file size will be different. You can send HTTP headers like these to properly inform the search engine when things change:

    header('Cache-Control: must-revalidate,proxy-revalidate,max-age=120,s-maxage=120');
    // gmdate() formats the time in GMT, so no time zone offset is needed
    header('Last-Modified: ' . gmdate('D, d M Y H:i:s', time() - 60) . ' GMT');

    pixel_juice

    3:47 pm on Apr 13, 2003 (gmt 0)

    10+ Year Member



    The 2 most important factors I've found for getting freshbot visits are 1) reasonable content changes and 2) a link from a site that gets regularly freshbotted itself (and with a PR6 or above). Your site doesn't need a PR6 to get the freshbot - I have some PR4s that get visited pretty often by virtue of content updates. Freshbot likes dynamic pages.

    takagi

    4:21 pm on Apr 13, 2003 (gmt 0)

    WebmasterWorld Senior Member 10+ Year Member



    Hi BGumble. I wrote on purpose "could be tricky" because I personally don't have any experience with it.

    Please read the Tricking Googlebot on a dynamic site [webmasterworld.com] thread. If it is possible to get the 'last modified date' information without loading the complete page, freshbot might not load the page anyway.

    PatrickDeese

    4:28 pm on Apr 13, 2003 (gmt 0)

    WebmasterWorld Senior Member 10+ Year Member



    I believe that PR5 begins the freshbot cycle. I have a number of sites that are PR5 and they are freshbotted on a regular basis during the crawling cycle.

    plasma

    4:42 pm on Apr 13, 2003 (gmt 0)

    10+ Year Member



    >>I believe that PR5 begins the freshbot cycle. I have a number of sites that are PR5 and they are freshbotted on a regular basis during the crawling cycle.

    I have many sites with PR 0 (grey), 1, 2 and 3, and they all get freshbot hits daily. Some of them (the grey ones) don't even have backlinks.

    They are all dynamic pages (cgi / hashed urls).

    The trick?

    validators (last-modified etc.)
    [w3.org...]

    You can check your site's cacheability here:
    [ircache.net...]
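
For illustration, here is a minimal sketch of what sending validators for a dynamic page could look like. The function name `build_validator_headers` and its parameters are made up for this example (they are not from the thread), and it assumes you can get a Unix timestamp for when the underlying data last changed, for instance from a database column. It needs PHP 7 or later because of the type declarations.

```php
<?php
// Hypothetical helper (name and parameters are assumptions for this sketch):
// builds validator headers for a dynamically generated page.
//   $body        - the finished page output
//   $lastChanged - Unix timestamp of the last real change to the underlying data
function build_validator_headers(string $body, int $lastChanged): array
{
    return [
        // HTTP dates are expressed in GMT, hence gmdate() rather than date()
        'Last-Modified' => gmdate('D, d M Y H:i:s', $lastChanged) . ' GMT',
        // An ETag derived from the content, so it changes whenever the body changes
        'ETag'          => '"' . md5($body) . '"',
    ];
}

// Usage sketch: emit the headers before any page output.
// foreach (build_validator_headers($html, $updatedAt) as $name => $value) {
//     header("$name: $value");
// }
```

Tying Last-Modified to the data's own timestamp, rather than to the current time, means the header only changes when the content actually does.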

    lasko

    4:48 pm on Apr 13, 2003 (gmt 0)

    10+ Year Member



    Some of my sites have a PR 4 and change nearly every day.

    I wish GoogleGuy could explain the operation of the Fresh Bot in more detail.

    All my pages are static html pages, with more pages than my competitors and more links, but still it looks like it's going to be another month before Google will change my page.

    Would it not be better for Google to freshbot every main page (index.htm) of all the websites in the database to keep it fresh? Some sites have every page refreshed whilst smaller sites get nothing.

    mipapage

    5:37 pm on Apr 13, 2003 (gmt 0)

    WebmasterWorld Senior Member 10+ Year Member



    I managed to get a link to our site from a news-oriented website, something that gets updated frequently, and now our site is regularly hit by f-bot.

    BGumble

    7:15 pm on Apr 13, 2003 (gmt 0)

    10+ Year Member



    plasma>> The trick? validators (last-modified etc.)

    Exactly. Make sure you declare everything that you should, make it real fresh content, and freshie will come.

    You can even send the correct filesize of a dynamic file in the headers BEFORE the file is sent. Here is how in PHP: you must put the page into the buffer, read the size, then deliver the buffer.

    <?php

    // BEFORE ANY PAGE CONTENT: BUFFER THE OUTPUT SO ITS SIZE CAN BE MEASURED
    // (plain ob_start() is used here: with ob_gzhandler, ob_get_length() reports the
    // uncompressed size, which would not match the compressed body actually sent)
    ob_start();

    ..... rest of page goes here .....

    header('Cache-Control: must-revalidate,proxy-revalidate,max-age=120,s-maxage=120');
    // gmdate() formats the time in GMT, so no time zone offset is needed
    header('Last-Modified: ' . gmdate('D, d M Y H:i:s', time() - 60) . ' GMT');
    header('Content-Length: ' . ob_get_length());
    ob_end_flush();

    // HEADERS NOW INCLUDE PROPER VALIDATION INCLUDING CONTENT-LENGTH OF THIS DYNAMIC PAGE

    ?>

    ALbino

    7:33 pm on Apr 13, 2003 (gmt 0)

    10+ Year Member



    I have a PR4 and it gets freshbotted probably twice a month. With that said we add/modify at least 15 - 20 pages a day so our content is very dynamic.

    As a side note, I've noticed absolutely zero usefulness from a practical standpoint in Fresh Bot coming to visit, but people here treat it like it's the holy grail so I've learned to appreciate it as a sign of respect ;)

    netguy

    7:40 pm on Apr 13, 2003 (gmt 0)

    WebmasterWorld Senior Member 10+ Year Member



    ALbino... FreshBot IS the holy grail for many of us. We have several news sites where people who want current information on a topic can find it quickly, rather than trying to dig through the news sites.

    We also have clients with a variety of products who often change or add new products every week. Why make a customer wait a month for 'fresh' information if the information is sitting there?

    I can't really imagine many who wouldn't want fresh information indexed for their site.....

    plasma

    7:51 pm on Apr 13, 2003 (gmt 0)

    10+ Year Member



    >>You can even send the correct filesize of a dynamic file in the headers BEFORE the file is sent. Here is how in PHP-- you must put the page in to the buffer, read the size, then deliver the buffer.

    BTW: simply sending the last-modified header is only half the job.

    The idea behind if-modified-since is that you only send the content if it changed since the specified timestamp.

    So always keep track of when your pages were created (e.g. by implementing caching, which is always a good idea).
    Then respond to the request by sending either "304 Not Modified" (and a Date header) or "200 OK".

    I too can confirm that sending the content-length is an important thing and can speed up things enormously (HTTP/1.0 persistent connections).

    I believe the more you stick to the standards, the more Google likes you :)
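
The If-Modified-Since handling described above could be sketched roughly as follows. This is only an illustration under stated assumptions: the function name `is_not_modified` is invented for this example, `$lastChanged` is assumed to come from your own cache or database bookkeeping, and the sketch relies on PHP's `strtotime()` to parse the date the client sends. It needs PHP 7.1 or later because of the nullable type declaration.

```php
<?php
// Hypothetical helper (name and parameters are assumptions for this sketch):
// decides whether the client's copy is still current.
//   $ifModifiedSince - raw If-Modified-Since request header, or null if absent
//   $lastChanged     - Unix timestamp of the page's last real change
function is_not_modified(?string $ifModifiedSince, int $lastChanged): bool
{
    if ($ifModifiedSince === null || $ifModifiedSince === '') {
        return false; // no validator sent: deliver the full page
    }
    $since = strtotime($ifModifiedSince);
    if ($since === false) {
        return false; // unparseable date: play safe and deliver the full page
    }
    // Unchanged if the page was last modified at or before the client's timestamp
    return $lastChanged <= $since;
}

// Usage sketch ($lastChanged comes from your own cache or database):
// if (is_not_modified($_SERVER['HTTP_IF_MODIFIED_SINCE'] ?? null, $lastChanged)) {
//     http_response_code(304); // "304 Not Modified", no body is sent
//     exit;
// }
// header('Last-Modified: ' . gmdate('D, d M Y H:i:s', $lastChanged) . ' GMT');
// ... then generate and send the page as usual with "200 OK" ...
```

Keeping the decision in a small function separate from the header output makes it easy to check with fixed timestamps before wiring it into a live page.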

    ALbino

    8:50 pm on Apr 13, 2003 (gmt 0)

    10+ Year Member



    >>I can't really imagine many who wouldn't want fresh information indexed for their site.....

    I guess that's true. My point was only that I've never once seen a jump in sales or even hits for that matter as a result of Fresh Bot. I suppose though that the hits I'm getting are more relevant then because it's more up-to-date. Don't get me wrong, I'll take Fresh Bot over no Fresh Bot any day of the week :)

    nhyrvana

    1:07 am on Apr 14, 2003 (gmt 0)

    10+ Year Member



    I have a PR5 (maybe a 6, it varies with the day of the week. grr) but all the sites I link to seem to get freshbot after me, at least on their index pages. I am talking about 30 sites I can verify, both via the fresh tag and stats access.
    So I would say a link from a site that is getting freshbot is the best key if you cannot change content regularly and do not have a high enough PR to garner it.
    le_gber said it more simply. Just backing them up :)