Forum Moderators: open

Message Too Old, No Replies

GoogleBot is requesting the same pages over and over

Google spider trapped?

         

waxtactics

12:55 pm on Mar 17, 2003 (gmt 0)

10+ Year Member



Hi,

I submitted to Google about 6 weeks ago and have been waiting for a visit. It came on Saturday and has been filling my logs ever since.

My pages are dynamic, but the php has been modified to convert the urls to search engine 'friendly'
e.g - /product_info.php/products_id/321

It seems to be indexing the same pages over and over. Is this normal behaviour? I suspect it is just following the links within the sub-pages and ending up back on the top level pages. Can the spiders get 'trapped' on sites that are not dynamic? If so, can I stop the loop and how?

Typical. I wait 6 weeks in a funeral parlour-esque log file and now its like Picadilly Circus!

Thanks,
IC

Alternative Future

12:59 pm on Mar 17, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Where is my manners ;-)

Hi and welcome to WebMasterWorld waxtactics,

Looks as though this good be a spider trap! This is where the spider gets trapped in your pages... Not sure on it getting trapped in non dynamic pages though.

[added]Or it could be the freshbot, which should only do the index page[/added] Are your logs reporting on it hitting the same page many times, on the same day as in a ridiculous amount?

-gs

waxtactics

4:01 pm on Mar 17, 2003 (gmt 0)

10+ Year Member



I wouldnt say ridiculous amounts, but for example login.php has been hit 8 times in the last 40 minutes. Its grabbing a whole lot of other stuff as well, so maybe its just ending up there as a link from the products - although my add to cart button is a form, so it shouldnt be able to follow that..

Ho hum, having read the 'google bots are back' post, it seems that a few people are getting some real hungry ones over the last coupla days.

I dont think its fresh bot - its going as deep as it can - eg /default.php/manufacturers_id/33/page/1/sort/3d.

Its crawl1.googlebot.com to crawl9.googlebot.com