why do crawlers only get first page and go? (302 redirect)

Glad to be in the forums, as I found a lot answers here while asking google my php questions.. :)

I wonder why searchbots like googlebot only get the first page and do not crawl deeper.

access.log:
64.68.82.169 - - [28/Oct/2003:09:01:48 +0100] "GET /robots.txt HTTP/1.0" 200 81 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
64.68.82.169 - - [28/Oct/2003:09:01:57 +0100] "GET / HTTP/1.0" 200 3006 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"
64.68.82.168 - - [28/Oct/2003:11:47:46 +0100] "GET /en/main/welcome HTTP/1.0" 200 3006 "-" "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"

..and gone.
That's with other bots, too.

robots.txt valids ok.

I use Apache's mod_rewrite to make the page look static and do not use sessions (had this in the beginning - that's still in google's archive).

The site to be regarded is thequod.de [thequod.de]. From there you'll be redirected (with status 302) according to your language, probably to [thequod.de...] - and that's where the robots don't get deeper.

Is this because of absolute links in href (like "/en/main/othercategory") and bots will only climb down, not up?

Or is it the "DC.Identifier" meta tag that does not refer to the URL itself, but to the executing php script? (Just recognized that, but cannot imagine that this would prevent bots from crawling the rest).

Please have a look at this..

why do crawlers only get first page and go? (302 redirect)

blueyed

BlueSky

blueyed

BlueSky

blueyed

Join The Conversation

Moderators and Top Contributors

Hot Threads This Week