Page is a not externally linkable
- Google
-- Google SEO News and Discussion
---- Googlebot is obsessed with index.php... how best to fix?


Sgt_Kickaxe - 12:23 pm on Dec 7, 2012 (gmt 0)


Google will request every URL they have ever seen, forever.


That's what it feels like, for sure!

Since I'm redirecting non-www to www I haven't worried too much about redirecting to remove /index.php/ but then Googlebot doesn't request my non-www copies very often, it's the /index.php/ they want on 90% of requests, they visit the index.php version first that often still.

I also see this in my logs a lot for pages that are 410 (expired content), after googlebot requests the index.php version of course.

"GET www.example.com/expired-content HTTP/1.1" 410 636 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)" 0 "redirect-handler" "redirect:/index.php"


I don't remember seeing the parts in bold until fairly recently.


Thread source:: http://www.webmasterworld.com/google/4525804.htm
Brought to you by WebmasterWorld: http://www.webmasterworld.com