Sgt_Kickaxe - 12:23 pm on Dec 7, 2012 (gmt 0)
Google will request every URL they have ever seen, forever.
That's what it feels like, for sure!
Since I'm already redirecting non-www to www, I haven't worried too much about adding a redirect to remove /index.php/. But Googlebot doesn't request my non-www copies very often; it's the /index.php/ URLs they want on 90% of requests. That's how often they still visit the index.php version first.
I also see this in my logs a lot for pages that return 410 (expired content), after Googlebot has requested the index.php version first, of course.
"GET www.example.com/expired-content HTTP/1.1" 410 636 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)" 0 "redirect-handler" "redirect:/index.php"
I don't remember seeing the parts in bold until fairly recently.
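For anyone who wants to put a number on this from their own logs, here is a rough Python sketch. It assumes a log line shaped like the one quoted above (request, status, size, referer, user agent, then some extra fields). The function names and the "trailing" key are made up for illustration, and I'm not claiming to know what the extra quoted fields at the end actually mean; the script just collects them as they are.

```python
import re

# Matches the request, status, size, referer and user-agent parts of a line
# shaped like the sample above. Anything after the user agent is kept raw.
LINE_RE = re.compile(
    r'"(?P<method>[A-Z]+) (?P<target>\S+) (?P<proto>[^"]+)" '
    r'(?P<status>\d{3}) (?P<size>\d+|-) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
    r'(?P<rest>.*)$'
)

def parse_line(line):
    """Return a dict of fields for one log line, or None if it doesn't match."""
    m = LINE_RE.search(line)
    if m is None:
        return None
    rec = m.groupdict()
    rec["status"] = int(rec["status"])
    # Hypothetical key: the quoted values that follow the user agent, unlabeled.
    rec["trailing"] = re.findall(r'"([^"]*)"', rec.pop("rest"))
    return rec

def googlebot_index_php_share(lines):
    """Fraction of Googlebot requests whose requested URL contains /index.php."""
    total = hits = 0
    for line in lines:
        rec = parse_line(line)
        if rec is None or "Googlebot" not in rec["agent"]:
            continue
        total += 1
        if "/index.php" in rec["target"]:
            hits += 1
    return hits / total if total else 0.0
```

Feed googlebot_index_php_share() the lines of an access log (for example an open file object) and it returns a value between 0 and 1. Matching on the user-agent string alone will also count anything faking Googlebot, so treat the result as a rough estimate.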