301 (redirects) are immediately followed (no delay), as well as the robots.txt
Once again: Funny you should say that. Yesterday I noticed Barkrowler on one site it hasn't yet crawled, prompting me to check earlier logs for possible blocked visits. Now, this happens to be a site that went HTTPS earlier in the year, so all HTTP requests
including robots.txt* get a 301 response.
30 August (before Barkrowler was mentioned in robots.txt, and hence before it was authorized at all), in chronological order:
HTTP robots.txt 301
HTTPS robots.txt 200
HTTP one page 403
12 September (Pacific time, would be 13 September in Paris)
HTTP no less than
six requests for robots.txt, all 301
HTTPS ... nothing
Odd. It made me wonder if the robot got confused by redirected robots.txt requests, and didn't know what to do next. (Did its code get tweaked in the intervening two weeks? It sounds as if it did.)
On the main question I agree with keyplyr: If you get a redirect, it makes most sense to follow it up immediately. After all, the redirect target might be something different if you come back the next day. (Sure, it isn't
supposed to change that fast with a 301, but you never know.) If you're meeting a 302, there's no point in following the redirect at all unless you do it immediately. This seems to be the trend with major search engines: follow redirects within a few minutes, unless you happen to have crawled the target page within the past hour or so.
A more worrying observation is that, as of 16:00 on 12 September (Pacific time, so around 04:00 on 13 September, Paris time), on my primary site the robot seems to have forgotten all about robots.txt. Up to that time, it carefully requested robots.txt at the beginning of each visit.
* I have been thinking about excluding robots.txt from the redirect, for various reasons, but haven't yet got around to it.