Forum Moderators: phranque
Will bots then follow this pointer to retreive and parse the object?
There seems to be very little documentation on the net reqarding how http compliant bots are. Obviously using a browser works fine and a few little net resourses for checking the headers return that its valid - but is the same for "REAL" bots?
I think the reason you won't find documentation on se bots is because they constantly change and they would want their competitors getting any good ideas off them. I would setup a small site with no real value to test you theory before putting it in practice.
Chris
Welcome to WebmasterWorld [webmasterworld.com]!
This sounds like a bad idea to me. The Standard for Robots Exclusion states that the name of the robots-control file will be robots.txt, and makes no explicit allowances for 404 or 302 server responses. Robots need to be simple in order to be fast, and it wouldn't surprise me if many can't handle any redirection.
I would suggest the following:
Use a server-internal (silent) redirect from robots.txt to your script.
Design your script to log the visit, and then open and serve the contents of your actual robots.txt file from within the script itself.
In this way, the robot is never aware that the script was present, and no redirects are necessary.
HTH,
Jim
It looks like the method lies between the originating host and the user agent which will produce the best filters.
Thanks again for your suggestions/comments.