And Now Google's Doing It. JS Stats Show GoogleBot
scooterdude - 8:41 pm on May 14, 2011 (GMT 0)
Thread source: http://www.webmasterworld.com/search_engine_spiders/4312058.htm
As far as I know, robots bypass robots.txt when they are following a direct link to a web page or file.
Some web hosting packages will, in certain circumstances, show a linked listing of every file in the package, and this can have interesting results.
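If you want to see what your own robots.txt actually says about a URL that turned up in your stats, a rough sketch along these lines might help. It uses Python's standard `urllib.robotparser`; the sample rules, the example.com URLs, and the `is_allowed` helper are all made up for illustration. It only reports what a rule-following crawler would be permitted to fetch, so it cannot tell you whether a particular bot really honours the file when it arrives via a direct link.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, used only for this illustration.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url.

    Hypothetical helper: it parses the text in memory instead of
    downloading robots.txt from a live site.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # A file that might appear in an auto-generated file listing.
    print(is_allowed(SAMPLE_ROBOTS, "Googlebot",
                     "http://example.com/private/report.pdf"))
    # A page outside the disallowed directory.
    print(is_allowed(SAMPLE_ROBOTS, "Googlebot",
                     "http://example.com/public/page.html"))
```

If your logs show a bot requesting a URL that this check says is disallowed, that would at least be consistent with the behaviour described above.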