My goal of late is to find some reason to allow access.
>You can tell the mod what error code to return.
Yes, that's what the Apache docs say. However, the server config can override what we do in .htaccess, and on shared hosting accounts it usually does, especially with response codes.
>What are you doing with the data you collect?
I collect only domain names and don't store any other data.
The crawler just downloads the index page and searches for links to other sites.
>Why does your bot not support robots.txt?
Because I visit every page only once and never come back :)
>Give me a reason to allow your bot :)
Because it's a tiny little puppy ;)
It's made only for fun; I just want to know how many pages it can reach
starting from one.
Now it's above 3,000,000 after one week, and my server is very sloooow, so I
think that's a good score ;)
P.S. Sorry for my English ;)
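For anyone curious what a crawl like the one described above might look like, here is a minimal sketch in Python. It is not the bot's actual code: the names `LinkCollector`, `external_domains`, and `crawl_order` are made up for illustration, and `fetch` is a hypothetical function the caller supplies to download a page. It only mirrors the description in the post: read an index page, keep the host names of links to other sites, and visit each host once.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def external_domains(page_url, html):
    """Return host names linked from html, excluding the page's own host."""
    own_host = urlparse(page_url).hostname
    collector = LinkCollector()
    collector.feed(html)
    domains = set()
    for href in collector.hrefs:
        # Resolve relative links against the page URL before parsing.
        parsed = urlparse(urljoin(page_url, href))
        if parsed.scheme not in ("http", "https"):
            continue  # skip mailto:, javascript:, and similar
        if parsed.hostname and parsed.hostname != own_host:
            domains.add(parsed.hostname)
    return domains


def crawl_order(start_url, fetch, limit=100):
    """Visit each host's index page at most once, breadth-first.

    fetch is an assumed caller-supplied function: URL in, HTML text out.
    """
    seen = {urlparse(start_url).hostname}
    queue = deque([start_url])
    visited = []
    while queue and len(visited) < limit:
        url = queue.popleft()
        visited.append(url)
        for domain in sorted(external_domains(url, fetch(url))):
            if domain not in seen:
                seen.add(domain)
                queue.append("http://" + domain + "/")
    return visited
```

Only host names end up in `seen`, which matches the "domain names only" claim, and the `seen` set is what keeps each host to a single visit. Note that this sketch does not read robots.txt either; Python's standard library has `urllib.robotparser` if you want to add that check before each fetch.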