Forum Moderators: DixonJones
Well "behind" makes no sense in our case since we just re-appeared. Coincidently, I just now received a reply stating they "would look into it." But from what I have read at numerous newsgroups, robots.txt is intermittently ignored by this crawler. Because our .htaccess is no longer working, I was thinking they had changed or added an additional identifier and would like to know what it is?
(keep outa the wayback machine) Just out of interest why? - tigger
Mainly because our website looked like crap 4 years ago - LOL
Don't expect a followup. I received the exact same message a month ago when I complained about their practice of posting public links to the robots.txt files of sites that block them. I find this extremely rude and spiteful.
Since it's now a public document thanks to Alexa, I've included text in our robots.txt file explaining our reasons for blocking them and also point out sections of their "privacy" policy that explains the tracking the Alexa toolbar does since most people aren't aware when the download what is considered by many to be "spyware."
I understand when people question why I won't have the site archived. I'd like to believe I have valid reasons when I update our sites and am the best judge of its timeliness and the relevance of the material. Links change, we've moved, content is no longer relevant, events have long passed, our opinions may have changed, etc.
WM