Forum Moderators: phranque
Is it OK to do it at all without the permission of the webmaster? Or should I perhaps treat it like a bot and just obey robots.txt files?
Any input gratefully received.
Doing all this makes this action perfectly ethical in my view. Possible uses of data might not be ethical however, but this is a separate question that does not affect ethics of doing the very same thing as Google, Yahoo and every other search engine.
If I'm just storing a page to note changes does that constitute an 'unethical' use of data in your view
In my view what major search engines such as Google do constitues ethical and legal from Fair Use point of view activity.
They crawl and store pages, check for updates regularly, publish them in form of "cached" copy and make lots of money in process and this is deemed perfectly acceptable by majority of people, including those who publish content on open web.
I therefore see no reason why anyone else should not be allowed same freedoms without being accused of doing anything unethical. If anyone has different point of view then I'd like to hear your arguments.