Forum Moderators: open
[edited by: keyplyr at 1:01 am (utc) on Jun 25, 2018]
[edit reason] obscured private IP address & delinked URL [/edit]
It is supposed to archive French sites, without respecting the robots.txt (because the law gives them this right!)Say what now? Does the law also give websites the obligation to let them crawl, or do we still retain the right to physically block them? What if we don't live in France? (As it happens, I personally approve of archiving such as the Wayback Machine, but wtf.)
at some point in a near future, it will be forbidden to block the access to a site, if a visitor refuses cookies?I don't think that will have any effect on robots, since all they have to do is “accept” the cookie and then quietly throw it away. A cookie is only meaningful on a subsequent request, when you send the cookie back to the originating site. And if at this point the site says “Oi! I know you, and you’re supposed to have a cookie at this point!” ... well, that’s a whole nother field of interesting discussion.
I don't think that will have any effect on robot