Forum Moderators: open

Message Too Old, No Replies

Any value in letting AI bots scrape my website?

         

Skips

7:19 am on Oct 16, 2023 (gmt 0)

Top Contributors Of The Month



I noticed a recent increase in AI bot traffic on my site. I wouldn't bother much, if it weren't for the fact that website has an immense number of pages, significant part of which are dynamically generated and there is a fair amount of computation behind each generated page. The bots seem quite greedy, making requests every second, sometimes several per second, so it is starting to affect site performance. Not to the point where it becomes slow, but to the point where it stops being lightening fast (just went through a recent server upgrade). So, my question is - is there any use for me as a webmaster to have AI bots cruise around my site or should I just disallow them in robots.txt?

I'm strongly leaning toward disallowing bots, as I don't see any value they could potentially bring to the site - they are only using information my website provides without any linking or references to my site, so they can't bring any traffic to me. Right? I have zero interest in paying for server resources just to provide various AI models with tons of data. Or is there any potential benefit in letting them scrape my site, something that's not immediately apparent to me?

not2easy

12:06 pm on Oct 16, 2023 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



I see no reason to host an AI bot cafe for the benefit of their users.

Skips

2:37 pm on Oct 16, 2023 (gmt 0)

Top Contributors Of The Month



Thanks. Right along the lines of what I was thinking :)

engine

3:32 pm on Oct 16, 2023 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



Skips, there's some great info over here on bots and blocking [webmasterworld.com...]

tangor

9:19 pm on Oct 16, 2023 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Start with robots.txt (voluntary, easily ignored) and then move to deny if necessary.

I see no reason to feed my competition (AI) with new intelligence it is too stupid to produce organically.

Skips

7:26 pm on Oct 25, 2023 (gmt 0)

Top Contributors Of The Month



Thank you for the info, engine, and for the tips, tangor (sorry for delayed reply, didn't get email notification about your answers). I have disallowed 3 specific AI bots in robots.txt for now. Monitoring bot traffic. If new bots keep coming up, I'll probably just allow major SE crawlers and disallow all others. For now, seems not too bad.