I run a large website with around 500,000 URLs under "example.com". Years ago, I also had an AMP version of the site, which created another 500,000 URLs under "amp.example.com" (one AMP for each canonical URL).
At some point I realized that having 1M URLs in total was hurting my crawl budget, so I decided to return 410 Gone for every URL under "amp.example.com" in order to get Google to forget them.
I’ve checked with tools like Ahrefs and confirmed that there are no external backlinks pointing to any of the AMP URLs.
The issue is that Googlebot is still crawling thousands of URLs from "amp.example.com" every day.
Now I’m debating between three options:
1) Keep returning 410 Gone for every AMP URL.
2) Remove the DNS A record for "amp.example.com", so the host no longer exists (NXDOMAIN).
3) A hybrid approach: keep returning 410 for 6–9 months, and then remove the DNS A record afterwards.
My main goal is to optimize crawl budget so that Googlebot focuses on crawling and ranking the main domain (example.com).
What do you think is the best option among these three?