Google can learn over time that particular directories aren't worth crawling or de-value the pages contained in those directories. I've witnessed some interesting things:
- pages contained in /dir1/ will have an index rate of 5%, but other directories (/dir2/, /dir3/, etc.) will have a more standard 75%+ index rate. *There are no technical on-page issues with /dir1/ pages that make them unique to other directories.
- when pages from /dir1/ are indexed, they typically will either be de-indexed, or outranked by the same domain's different directory (e.g. thin /dir2/ pages will outrank the superior content on /dir1/)
The way to solve this would be to 1.) fix the content contained in /dir1/ (remove garbage, de-dupe, and enhance remaining pages), then 2.) allow Google to see that /dir1/ is clearly the most valuable content on the domain.
The problem is; how does Google learn that /directory1/ has been cleaned up? It's already been trained to devalue /directory1. Give it time? How much is enough time? We've seen it's been nearly a year and Google hasn't learned.
We've tried manually submitting with Search Console the 10 URLs per day to show Google that our quality has improved. While some of the URLs will get indexed, if I check back a month later they are de-indexed. *Again, this is UNIQUE high quality (to any reader) content that Google currently has very little to no other information indexed from other domains. In other words, these aren't highly contested keywords and Google's index doesn't have much if anything about it from any websites.
It seems clear to me that /directory1/ has a big red "X" on it in Google's eyes.
My Question: Have you ever, or do you know of a situation, where you've switched directories to store your pages? If so, what was the impact on indexation after at minimum 2 months? Want to make sure when we discuss results that we remove the temporary 'honeymoon' indexing that is typical of G.
Yes, obviously 301 redirect, update sitemaps, internal links, yadda yadda. :c)