Forum Moderators: Robert Charlton & goodroi


Robots.txt case-sensitivity detection causing loss in rankings?


doughayman

1:29 pm on Jun 3, 2007 (gmt 0)

10+ Year Member



Can anyone offer any advice or suggestions on the following:

I have a 10+ year-old established domain that is hosted on an ancient Windows webserver platform (O'Reilly & Associates WebSite V1.1). I have a lot of custom code written here, so it is not practical for me to port to IIS or another webserver.

Anyway, I decided years ago to use mixed case in my URLs (I know, I can hear the groans......). For example, my home page looks something like this:

www.domain.com/HomePage.htm

I have ranked high in Google for many keyword terms for the last 10 years. Googlebot regularly indexes my home page:

www.domain.com/HomePage.htm

On occasion (maybe quarterly), Googlebot has fetched the all-lowercase version of my home page URL:

www.domain.com/homepage.htm

Whenever this happens, several days later I get hammered in the SERPs (probably a duplicate content penalty - Google thinks I have 2 identical pages, even though my webserver serves up the same page either way - Windows cannot distinguish between upper and lowercase filenames).

Whenever this happens, about a week later my SERPs return - my guess is that Google figures out (perhaps in a different phase of filtering) that this is one and the same page, removes the "duplicate penalty", and all is restored to normal. Like I said, this happens about once a quarter (4 times a year).

Now, to alleviate this quarterly problem, I created a ROBOTS.TXT file and attempted to disallow the all-lowercase file entry from being spidered. In robots.txt, I did this via:

Disallow: /homepage.htm

2 weeks ago, this lowercase entry was finally fetched by Googlebot (it was noted on the Google Sitemaps Diagnostics page, under "URLs restricted by robots.txt"). Several days later, my rankings tanked incredibly - to the point that I'm at about position 800 for terms that I normally ranked in the Top 10 for, AND after waiting my normal week (plus a 2nd week), these rankings have not returned.

My simplistic thought here is that since I am no longer being hit with the duplicate content penalty (by virtue of trapping the lowercase page entry with robots.txt), I am instead being hit with some other sort of penalty. My naive solution at this point is to restore things to where they once were (i.e., remove the robots.txt Disallow statement), but that means I may have to wait 3 months to test this theory out.

BTW, this page has had minimal content changes made to it over the last couple of years, so there is nothing else on my end that could have caused this problem.

Of course, a recent algo change could have caused this new behavior, but in my opinion the timing is too coincidental with Googlebot's filtering on my robots.txt entry.

Does anyone have any further thoughts on this?

g1smd

11:58 pm on Jun 3, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



You are being hit because all versions of your homepage have been completely removed from the index.

As far as I know, the entries in robots.txt are not case sensitive.

If so, all case permutations of the names stated there are removed.

doughayman

12:59 am on Jun 4, 2007 (gmt 0)

10+ Year Member



g1smd,

I believe the entries are case-sensitive, as that is what the Robots.txt spec says. Also, my homepage (HomePage.htm) is still in the Google index - it hasn't been blown away at all.
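For what it's worth, the case-sensitivity is easy to demonstrate with Python's standard-library robots.txt parser (the domain and rules below are hypothetical stand-ins mirroring the setup in this thread):

```python
from urllib import robotparser

# Hypothetical rules mirroring the robots.txt described in this thread
rules = [
    "User-agent: *",
    "Disallow: /homepage.htm",
]

rp = robotparser.RobotFileParser()
rp.modified()  # mark the rules as "fetched" so can_fetch() will answer
rp.parse(rules)

# Path matching is a literal, case-sensitive prefix comparison:
print(rp.can_fetch("*", "http://www.example.com/HomePage.htm"))  # True
print(rp.can_fetch("*", "http://www.example.com/homepage.htm"))  # False
```

So a single lowercase Disallow line leaves the mixed-case URL fetchable, exactly as the spec describes.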

Asia_Expat

4:56 am on Jun 4, 2007 (gmt 0)

10+ Year Member



Robots.txt IS case-sensitive, and I got caught out by this. I fixed the issue by carefully adding entries to the robots file for both the upper and lowercase versions of URLs from some 'out of the box' forum software that has duplicate content issues, which I'm successfully controlling with robots.txt.
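A minimal sketch of that approach (the paths here are hypothetical) - because matching is literal, each case variant you want blocked needs its own line:

```
User-agent: *
# each unwanted case variant must be listed explicitly
Disallow: /printthread.php
Disallow: /PrintThread.php
```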

[edited by: Asia_Expat at 4:57 am (utc) on June 4, 2007]

doughayman

12:17 pm on Jun 5, 2007 (gmt 0)

10+ Year Member



Well, my rankings came back yesterday, so it just seems this was a longer-than-normal period. Hopefully, with the hand-crafted ROBOTS.TXT file catching the lowercase entries, I can eradicate this problem over time. What a pain!

doughayman

11:27 am on Jun 14, 2007 (gmt 0)

10+ Year Member



< System: The following message was spliced on to this thread from another location >

Hello all,

I'll try to be as succinct here as possible. I have a Windows-based webserver that, for obvious reasons, cannot discriminate between upper and lowercase URLs (both OS and webserver). For a multitude of reasons I cannot switch to a new technology platform, so please do not recommend this as a solution.

I have had a domain for about 11 years, and have ranked well in the SERPs during this time. The main page of this domain has typically had a PR4, but has recently gone to a PR0.

Google, in accordance with their specs, has always fetched my main page in mixed case, which is how I designed it (I know, probably not the best decision, but this is what I decided on back in 1996).

My main page has the following format:

www.domain.com/HomePage.htm

On occasion (about 4 times per year), I notice that Googlebot fetches the all-lowercase version of this URL:

www.domain.com/homepage.htm

Whenever this happens, my rankings would tank for about a week, only to return to their previous levels. Also, on occasion, I would notice that the mixed-case and all-lowercase versions were BOTH in the Google index. I attributed the week-long tanking to some form of duplication penalty. It would always ultimately get sorted out by Google, and everything would return to normal after about a week.

In an attempt to remedy this quarterly week-long tanking, and through some advice on this website, I enabled a ROBOTS.TXT file that specifically disallowed the all-lowercase entry from being spidered.

Lo and behold, last month Googlebot attempted to spider my all-lowercase URL, and it was "trapped" by ROBOTS.TXT (I verified this through Google Sitemaps tools).

Since then, my mixed-case page has been reduced from a PR 4 to a PR 0, and I have tanked (a la -950 penalty) for most (but not all) 2, 3, and 4-word phrases in the SERPs. I am still in the Google index for this page (I'm not supplemental), but near the very bottom (-950).

The all-lowercase entry is no longer in the Google index, and the mixed-case version stands alone, but I now have a PR 0 for this page, and my rankings have not come back at all after several weeks. A tanking has never lasted this long for me in 11 years. Hence, I'm attributing this latest ROBOTS.TXT action as the catalyst for the recent behavior.

Can anyone shed any light on what may have happened here, and perhaps make some recommendations on what to do next? Has anyone else here had a similar plight?

Thanks in advance!

[edited by: tedster at 12:35 pm (utc) on June 14, 2007]

doughayman

1:10 pm on Jun 14, 2007 (gmt 0)

10+ Year Member



P.S. to last post,

When I use some of the public-domain PageRank checkers available on the internet, the different case variants of my URL show different PageRank values:

- www.domain.com/HomePage.htm has a PR of 4.

- www.domain.com/homepage.htm has a PR of 0.

When I access this page in my IE browser, my server doesn't care about case, but the page that gets rendered shows a PR of 0 (matching the all-lowercase version of my URL - which, BTW, is not currently in the Google index).

g1smd

10:50 pm on Jun 14, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I assume that you host using IIS. This is a known "Duplicate Content" issue. It is a site issue, not a browser issue.

Each page of your site is reachable at a massive number of alternative URLs - one for every possible combination of upper and lowercase characters.

You need to fix this so that only one URL directly returns content, and all the other combinations return either a 404 or a 301 redirect to the canonical URL for that page.
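Since the server in question does support custom CGI scripts, here is one sketch of that per-page 301 - assuming page requests can be routed through a script at all, and using a hypothetical CANONICAL lookup table:

```python
#!/usr/bin/env python3
# Sketch of a case-canonicalizing 301 redirect as a CGI script.
# Assumes the server can route page requests through this script;
# the CANONICAL table and paths are hypothetical examples.
import os
import sys

CANONICAL = {
    "/homepage.htm": "/HomePage.htm",
}

def redirect_target(path):
    """Return the canonical URL if the request used the wrong case, else None."""
    canonical = CANONICAL.get(path.lower())
    if canonical is not None and path != canonical:
        return canonical
    return None

if __name__ == "__main__":
    path = os.environ.get("PATH_INFO", "/")
    target = redirect_target(path)
    if target:
        # Wrong-case request: permanent redirect to the one true URL
        sys.stdout.write("Status: 301 Moved Permanently\r\n")
        sys.stdout.write("Location: %s\r\n\r\n" % target)
    else:
        # Canonical (or unknown) request: serve the real page (elided here)
        sys.stdout.write("Content-Type: text/html\r\n\r\n")
```

Whether the O'Reilly WebSite server can actually dispatch arbitrary URLs to a CGI handler is the open question; if it can only map fixed script paths, this won't help.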

doughayman

12:49 am on Jun 15, 2007 (gmt 0)

10+ Year Member



g1smd,

Actually, I'm running a webserver called O'Reilly "WebSite", which runs on my Windows 2000 platform. It is old technology, no longer supported, but I have a lot of proprietary customized CGI scripts (not directly portable to IIS) that keep me on this platform.

I understand the canonical issues, and I made a big mistake years ago with the mix of upper/lowercase. However, the thought of deleting these pages, waiting 6 months, and then sticking to a single case is too large a risk and hit.

I'm looking for other options as well.

Thanks!

g1smd

1:21 am on Jun 15, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



A 301 redirect to the canonical URL on a per-page basis is about the only thing that can save you.

Some clever scripting to add a meta robots noindex tag when the "wrong" URL is requested could also help, but that will still play havoc with internal linking, spidering, and PageRank.
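If page output could be run through a script at all, the conditional tag described above might be sketched like this (the function and paths are hypothetical; note this is an HTML meta tag emitted per request, not a robots.txt feature):

```python
# Sketch: emit a robots noindex tag only when the requested path's
# case differs from the canonical spelling. Names are hypothetical.
def robots_meta(requested_path, canonical_path):
    """Return a noindex meta tag for wrong-case requests, else an empty string."""
    same_page = requested_path.lower() == canonical_path.lower()
    if same_page and requested_path != canonical_path:
        return '<meta name="robots" content="noindex, follow">'
    return ""

print(robots_meta("/homepage.htm", "/HomePage.htm"))  # the noindex tag
print(robots_meta("/HomePage.htm", "/HomePage.htm"))  # empty string
```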

doughayman

1:37 am on Jun 15, 2007 (gmt 0)

10+ Year Member



Unfortunately, my webserver does not allow for any native 301 redirects via administration/configuration - I've tried that. And yes, I would have had to do it on a per-page basis. Moreover, I don't believe (though I would need to verify) that I could implement this via any sort of lookup table with inbound scripting - I don't think my webserver supports scripting that can intercept an incoming request. I would probably need to obtain and modify the server's source code to make this work. Unfortunately, a non-starter.

As for your other recommendation, I would also need to check, but I'm not aware of a ROBOTS.TXT noindex tag... I used the "Disallow" directive, which did correctly prevent access to the all-lowercase entries, but it seems to have been the catalyst for my current state in Google. I cannot introduce a meta noindex tag in my actual URL files, since Windows doesn't let me discriminate between filenames that differ only in case (they all map to a single file).

Thanks for your concern and suggestions. All good thoughts.