Google indexing large volumes of (unlinked?) dynamic pages

Here's an odd one for a small site (around 300 pages) with medium pagerank.

In the last week or so Google has indexed a succession of URLs that appear to be unlinked from anywhere. These are in two categories:

- Search result pages

Google is up to 2,130 of these. They are all single word searches for words that do actually appear somewhere on the site. The search itself is simple and does not link to any search results other than next/previous pages.

- Results for an online tool

This involves a user-entered URL (using GET). I've tracked down a few hundred of these that Google has requested, for a bizarre mix of URLs, from massive sites to individual blog posts.

I'm only at the start of my detective work for this (I'm going to grab all of the search keywords indexed and the URLs checked and see if that throws up any clues, and do a bit more in-depth log analysis). I can't find any links to any of the pages indexed on Google or Yahoo.

Here's my initial speculations:

- Someone may be linking to these pages deliberately, perhaps with a bit of noindex/follow . Would seem to be a bit pointless.

- Google might be indexing the pages based solely on the toolbar or another mechanism

- These pages have either been indexed for some time, or have built up over time. It is some change at Google that has made them visible now. This would also explain why the two very different types of page both suffer from the same problem now.

- I've screwed something up so that the pages are being linked to from the site, via some misbehaving script.

I can easily block the content from search engines, but for now I'm interested in tracking down the source, and I may as well see what the effect of thousands of junk pages on the site's performance is! ;)

Anyone have any suggestions as to what may have happened here?

One aside: Google really seems to likes to make troubleshooting difficult these days. The amount of hacking around just to get a complete list of indexed pages is starting to be an annoyance!

Google indexing large volumes of (unlinked?) dynamic pages

Receptional Andy

tedster

Receptional Andy

Receptional Andy

The Contractor

Receptional Andy

Receptional Andy

tedster

Receptional Andy

g1smd

Receptional Andy

The Contractor

Receptional Andy

Receptional Andy

g1smd

Receptional Andy

obottek

Receptional Andy

obottek

Receptional Andy

obottek

incrediblehelp

Receptional Andy

tedster

pageoneresults

pageoneresults

Receptional Andy

obottek

Receptional Andy

FromRocky

Join The Conversation

Moderators and Top Contributors

Hot Threads This Week