Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

Google indexing large volumes of (unlinked?) dynamic pages

         

Receptional Andy

8:48 pm on Oct 28, 2007 (gmt 0)



Here's an odd one for a small site (around 300 pages) with medium pagerank.

In the last week or so Google has indexed a succession of URLs that appear to be unlinked from anywhere. These are in two categories:

- Search result pages

Google is up to 2,130 of these. They are all single word searches for words that do actually appear somewhere on the site. The search itself is simple and does not link to any search results other than next/previous pages.

- Results for an online tool

This involves a user-entered URL (using GET). I've tracked down a few hundred of these that Google has requested, for a bizarre mix of URLs, from massive sites to individual blog posts.

I'm only at the start of my detective work for this (I'm going to grab all of the search keywords indexed and the URLs checked and see if that throws up any clues, and do a bit more in-depth log analysis). I can't find any links to any of the pages indexed on Google or Yahoo.
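As a sketch of that log analysis (the log file name, the /search path and the "q" parameter are all assumptions here, not details from the site in question), something like this would pull out the single-word queries Googlebot has requested:

```python
import re
from collections import Counter

# Hypothetical example: extract search queries requested by Googlebot from a
# combined-format access log. The log name, the /search path and the "q"
# parameter are assumptions -- adjust them to the site under investigation.
LOG_LINE = re.compile(r'"GET (/search\S*q=([^&\s"]+)\S*) HTTP')

def googlebot_queries(logfile="access.log"):
    """Count the search keywords Googlebot has requested."""
    queries = Counter()
    with open(logfile, encoding="utf-8", errors="replace") as fh:
        for line in fh:
            if "Googlebot" not in line:
                continue
            match = LOG_LINE.search(line)
            if match:
                queries[match.group(2)] += 1
    return queries
```

Dumping `googlebot_queries().most_common(20)` should show whether the "searched" words really do look like a word list harvested from the site itself.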

Here's my initial speculations:

- Someone may be linking to these pages deliberately, perhaps with a bit of noindex/follow . Would seem to be a bit pointless.

- Google might be indexing the pages based solely on the toolbar or another mechanism

- These pages have either been indexed for some time, or have built up over time. It is some change at Google that has made them visible now. This would also explain why the two very different types of page both suffer from the same problem now.

- I've screwed something up so that the pages are being linked to from the site, via some misbehaving script.

I can easily block the content from search engines, but for now I'm interested in tracking down the source, and I may as well see what the effect of thousands of junk pages on the site's performance is! ;)

Anyone have any suggestions as to what may have happened here?

One aside: Google really seems to like making troubleshooting difficult these days. The amount of hacking around just to get a complete list of indexed pages is starting to be an annoyance!

tedster

3:23 pm on Feb 26, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Here's a new related report:

Google Indexed My Site Search Results Pages [webmasterworld.com]

Receptional Andy

4:30 pm on Feb 26, 2008 (gmt 0)



There was another two over here [webmasterworld.com] and here [webmasterworld.com] too. My investigations are ongoing!

[edited by: Receptional_Andy at 4:40 pm (utc) on Feb. 26, 2008]

pageoneresults

4:58 pm on Feb 26, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I see a common denominator amongst a couple of these: WordPress.

Is there some sort of Plug-in or Widget that takes search queries and generates pages on the fly for the search engines?

Or, is there a flaw in the "search" script used? Is it possible someone found a way to run a bot against that search script and generate thousands, tens of thousands, even millions of pages? All from WordPress and other similar CMS platforms? I don't mean to pick on WP, that one just happens to be the one mentioned in the above referenced topics. This could be happening with any mass produced software.

Receptional Andy

5:02 pm on Feb 26, 2008 (gmt 0)



I've counted at least four different search mechanisms so far. Two I can absolutely confirm are not related to WordPress. They're even different technologies (one ASP, one PHP, one CGI script).

LunaC

5:31 pm on Feb 26, 2008 (gmt 0)

10+ Year Member



I've been seeing Googlebot going through search for a while with one of my sites as well. It's always for a single word that does exist somewhere on my site, but often it's something a human visitor would be extremely unlikely to search for (e.g. footer, red, etc.; always one word).

Going through the search logs (my site's search logs, that is, as well as server logs), I can't find where those terms have ever been searched other than by Googlebot.

I'd always had a noindex meta tag on those pages just in case this happened, but I'd allowed it to follow the links if it wanted.

Sadly I had a rather bumpy emergency move to another host last week with this site and have had to change the URL of the search, so I can no longer see if it's still being crawled. The new URL is not being crawled yet. (The new host only allows CGI scripts inside the cgi-bin.)

I use the same search script on 2 other sites currently and have never seen Googlebot do this on the others.

I ran a few link checkers through and nowhere am I linking to any search results. It is a bit odd.

Here's an old post about it: [webmasterworld.com...]

FWIW, mine isn't using Wordpress (anywhere on the site), it's FDSE.

[edited by: LunaC at 5:35 pm (utc) on Feb. 26, 2008]

pageoneresults

5:46 pm on Feb 26, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Let me ask, doesn't the passing of the query in the URI string present challenges to begin with? I mean, doesn't that expose a vulnerability for indexing? Bear with me here, I'd like to hear more about this. We typically don't pass anything in the URI string at the public level. Behind a login is something different, and even then, we're careful not to expose too much of the query routines in the URI.

Google have become much more efficient at indexing URI strings with queries in them. How they are generating them is the main question I think. If there are no references to those on the site itself, then I would conclude that there are references somewhere. I don't think Googlebot is smart enough to latch on to that initial query and start generating a list of keywords, no, not yet anyway. Something caused that to happen. For example, a set of cloaked pages hit the net with references to all of those single word queries. They were taken down immediately after indexing. You'd never find them but the bots got them and are now calculating them into the equation, temporarily.

Or, those references are sitting on a parked domain somewhere in the bowels of the Internet and they cannot be easily found. One way or the other, something is causing the effect. Is it your own internal search script that maybe has a flaw? Or, are there other external factors at play that you cannot see due to the technology involved?

Receptional Andy

7:49 pm on Feb 26, 2008 (gmt 0)



Let me ask, doesn't the passing of the query in the URI string present challenges to begin with?

It depends on how you look at it. It is certainly the correct approach for search forms and many other types of content. As the W3C say in URIs, Addressability, and the use of HTTP GET and POST [w3.org]:

Use GET if:
- The interaction is more like a question (i.e., it is a safe operation such as a query, read operation, or lookup).

Use POST if:
- The interaction is more like an order, or
- The interaction changes the state of the resource in a way that the user would perceive (e.g., a subscription to a service), or
- The user should be held accountable for the results of the interaction.

Using POST simply to prevent indexing would be counterproductive IMO.
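To illustrate the distinction in Python terms (the /search endpoint here is purely hypothetical): with GET, the whole query lives in the URL, which is exactly what makes results bookmarkable and, by the same token, spiderable.

```python
from urllib.parse import urlencode, urlparse, parse_qs

# With GET, the query is part of the URL itself, so a result page is
# addressable: it can be bookmarked, linked to -- and spidered.
url = "http://www.example.com/search?" + urlencode({"q": "blue widgets"})

# Anyone (or any bot) holding this URL can reconstruct and replay the query:
recovered = parse_qs(urlparse(url).query)["q"][0]
print(url)        # http://www.example.com/search?q=blue+widgets
print(recovered)  # blue widgets
```

A POST search has no such URL, which is why POST "hides" results from spiders, but only at the cost of addressability.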

I agree that the references (if they exist) are likely to be hard to find, but they must be available somewhere. To trigger spidering of half a million URLs requires a bit more than a single page. I would imagine that at least one source would crop up somewhere - in logfiles, webmaster tools, yahoo link data etc.

Certainly, in the two examples I've been able to look at in detail, there is no common pattern other than that search results use GET, which frankly is the right approach. I can find no other 'vulnerability'.

The overriding lesson is to block bots from these types of forms by default. Unfortunately, this is by no means desirable in all cases, since I have at least one form being 'spammed' (if that is the word) where, within reason, spidering is fine, since the data returned is valuable and unique.

A quick summary of what I've gathered from the small number of cases I've seen:

- Search forms use GET
- Search result pages are spiderable
- Google spiders results for single words taken from words present on the site somewhere (this necessitates spidering of the whole site, or at least most of it, to occur)
- I saw numbers of pages spidered go from a few hundred to tens of thousands and upwards over the course of a few months
- If links to the content exist, they are deliberately hidden in some way, and visitors have not followed the links
- The activity is not limited to search forms, although these are most common and so the most prominent example
- I've not (yet!) seen any spidering of non-form URLs with additional parameters, even those that accept query strings to change output
- No other spider has requested these URLs

[edited by: Receptional_Andy at 7:49 pm (utc) on Feb. 26, 2008]

bouncybunny

11:20 pm on Feb 26, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Receptional Andy, I'll continue this discussion here, instead of my essentially similar topic elsewhere [webmasterworld.com...]

I may have missed it, but what kind of topics are covered by your dynamic URLs?

I know that my site's knowledge base is linked to by at least one academic database source that acts almost as a kind of scraper. It's not a true scraper, in that it only takes URLs and then returns them as search results for its own users, pointing users directly back to my site.

I have yet to find out if this has any bearing on this though.

Receptional Andy

11:36 pm on Feb 26, 2008 (gmt 0)



Hi bouncybunny,

To address the point you mentioned in your other thread:

What appears to be more brilliant, is that it is not only obvious keywords (which Google will know from the general subject of my site), but really obscure technical (but extremely relevant) keywords from my knowledge base.

I'm seeing it for every word on a site, obscure or not. I haven't had a chance to verify that it's a complete word list, but it certainly appears that way. I'm actually thinking more and more that this is the result of some kind of spam or sabotage. There are some 'clues' in the pages spidered that point towards this, I think:

- Only single words. Clearly, content discovery would work better with more intelligent input
- Bad parsing. I've seen a couple of examples of nbsp passed as a search query. But maybe I'm giving Google too much credit, considering 9 million 'revealed' results [google.com] for that as a keyword
- URLs. On one site, this spidering also affected a form which expects a URL as input. I had a bit of spare time this evening to analyse the many thousands of URLs requested from this form by Googlebot. They are a very particular bunch, focussed almost entirely on 'social networking' types of URLs

It's still inconclusive as far as I can tell, but I'm determined to track down the source. If it's sabotage or spam, frankly it doesn't work very well as I've seen no impact other than server logs are getting quite big, and Google's Webmaster Tools taking on some surreal aspects.

In terms of links, I considered this as a possibility: the URLs get spidered because of a link to the URL containing a relevant query string, which is then subject to mass spidering/linkage based on previous input. The fact that I've seen URLs passed when the query string should be a URL points at signs of intelligence, at least.

bouncybunny

3:18 am on Feb 27, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hi Andy

I'm seeing it for every word on a site, obscure or not

Which makes our cases different to some degree. I'm wondering if Googlebot has followed one specific link from somewhere and then just looked at the link titles of the search results and run random search queries based on those. This might explain the on-topic nature of the indexed URLs. If this is the case, then this must be something that has been intentionally set up by Google (or as a by-product of something else). Because simply following the links of search results only takes the user to static HTML pages with static looking URLs.

- Only single words

Check, same as mine. I haven't found any 'blue+widgets' URLs indexed.

- Bad parsing...

Not in my case. Of course this might simply be due to our website software working in different ways. In my case, it really is picking up the URLs exactly as if a human user had searched for a keyword. It then follows the URLs in results pages using the <next> and <previous> links, which would make sense.

I think I will block these with a robots disallow, or noindex. Which would be better in this case, do you think?

theBear

10:17 pm on Feb 27, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Sure sounds like someone is feeding Google URLs pointing to your search system, already prefilled. The motive could be to make your site look like spam.

You have to be very careful with scripts, too many ways to self inflict major damage or for others to cause mischief ;-).

Receptional Andy

10:46 pm on Feb 27, 2008 (gmt 0)



bouncybunny, apologies that I missed your question:

I think I will block these with a robots disallow, or noindex. Which would be better in this case, do you think?

I opted for a robots disallow, since the number of requests from Google was veering towards the excessive, and robots.txt (usually) prevents spider requests. Meta elements encourage repeat spidering, since spiders need to check whether they've changed.
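For reference, the disallow amounts to a one-liner (assuming, purely for illustration, that the search script lives at /search.php):

```
User-agent: *
Disallow: /search.php
```

Unlike a noindex meta element, this stops the requests themselves; the trade-off is that URLs already discovered can linger in the index as URL-only entries.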

However, there's a difference between URLs that have already been 'discovered' and those that spiders have never visited. Discovered URLs hang around for some time, in one way or another.

theBear:

You have to be very careful with scripts, too many ways to self inflict major damage or for others to cause mischief ;-)

I understand where you're coming from, but I haven't looked at a static HTML site for a number of years. The web is driven by dynamic webpages, most of which rely on parameters in one form or another.

Perhaps I'm naive, but I haven't seen this kind of large-scale sabotage/changed spidering behaviour before, which is why I'm so keen to look at it in more detail.

bouncybunny

11:01 pm on Feb 27, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



bouncybunny, apologies that I missed your question:

No problem, and thanks. I've just gone ahead and disallowed Googlebot, but not stopped any other spiders. The AdSense spider has been indexing these pages for a couple of years with no apparent ill effect. But that would make sense, as advertising on the results pages would feed URL information back to Google about the generated URLs. But Googlebot is supposed to be separate, surely? Or is this me being naive?

Yahoo and MSN don't appear to have found their way there at all and I have been unable to find any external (or internal for that matter) links to these urls from elsewhere.

theBear

2:57 am on Feb 28, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Receptional Andy,

I just looked at a site I'm quite familiar with that has a search routine that produces pages with adsense ads.

The site has been live since late spring 2005, it shows no signs of the ad handling bots handing off urls from that to the system that produces search results.

There are search routine results in the index from generated pages that the regular Google bots are allowed to harvest.

It is quite possible that it does happen; however, I can't for the life of me think of a reason that Google would do such a thing. Now, someone else playing in your sector, or just out to see what "this" does, might cause this to occur, intentionally or not.

Remember while your mileage may vary, most computer systems are quite consistent.

This reminds me of the query string issues that occur from time to time.

Receptional Andy

2:41 pm on Feb 28, 2008 (gmt 0)



Remember while your mileage may vary, most computer systems are quite consistent.

This is true in the sense that if you put the same data in, you should get the same result back. But having a site search and AdSense does not mean putting the same data in, so that doesn't qualify as a test of consistency.

Besides, I think adsense is a red herring, because the two examples I've looked at in detail have no adsense or indeed any other advertising.

I've looked at a number of sites across different industries, technologies and even different countries/hosting etc. The vast majority are not experiencing strange pages being spidered. But that does not really tell us anything. In terms of criteria, I can certainly rule some things out:

- Technology used doesn't matter
- It's not to do with sites being hosted in the same place, or any other 'links' like WHOIS data or actual hyperlinks
- Industry is irrelevant, and this affects non-commercial sites too
- There is no 'bug' in forms that causes this to happen

I can also say that:

- It requires fairly large scale spidering to harvest data
- If there is any manual intervention at all, it's minimal
- All that's required is a GET form
- It's only affecting Googlebot

ecmedia

3:41 pm on Feb 28, 2008 (gmt 0)

10+ Year Member



I have had the same problem using b2evolution CMS. It comes with a default search script and I thought that was the culprit. After removing it a month ago, the problem continues.

pageoneresults

4:14 pm on Feb 28, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Here's a simple routine that I use to get a snapshot of indexed URIs...

"example.com"

That's it. Just enter your domain in quotes and click search. That's the first step. Go to Advanced, change to 100 results per page and begin perusing. The first thing you may find is lots of references to your domain that may not be linked. You'll visit those pages and, in many instances, not be able to find your domain name reference even though it was shown in the snippet of the SERPs. Why is it there and how did it get there? Why isn't it showing in the Cached version? Why isn't there a Cached version?

You're probably going to uncover all sorts of stuff you didn't know existed. I've done some digging in this area myself and I'm convinced there is all sorts of foul play happening out there and each day I learn something new, I'm that much closer to being able to put all the pieces of the puzzle together. :)

That one simple search above is the start of the process. There are other "simple" searches that will uncover stuff that will leave you wondering why and how. There are so many rogue bots out there today that scrape sites and regurgitate who knows what. Like you said earlier, maybe a routine went bad during a scrape and all they could get were single keywords.

There's a topic in The Wall right now that discusses a new piece of software that turns Google into a Vulnerability Scanner. Download that program and review the list of Dorks and see if you are susceptible to any of those. Review the advanced queries they used in that software to find those vulnerabilities. After seeing that program at work, I'm a believer that there is an underground network of saboteurs in our industry! Ting, ting, ting...

theBear

4:19 pm on Feb 28, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



On 2/28/2008 at some point in time pageoneresults was heard to thusly say,

"I'm a believer that there is an underground network of saboteurs in our industry! Ting, ting, ting... "

Yeppers ... but don't forget that foot guns are also at work in some cases.

theBear

4:32 pm on Feb 28, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Receptional Andy,

Let me suggest that:

- It's only affecting Googlebot

The most you can say is that Google has actually indexed something and you have no idea why it indexed that something, and further you don't see the other search engines indexing those items.

A possibility would be that someone is only feeding Googlebot and not the other bots.

You really need to see what is going on with various downloading systems (scrapers and rogue bots are downloading systems).

What I'd like to know is if those indexed pages slowly exit the index (because there is no on site link to the page).

pageoneresults

4:39 pm on Feb 28, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



What I'd like to know is if those indexed pages slowly exit the index.

My guess would be that they do. Depending on the routine, someone or something may be dumping those URIs into the index at some sort of interval. Put em up, let them get indexed, pull them down. Wait a month or so, put them up, let them get indexed, pull them down. I don't know about you, but common sense tells me that has to have some sort of effect. What, I don't know. But, in many instances when people are suffering in the SERPs, the backtracking "always" leads to something from this realm of discussion.

Receptional Andy

6:12 pm on Feb 28, 2008 (gmt 0)



You really need to see what is going on with various downloading systems (scrapers and rogue bots are downloading systems).

To a large extent I do. I haven't found a request for any of the odd URLs from anything other than Googlebot. One site employs a pretty effective bot trap too, and there hasn't been anything noticeable there either.

if those indexed pages slowly exit the index

I can't say unfortunately, as for various reasons the examples I have control over have now blocked this via robots exclusion. So, the pages are going to trickle away as a result of that.

As best I can tell, the pages went pretty much straight into the supplemental index, and do pages ever disappear out of there anyway?

What I'm hoping to see is the type of snippet you see for pages blocked in robots.txt but with links to them being spidered still; that might allow identification of the link text used, which would be one step nearer to finding if and where links to the URLs exist.

Incidentally, while Google is much better at handling exact duplicates, that does not discount the possibility of this occurring on any site, since you can append useless variables to the end of any URL. Perhaps unlikely to have an effect (but then, the same applies to the URLs I've seen indexed: zero impact so far), although on such a large scale, who knows?

You could certainly use such techniques to try to damage a website's theme, and perhaps there are some believers in the 'supplemental ratio' out there.

Maybe Google are going to bring back the count of pages indexed on their homepage and are determined to win this time ;)

theBear

6:31 pm on Feb 28, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



"and do pages ever disappear out of there anyway?"

Yes after a period of time.

keepontruckin

12:06 am on Feb 29, 2008 (gmt 0)

10+ Year Member



It would be nice if google made available in the google webmaster tools the origin of ALL the external links with a cache of the page. Seems like a simple solution to the problem. Then we would all know where the links originate and take some appropriate action. All this guessing is enough to drive one to drink.. cheers!

Oliver Henniges

6:39 pm on Feb 29, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



How funny, I came across a similar question in this thread [webmasterworld.com], because I recently thought it was quite convenient for my visitors to make the results of my internal product search function bookmarkable, so that they may leave notes in forums independent of my internal categorial organization.

> They're valid URLs, valid parameters and valid content

What do you mean with valid? Syntactically correct?

You may add a sequence like ?q=rubbish to any URI and the output will in most cases be the same as with no parameters at all. The "validness" of such a URI is absolutely independent of the actual content of the database, and independent of whether the requested page is a GET CGI at all. No spider coming across such a link will be able to decide how the output is generated, and the RFC specifications necessarily cannot prescribe whether myfile.xyz is allowed to be a true CGI, nor what CGIs may and may not do.

I am not sure whether I really understood what this thread is about; I just roughly went through it and cannot provide case data from my logs. All I'd like to stress is that the whole topic seems closely related to the duplicate-content filters and the canonical issues.

It has been stressed on many occasions that it is actually Google's job to get this fixed, not that of us webmasters (e.g. by defining our preferred domain in Webmaster Central and redirecting www vs non-www adequately). But the more I think about it, the more I see that this is nearly impossible.

Maybe the whole idea of indexing (static?) content is completely outdated in times where most content is generated dynamically.

Receptional Andy

6:47 pm on Feb 29, 2008 (gmt 0)



What do you mean with valid? Syntactically correct?

Yes, but more than that.

Valid URLs: a 200 response, and a URL that it is expected visitors will request.

Valid parameters: parameters that the script is expecting to be present in a GET request, in order to deliver an appropriate response. They aren't random variables.

Valid content: the parameters result in an expected response. Unique content will be returned in response to a request for a URL containing these parameters.

closely related to the duplicate-content-filters and the canonical-issues

To be blunt, this is not really related to this thread at all. I want to determine the cause, not the effect (which, incidentally, is nil. I've not seen any impact on search results whatsoever).

[edited by: Receptional_Andy at 6:48 pm (utc) on Feb. 29, 2008]

Oliver Henniges

9:36 pm on Feb 29, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



> To be blunt, this is not really related to this thread at all.

I apologize for repeating what pageoneresults had already tried to insinuate:

How do you check that the input parameters are "expected" and that the response is "unique"? What does your script throw out if input is as unexpected as your logfile entries? Does the CGI 301 to a more "static" page, or simply throw out the result as-is under exactly the requested URI?

Without any final redirect, it is quite likely that the response pages to such a high number of arbitrary words from the website will come up with very, very similar, if not identical, content (though details depend on your specific site, of course). Sorry, but I think this IS a potentially relevant cause: if a competitor manages to get thousands of such similar idempotent URIs from your site into Google's index, it may happen that a duplicate-content filter is applied to the whole site.

So, maybe, someone is just testing parameters for such a sabotage strategy on your small site before launching a larger attack in his real field.

Another speculative reason: google is collecting data on semantic relations.

And for the online-tool: What happens to the URL typed in after posting? Any copyright-issues involved? Any automated requests to sites that google owns, like youtube, violating their TOS?

Receptional Andy

9:55 pm on Feb 29, 2008 (gmt 0)



I'll give an example which I think illustrates the problem.

Assume a tool which translates words from one language to another. It's useful that people are able to translate these words, bookmark the resulting URLs, and easily get to results for words they haven't checked yet. A GET form, with visible URL parameters is the appropriate interface to such a system.

In the context of a site dedicated to translating words, this wouldn't even be undesirable behaviour. In the context of the sites I've seen this happen on, it's an annoyance. Hell, if I was that way inclined I would just capitalise on this volume of new pages.

I have wider concerns. There are many degrees of separation between the sites I've seen experience this effect. So, any GET form needs to be blocked in some way, right? Or perhaps, any URL parameter that doesn't come from a 'real' visitor? That's a major headache.

Oliver Henniges

12:44 pm on Mar 1, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



> I would just capitalise on this volume of new pages.

Yepp. Make people bookmark those pages and make them throw around those backlinks in forums. This is a very natural way sites may grow, and it is absolutely in accordance with what google WANTS to happen.

The only true danger I see so far is this duplicate content issue, so I take it as an important hint to make sure inadequate input will be properly redirected with a 301 to an explanatory page. The question is whether it is advisable to also redirect URIs with nonsense GET parameters (like the example I gave above) to the original page without the '?...'-sequence. Does anyone have an easy .htaccess entry at hand?
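For what it's worth, a deliberately blunt mod_rewrite sketch: 301 any request that carries a query string back to the bare URL (the trailing "?" on the substitution is what strips the query string). As written this would also break legitimate GET forms such as site search, so the condition would need narrowing to the specific nonsense parameter before real use:

```apache
RewriteEngine On
# Redirect any URI with a query string to the same path without one.
RewriteCond %{QUERY_STRING} .
RewriteRule ^(.*)$ /$1? [R=301,L]
```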

I also understand some of your concerns from bandwidth issues or security risks to googlebot going mad, and also the alarming feeling of not knowing what is going on. But I'm afraid I still did not get the real focus:

> In the context of the sites i've seen this happen on, it's an annoyance.

Why exactly? Your server has to cope with one or the other hacker-attack anyways, so what is the new quality of this kind of logfile-entries?

Please excuse my naive questions. Hopefully trying to answer them will help you get clearer about the problems involved and distinguish them from mere paranoia. (Though the latter is an important motivator to continue learning.)

theBear

3:04 pm on Mar 1, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Andy,

Consider:

Step 1: A person requests copies of the pages on your site using a homemade bot or other form of downloader.

Step 2: That person feeds those pages to an indexing system.

Step 3: The indexing system provides a key word list.

Step 4: That person feeds only Googlebot a page with noindex,nocache,follow, where that page is composed of URLs like

http://www.example.com/searchsystem.cgi?query=firstkeyword

through

http://www.example.com/searchsystem.cgi?query=lastkeyword

What happens?

Well, for one, Google doesn't seem to like orphaned pages, and if Google doesn't find links the next time it looks, your site looks like it has a boatload of orphaned pages.

In addition to the orphaned page issue it also can look like abnormal growth compared to site history and of course it changes your site's update history.

We won't even discuss what happens if several of the searches bring up almost 100% copies of the same page.
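The four steps above can be sketched in a few lines of Python (the URLs and markup are purely illustrative), which shows how little effort such a feed would take:

```python
import re

def keyword_list(pages):
    """Steps 2-3: a crude 'indexing system' -- unique words across the pages."""
    words = set()
    for html in pages:
        text = re.sub(r"<[^>]+>", " ", html)               # strip markup
        words.update(w.lower() for w in re.findall(r"[a-zA-Z]{3,}", text))
    return sorted(words)

def search_urls(words, base="http://www.example.com/searchsystem.cgi?query="):
    """Step 4: one prefilled search URL per harvested keyword."""
    return [base + w for w in words]

# Step 1 (fetching the pages with a downloader) is omitted; here the
# 'indexer' is fed a single downloaded page:
urls = search_urls(keyword_list(["<p>Blue widgets and red widgets</p>"]))
```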

theBear

3:10 pm on Mar 1, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



BTW a homemade bot isn't difficult to construct using some commonly available household products.

Indexing system likewise.

A little bit of scripting for glue and instant fun and games for some but not all.

This 68 message thread spans 3 pages: 68